7

John RuskinAs much as I enjoy manybooks.net (especially the recent reviews and the ability to create custom PDF and custom html), two things about the site are starting to get on my nerves.

When you download ebooks from manybooks.net in mobipocket format (and presumably other baked formats), you are downloading formats based on text files, not HTML files. Why is this inadequate? It means that the table of contents lack the sorts of links we come to expect. Compare the manybooks.net version of John Ruskin’s On the Old Road with the PG html version. The PG version not only has hyperlinks on the table of contents, but also footnotes. (In the zipped version of other ebooks, there are even illustrations).

This may not be as big a deal for novels (which are read in a basically linear fashion). But with other books (like poetry and essay collections), it’s more natural to want to skip around, to return to the Table of Contents periodically to plan what you will read next.

I am finding that instead of using manybooks.net version I am first checking PG to see if they have HTML versions of the text I am seeking (and then to use mobipocket on my desktop to do the conversions).

The other issue is prefatory legal material. Has anyone noticed that the legalese crud introducing many PG works are really long? The Wizard of Oz etext contains a legal preface of about 1500 words (and that appears at the top of the txt version, which means that the manybooks.net version means you have to page through a good number pages before you begin. This etext is perhaps a special case; it was one of the earliest texts scanned and produced by PG. (Later texts such as this one put the legalese at the end, although it now numbers 2900 words)

Having a preface at the top of an ebook would not be such a big deal if it were only a hyperlink in a Table of Contents which the reader could bypass. That brings us once again to the question of Manybooks.net and the fact it does not use hyperlinked versions of many texts.

I don’t want to sound too harsh on manybooks.net. After all, Project Gutenberg has a team of proofreaders and formatters spanning several continents; later versions of PG texts have much better layout, so it’s hardly surprising that PG should be winning the formatting wars. Also, manybooks is a one-man job (and pretty much a fine one at that–has anyone given manybooks.net a donation lately?)

But as ebook readership increases and PG produces more sophisticated layout for their etexts, readers are going to expect better formatting on ebooks (illustrations, footnotes and other things). Given Project Gutenberg’s existing infrastructure and its ability to do simple ebook conversions already (such as with Plucker), will PG turn out to be the best source for well-laid out public domain ebooks?

Robert Nagle (aka idiotprogrammer) will have his first new artistic video, John Ruskin: An Introduction, appear online this summer.

 
7