11

OpenReaderWhat’s so open about OpenReader?

Is it David Rothman’s contention that every e-book should be able to be opened in more than one e-reader? Laudable as that is, without other incentives it may require divine intervention to prevent OpenReader’s becoming merely an Esperanto of e-book formats in the Tower of eBabel he jeremiads about.

And what’s so open about Jon Noring’s single-author OpenReader spec? Is it his acceptance of most every good idea anyone has proposed for e-books these last seven years? His open attitude to making others’ ideas fit in with his own firm notion of what is needed isn’t exactly the best way to create an industry-wide spec, a notion that he infuriatingly seems to be open to.

To my mind, there’s a different “open” in OpenReader worth pushing for. It’s like the “open” in open-source, and the “extensible” in XML. This open means open-ended.

In open-source, the code is available for anyone to add to its capabilities. Users frustrated by a program’s (or an OS’s) limitations are free to fix the problem; they have the code to work from so they’re not stuck. Nothing’s blocking them from improving on it. Being open-ended is why open-source came about.

XML is open-ended as well, allowing users to choose precisely the terms they need to make sense of their content (data or text). Don’t find the <p> tag enough, but need <poem>, <stanza> and <line-of-verse> to mark up your literary efforts? You can make them. And to prevent a Tower of xBabel, proper XML documents are self-documenting. Anyone who encounters your file should be able to figure out what a <poem> element is for, even if there’s no schema or dtd accompanying it.

All the e-book formats around are based on HTML or its close relative, XHTML. And as anyone who works with text knows, HTML isn’t rich enough to represent all the things we want to encode in our books — you want <poem> for your book of poetry, and I want <journal-entry> and <letter> for my parents’ book about the year they spent in Cuba in 1938.

I want to use my own markup to publish the journal of that gypsy year, not transform it down to html’s level just so it can be displayed in the e-reader. The copy people are reading should be just that: a copy of the master file, not a version with all the non-readable content stripped out. That’s like publishing a screenplay with only the dialog, and no indication of who’s talking, or where, or when scenes change or what the screen is showing.

So the OpenReader I envision doesn’t limit an OR file to a limited set of tags, the way every e-book spec in creation does. It wouldn’t even start with a set of pre-accepted tags, the way OpenReader seems destined to (yes, the ones in XHTML — what else?). Sure, adding DocBook and TEI along the way will push the limit out to something technical publishers and academics will find roomy enough. But that’s not enough.

No matter how many tags you make, there’ll always be another one you want for some book or another.

I say make OpenReader open enough to handle any tag. Forget specifying which tags are OK in advance. XHTML isn’t enough and XHTML plus DocBook and TEI isn’t enough, and really, there’s nothing you can add that will be enough for every circumstance. You just require the OpenReader binder to reference a file (CSS? DTD? XSD? RNG?) that has all that an e-reader needs to know to render whatever it should.

That’s something XML editors have been doing for at least six years. It’s not like I’m proposing something radical.

Let’s make the open in OpenReader mean open-ended.


PS: I suppose you don’t have to have the rendering file in your package. If the e-reader recognizes XHTML, DocBook or TEI vocabularies, then the binder would reference its built-in knowledge. And even with other vocabularies the e-reader is ignorant of, the package doesn’t have to carry the reference file if an extension/plug-in has been added. Publishers would include a format plug-in with e-book purchases — “Put this in your e-reader plug-ins folder, and your OpenReader-compatible e-reader will render any of our nonfiction books marked up using Historical Event ML.”

Coming up in Part II: A more radical meaning of “open”

 
11