<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: &#8216;Checking Out the Machines Behind Book Digitization&#8217;</title>
	<atom:link href="http://www.teleread.com/2006/02/25/checking-out-the-machines-behind-book-digitization/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.teleread.com/openreader/checking-out-the-machines-behind-book-digitization/</link>
	<description>News &#38; views on e-books, libraries, publishing and related topics</description>
	<lastBuildDate>Wed, 15 Feb 2012 05:35:38 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
	<item>
		<title>By: bowerbird</title>
		<link>http://www.teleread.com/openreader/checking-out-the-machines-behind-book-digitization/comment-page-1/#comment-49007</link>
		<dc:creator>bowerbird</dc:creator>
		<pubDate>Sun, 26 Feb 2006 22:54:26 +0000</pubDate>
		<guid isPermaLink="false">http://www.teleread.org/blog/?p=4360#comment-49007</guid>
		<description>&gt;   Jon and I will take it a little more seriously.

it&#039;s not important to me that you two &quot;take it seriously&quot;.
you&#039;ve got a different agenda to promote...


&gt;   Beyond the still-remaining issue
&gt;   of ability to handle complex typography, 

i&#039;ve just challenged you on that issue,
and you (wisely) declined to do battle.
so that &quot;issue&quot; is _not_ &quot;still-remaining&quot;.


&gt;   there are others such as interactivity. 
&gt;   An XMLish approach will be far, 
&gt;   far more conducive to that.

that assertion is also false, and you will
likewise decline to take a challenge on it.

so i&#039;ll proceed to issue one of those too:
name your &quot;interactivity&quot; contest, rothman,
and then show me your openreader solution.

let&#039;s see if you&#039;re willing to put some pride
on the line to back up all the hyping you do.
i&#039;m ready to do some sparring.  are you?

-bowerbird</description>
		<content:encoded><![CDATA[<p>&gt;   Jon and I will take it a little more seriously.</p>
<p>it&#8217;s not important to me that you two &#8220;take it seriously&#8221;.<br />
you&#8217;ve got a different agenda to promote&#8230;</p>
<p>&gt;   Beyond the still-remaining issue<br />
&gt;   of ability to handle complex typography, </p>
<p>i&#8217;ve just challenged you on that issue,<br />
and you (wisely) declined to do battle.<br />
so that &#8220;issue&#8221; is _not_ &#8220;still-remaining&#8221;.</p>
<p>&gt;   there are others such as interactivity.<br />
&gt;   An XMLish approach will be far,<br />
&gt;   far more conducive to that.</p>
<p>that assertion is also false, and you will<br />
likewise decline to take a challenge on it.</p>
<p>so i&#8217;ll proceed to issue one of those too:<br />
name your &#8220;interactivity&#8221; contest, rothman,<br />
and then show me your openreader solution.</p>
<p>let&#8217;s see if you&#8217;re willing to put some pride<br />
on the line to back up all the hyping you do.<br />
i&#8217;m ready to do some sparring.  are you?</p>
<p>-bowerbird</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: David Rothman</title>
		<link>http://www.teleread.com/openreader/checking-out-the-machines-behind-book-digitization/comment-page-1/#comment-49003</link>
		<dc:creator>David Rothman</dc:creator>
		<pubDate>Sun, 26 Feb 2006 20:14:04 +0000</pubDate>
		<guid isPermaLink="false">http://www.teleread.org/blog/?p=4360#comment-49003</guid>
		<description>Hey, Bowerbird, when you take your ZML into the standards mainstream (and I don&#039;t just mean the use of ASCII!), Jon and I will take it a little more seriously.  

Beyond the still-remaining issue of ability to handle complex typography, there are others such as interactivity. An XMLish approach will be far, far more conducive to that. 

OK, now you get the last word. 

Thanks,
David</description>
		<content:encoded><![CDATA[<p>Hey, Bowerbird, when you take your ZML into the standards mainstream (and I don&#8217;t just mean the use of ASCII!), Jon and I will take it a little more seriously.  </p>
<p>Beyond the still-remaining issue of ability to handle complex typography, there are others such as interactivity. An XMLish approach will be far, far more conducive to that. </p>
<p>OK, now you get the last word. </p>
<p>Thanks,<br />
David</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: bowerbird</title>
		<link>http://www.teleread.com/openreader/checking-out-the-machines-behind-book-digitization/comment-page-1/#comment-49001</link>
		<dc:creator>bowerbird</dc:creator>
		<pubDate>Sun, 26 Feb 2006 19:37:26 +0000</pubDate>
		<guid isPermaLink="false">http://www.teleread.org/blog/?p=4360#comment-49001</guid>
		<description>oh well, since i&#039;ve already broken my
new year&#039;s resolution for 2006 by
commenting here in the first place
-- pretty good, almost 2 months! --
might as well finish this thread right.

you say that, &quot;as usual&quot;, i&#039;m thinking
too simplistically.  after i remark that
it&#039;s unwise for you to underestimate
your rivals, i&#039;ll accept the &quot;challenge&quot;
implicit in your ad hominem charge...

let&#039;s see if i can handle difficult books!

scan 3 books that you think would be
&quot;too hard&quot; for my zen markup system
to handle, and furnish me the scans
and the o.c.r., and i&#039;ll show you that
i am able to handle the books nicely...
(at a bare minimum, as well as paper.)

i predict i&#039;ll be able to handle at least 2.
if i _can&#039;t_ handle a book, then you can
prove the superiority of your system by
showing you _can_ handle that book...

that&#039;s the &quot;difficult&quot; test.  of interest too
is the &quot;average&quot; test -- i.e, how well i can
handle the range of &quot;average&quot; books...

so next, give me 10 numbers between
1 and 18,000.  i will then add a constant
to those numbers, and the new numbers
will be the 10 e-texts from the p.g. library
that i will process using my zen markup.
i expect i&#039;ll be able to handle at least 9...

again here, if any of these books should
prove &quot;too difficult&quot; for z.m.l. to handle,
i&#039;ll look forward to how well openreader
handles those books...

i think my z.m.l. can give better benefits
_and_ lower costs than your openreader,
and that people will then see that it is a
no-brainer as to which they will choose.

why would people voluntarily take on the
burden of doing heavy markup when they
can get better e-books without it?  why?

so, how about it, rothman?  are you up for
a little head-to-head performance rumble?
a _battle_ between z.m.l. and openreader?

or will you recognize that it would be
_far_ too devasting for your precious
openreader to lose, so you chicken out?

-bowerbird</description>
		<content:encoded><![CDATA[<p>oh well, since i&#8217;ve already broken my<br />
new year&#8217;s resolution for 2006 by<br />
commenting here in the first place<br />
&#8211; pretty good, almost 2 months! &#8211;<br />
might as well finish this thread right.</p>
<p>you say that, &#8220;as usual&#8221;, i&#8217;m thinking<br />
too simplistically.  after i remark that<br />
it&#8217;s unwise for you to underestimate<br />
your rivals, i&#8217;ll accept the &#8220;challenge&#8221;<br />
implicit in your ad hominem charge&#8230;</p>
<p>let&#8217;s see if i can handle difficult books!</p>
<p>scan 3 books that you think would be<br />
&#8220;too hard&#8221; for my zen markup system<br />
to handle, and furnish me the scans<br />
and the o.c.r., and i&#8217;ll show you that<br />
i am able to handle the books nicely&#8230;<br />
(at a bare minimum, as well as paper.)</p>
<p>i predict i&#8217;ll be able to handle at least 2.<br />
if i _can&#8217;t_ handle a book, then you can<br />
prove the superiority of your system by<br />
showing you _can_ handle that book&#8230;</p>
<p>that&#8217;s the &#8220;difficult&#8221; test.  of interest too<br />
is the &#8220;average&#8221; test &#8212; i.e, how well i can<br />
handle the range of &#8220;average&#8221; books&#8230;</p>
<p>so next, give me 10 numbers between<br />
1 and 18,000.  i will then add a constant<br />
to those numbers, and the new numbers<br />
will be the 10 e-texts from the p.g. library<br />
that i will process using my zen markup.<br />
i expect i&#8217;ll be able to handle at least 9&#8230;</p>
<p>again here, if any of these books should<br />
prove &#8220;too difficult&#8221; for z.m.l. to handle,<br />
i&#8217;ll look forward to how well openreader<br />
handles those books&#8230;</p>
<p>i think my z.m.l. can give better benefits<br />
_and_ lower costs than your openreader,<br />
and that people will then see that it is a<br />
no-brainer as to which they will choose.</p>
<p>why would people voluntarily take on the<br />
burden of doing heavy markup when they<br />
can get better e-books without it?  why?</p>
<p>so, how about it, rothman?  are you up for<br />
a little head-to-head performance rumble?<br />
a _battle_ between z.m.l. and openreader?</p>
<p>or will you recognize that it would be<br />
_far_ too devasting for your precious<br />
openreader to lose, so you chicken out?</p>
<p>-bowerbird</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: bowerbird</title>
		<link>http://www.teleread.com/openreader/checking-out-the-machines-behind-book-digitization/comment-page-1/#comment-48942</link>
		<dc:creator>bowerbird</dc:creator>
		<pubDate>Sun, 26 Feb 2006 04:51:31 +0000</pubDate>
		<guid isPermaLink="false">http://www.teleread.org/blog/?p=4360#comment-48942</guid>
		<description>no thanks...       :+)

-bowerbird</description>
		<content:encoded><![CDATA[<p>no thanks&#8230;       :+)</p>
<p>-bowerbird</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: David Rothman</title>
		<link>http://www.teleread.com/openreader/checking-out-the-machines-behind-book-digitization/comment-page-1/#comment-48939</link>
		<dc:creator>David Rothman</dc:creator>
		<pubDate>Sun, 26 Feb 2006 04:16:26 +0000</pubDate>
		<guid isPermaLink="false">http://www.teleread.org/blog/?p=4360#comment-48939</guid>
		<description>As usual, Bowerbird, you&#039;re thinking of the entire publishing world as having easy requirements. Beyond that, the right creation tools should make it a cinch even for small publishers to do OpenReader format. Other issues arise such as the language ones, the needs of the disabled, etc. The right viewer just isn&#039;t enough. This is an eternal debate, and I know you&#039;ll have a reply, so go ahead and have the last word. - David</description>
		<content:encoded><![CDATA[<p>As usual, Bowerbird, you&#8217;re thinking of the entire publishing world as having easy requirements. Beyond that, the right creation tools should make it a cinch even for small publishers to do OpenReader format. Other issues arise such as the language ones, the needs of the disabled, etc. The right viewer just isn&#8217;t enough. This is an eternal debate, and I know you&#8217;ll have a reply, so go ahead and have the last word. &#8211; David</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: bowerbird</title>
		<link>http://www.teleread.com/openreader/checking-out-the-machines-behind-book-digitization/comment-page-1/#comment-48936</link>
		<dc:creator>bowerbird</dc:creator>
		<pubDate>Sun, 26 Feb 2006 00:40:20 +0000</pubDate>
		<guid isPermaLink="false">http://www.teleread.org/blog/?p=4360#comment-48936</guid>
		<description>&gt;   As for .txt vs. an OpenReader-stype approach, 
&gt;   remember that OpenReader would allow standards 
&gt;   AND good presentation. I want to see real italics,
&gt;   not *this*.

you &#039;re confusing the file-format with the viewer-program.

html uses angle-brackets around an &quot;i&quot; for italics in the file, 
and the viewer-program (your browser) does the formatting.

likewise, a viewer-program for a p.g.-style text-file could
easily change _this_ to its italicized form.  (you used
asterisks, but asterisks more commonly mean *bold*
in plain-text markups like these.)

all in all, it&#039;s a lot easier to have our _tools_ do this,
and not require humans to do &quot;markup&quot;.  for instance,
here is a link to the project gutenberg website:

http://www.gutenberg.org

i didn&#039;t do any of the clumsy &quot;a href&quot; stuff, i just
typed in the u.r.l., and your wordpress software
was smart enough to recognize it as a link, and
do the formatting for me.  (at least it used to be;
i don&#039;t know if this new version does that or not...)

but that&#039;s what we need, software smart like that...

you&#039;ll probably have some &quot;yeah but&quot; response,
because this kind of simplicity threatens your
philosophical mindset.  i won&#039;t bother to reply,
because you can&#039;t convince someone who will
refuse to be convinced, but the movement to
plain-text writing with automatic markup is
fairly well unstoppable now.  you keep calling
for &quot;one standard format&quot;, but do not seem to
recognize that we&#039;ve had it all along -- plain ascii.

-bowerbird</description>
		<content:encoded><![CDATA[<p>&gt;   As for .txt vs. an OpenReader-stype approach,<br />
&gt;   remember that OpenReader would allow standards<br />
&gt;   AND good presentation. I want to see real italics,<br />
&gt;   not *this*.</p>
<p>you &#8216;re confusing the file-format with the viewer-program.</p>
<p>html uses angle-brackets around an &#8220;i&#8221; for italics in the file,<br />
and the viewer-program (your browser) does the formatting.</p>
<p>likewise, a viewer-program for a p.g.-style text-file could<br />
easily change _this_ to its italicized form.  (you used<br />
asterisks, but asterisks more commonly mean *bold*<br />
in plain-text markups like these.)</p>
<p>all in all, it&#8217;s a lot easier to have our _tools_ do this,<br />
and not require humans to do &#8220;markup&#8221;.  for instance,<br />
here is a link to the project gutenberg website:</p>
<p><a href="http://www.gutenberg.org" rel="nofollow">http://www.gutenberg.org</a></p>
<p>i didn&#8217;t do any of the clumsy &#8220;a href&#8221; stuff, i just<br />
typed in the u.r.l., and your wordpress software<br />
was smart enough to recognize it as a link, and<br />
do the formatting for me.  (at least it used to be;<br />
i don&#8217;t know if this new version does that or not&#8230;)</p>
<p>but that&#8217;s what we need, software smart like that&#8230;</p>
<p>you&#8217;ll probably have some &#8220;yeah but&#8221; response,<br />
because this kind of simplicity threatens your<br />
philosophical mindset.  i won&#8217;t bother to reply,<br />
because you can&#8217;t convince someone who will<br />
refuse to be convinced, but the movement to<br />
plain-text writing with automatic markup is<br />
fairly well unstoppable now.  you keep calling<br />
for &#8220;one standard format&#8221;, but do not seem to<br />
recognize that we&#8217;ve had it all along &#8212; plain ascii.</p>
<p>-bowerbird</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: David Rothman</title>
		<link>http://www.teleread.com/openreader/checking-out-the-machines-behind-book-digitization/comment-page-1/#comment-48933</link>
		<dc:creator>David Rothman</dc:creator>
		<pubDate>Sat, 25 Feb 2006 16:01:08 +0000</pubDate>
		<guid isPermaLink="false">http://www.teleread.org/blog/?p=4360#comment-48933</guid>
		<description>Hey, we&#039;re talking about two very different kinds of &quot;proprietary&quot; (I tweaked the post to be sure that was absolutely clear), but sometimes the same mindset can ishow up in both cases. 

As for proprietary vs. nonproprietary scanning systems, keep in mind the inherent advantages of avoiding a Not Invented Here approach. More input from outsiders--both vendors and librarians in this case. 

Even if the book scanning machines came from private companies and didn&#039;t represent true standards, that might still be preferable to NIH.

As for .txt vs. an OpenReader-stype approach, remember that OpenReader would allow standards AND good presentation. I want to see real &lt;em&gt;italics&lt;/em&gt;, not *this*.

Thanks,
David</description>
		<content:encoded><![CDATA[<p>Hey, we&#8217;re talking about two very different kinds of &#8220;proprietary&#8221; (I tweaked the post to be sure that was absolutely clear), but sometimes the same mindset can ishow up in both cases. </p>
<p>As for proprietary vs. nonproprietary scanning systems, keep in mind the inherent advantages of avoiding a Not Invented Here approach. More input from outsiders&#8211;both vendors and librarians in this case. </p>
<p>Even if the book scanning machines came from private companies and didn&#8217;t represent true standards, that might still be preferable to NIH.</p>
<p>As for .txt vs. an OpenReader-stype approach, remember that OpenReader would allow standards AND good presentation. I want to see real <em>italics</em>, not *this*.</p>
<p>Thanks,<br />
David</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: ryanramseyer</title>
		<link>http://www.teleread.com/openreader/checking-out-the-machines-behind-book-digitization/comment-page-1/#comment-48932</link>
		<dc:creator>ryanramseyer</dc:creator>
		<pubDate>Sat, 25 Feb 2006 15:39:38 +0000</pubDate>
		<guid isPermaLink="false">http://www.teleread.org/blog/?p=4360#comment-48932</guid>
		<description>&quot;I can somewhat understand the use of proprietary scanners--Google needs something to distinguish it from the competition. Proprietary e-book standards I would not brook, however, given the damage to the flow of knowledge. As a very small shareholder in the company, I hope Google will “do no evil.” If Google is using its own proprietary approach for scanning systems, are library projects suffering?&quot;

I&#039;m not sure I follow your train of thought.  1)  Why would Google&#039;s proprietary scanners distinguish them from the competition if the user never sees the scanner? (Do you mean that their scanners are just better-faster, and so they can do more scanning, and that this distinguishes them?)  2) Why would the proprietary &quot;scanning system&quot; have anything to do with the e-book standard they choose to employ?

Of course I agree that they should use an open standard.  (I&#039;d prefer a clean .txt file like PG that I could do what I want with.)  I don&#039;t,  however, see the connection to the proprietary hardware they use to scan.</description>
		<content:encoded><![CDATA[<p>&#8220;I can somewhat understand the use of proprietary scanners&#8211;Google needs something to distinguish it from the competition. Proprietary e-book standards I would not brook, however, given the damage to the flow of knowledge. As a very small shareholder in the company, I hope Google will “do no evil.” If Google is using its own proprietary approach for scanning systems, are library projects suffering?&#8221;</p>
<p>I&#8217;m not sure I follow your train of thought.  1)  Why would Google&#8217;s proprietary scanners distinguish them from the competition if the user never sees the scanner? (Do you mean that their scanners are just better-faster, and so they can do more scanning, and that this distinguishes them?)  2) Why would the proprietary &#8220;scanning system&#8221; have anything to do with the e-book standard they choose to employ?</p>
<p>Of course I agree that they should use an open standard.  (I&#8217;d prefer a clean .txt file like PG that I could do what I want with.)  I don&#8217;t,  however, see the connection to the proprietary hardware they use to scan.</p>
]]></content:encoded>
	</item>
</channel>
</rss>

<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Page Caching using disk: enhanced
Database Caching using disk: basic
Object Caching 412/436 objects using disk: basic

Served from: www.teleread.com @ 2012-02-15 02:22:08 -->
