<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: &#8216;Digital Text Masters&#8217; (Digitizing the classic public domain books)</title>
	<atom:link href="http://www.teleread.com/2007/02/13/digital-text-masters-digitizing-the-classic-public-domain-books/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.teleread.com/ebooks/digital-text-masters-digitizing-the-classic-public-domain-books/</link>
	<description>News &#38; views on e-books, libraries, publishing and related topics</description>
	<lastBuildDate>Wed, 15 Feb 2012 08:05:45 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
	<item>
		<title>By: MattocG</title>
		<link>http://www.teleread.com/ebooks/digital-text-masters-digitizing-the-classic-public-domain-books/comment-page-1/#comment-1199923</link>
		<dc:creator>MattocG</dc:creator>
		<pubDate>Thu, 20 Jan 2011 13:07:56 +0000</pubDate>
		<guid isPermaLink="false">http://www.teleread.org/blog/?p=6174#comment-1199923</guid>
		<description>I have to admire your dedication and enthusiam that you and the few others give to the  works of the English language transcripts.</description>
		<content:encoded><![CDATA[<p>I have to admire your dedication and enthusiam that you and the few others give to the  works of the English language transcripts.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: pearl scanning</title>
		<link>http://www.teleread.com/ebooks/digital-text-masters-digitizing-the-classic-public-domain-books/comment-page-1/#comment-1150471</link>
		<dc:creator>pearl scanning</dc:creator>
		<pubDate>Mon, 07 Dec 2009 14:45:04 +0000</pubDate>
		<guid isPermaLink="false">http://www.teleread.org/blog/?p=6174#comment-1150471</guid>
		<description>I believe Google have been using book scanners which read the distance of the pages using infra red 3D scanners, including the curve of the pages. So that when scanned they appear as flat images with little or no black depth marks on that often comes with book scanning. We usually carry out scanning using both ways. But the fastest way is always to slice the book and feed scan the pages if you are able to.

http://www.pearl-repro.co.uk 
http://www.4document-scanning.co.uk 
http://www.forms-data-capture.co.uk 
http://www.microfiche-microfilm-scanning.co.uk</description>
		<content:encoded><![CDATA[<p>I believe Google have been using book scanners which read the distance of the pages using infra red 3D scanners, including the curve of the pages. So that when scanned they appear as flat images with little or no black depth marks on that often comes with book scanning. We usually carry out scanning using both ways. But the fastest way is always to slice the book and feed scan the pages if you are able to.</p>
<p><a href="http://www.pearl-repro.co.uk" rel="nofollow">http://www.pearl-repro.co.uk</a><br />
<a href="http://www.4document-scanning.co.uk" rel="nofollow">http://www.4document-scanning.co.uk</a><br />
<a href="http://www.forms-data-capture.co.uk" rel="nofollow">http://www.forms-data-capture.co.uk</a><br />
<a href="http://www.microfiche-microfilm-scanning.co.uk" rel="nofollow">http://www.microfiche-microfilm-scanning.co.uk</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: The ePub Books Project - Part 1: An Introduction &#124; ePub Books - Information &#38; Resources on the IDPF ePub Standard</title>
		<link>http://www.teleread.com/ebooks/digital-text-masters-digitizing-the-classic-public-domain-books/comment-page-1/#comment-906099</link>
		<dc:creator>The ePub Books Project - Part 1: An Introduction &#124; ePub Books - Information &#38; Resources on the IDPF ePub Standard</dc:creator>
		<pubDate>Mon, 15 Sep 2008 15:15:26 +0000</pubDate>
		<guid isPermaLink="false">http://www.teleread.org/blog/?p=6174#comment-906099</guid>
		<description>[...] where I could discuss my experiences as I learn to produce .epub formatted documents from a ‘Master Format’, and with the hope that others may also find the information [...]</description>
		<content:encoded><![CDATA[<p>[...] where I could discuss my experiences as I learn to produce .epub formatted documents from a ‘Master Format’, and with the hope that others may also find the information [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Project Gutenberg News &#187; Digital Text Masters: A Future for Public Domain eBooks?</title>
		<link>http://www.teleread.com/ebooks/digital-text-masters-digitizing-the-classic-public-domain-books/comment-page-1/#comment-501057</link>
		<dc:creator>Project Gutenberg News &#187; Digital Text Masters: A Future for Public Domain eBooks?</dc:creator>
		<pubDate>Sun, 19 Aug 2007 22:22:30 +0000</pubDate>
		<guid isPermaLink="false">http://www.teleread.org/blog/?p=6174#comment-501057</guid>
		<description>[...] Several interesting comments were posted about this article at TeleRead. If you wish to read those or post your own comment, you can do so by clicking the link: &#8216;Digital Text Masters&#8217; (Digitizing the classic public domain books). [...]</description>
		<content:encoded><![CDATA[<p>[...] Several interesting comments were posted about this article at TeleRead. If you wish to read those or post your own comment, you can do so by clicking the link: &#8216;Digital Text Masters&#8217; (Digitizing the classic public domain books). [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jon Noring</title>
		<link>http://www.teleread.com/ebooks/digital-text-masters-digitizing-the-classic-public-domain-books/comment-page-1/#comment-225959</link>
		<dc:creator>Jon Noring</dc:creator>
		<pubDate>Thu, 15 Feb 2007 16:50:54 +0000</pubDate>
		<guid isPermaLink="false">http://www.teleread.org/blog/?p=6174#comment-225959</guid>
		<description>Thanks, Jeroen, for bringing up some important points.

I agree that among the huge corpus of &quot;non-canon&quot; public domain books (in whatever language), there are a significant number which are truly gems that people should know about, and should warrant special digital mastering. They should be added to the &quot;canon.&quot;

Two further comments on this point:

&lt;ol&gt;

&lt;li&gt;&lt;p&gt;The proposed &quot;Digital Text Masters&quot; will not etch in stone the &quot;canon&quot;, but rather will be quite flexible as to what works it encompasses. The emphasis, though, especially in the early years, will be on those public domain books oft-used in education (both K-12 and post-secondary) and regarded by experts and lay enthusiasts to represent the best or the most influential of the public domain.&lt;/p&gt;

&lt;p&gt;If someone believes a particular book should be added to the DTM collection, when it might otherwise not be considered, they may make their case. If they are willing to share in some of the costs (if any) and human effort of the digital mastering, they may get the go-ahead to include that book in the DTM collection. The details of the whole selection process still need to be shaken out, and will probably evolve over time.&lt;/p&gt;

&lt;/li&gt;

&lt;li&gt;&lt;p&gt;Once DTM is established and most of the bugs shaken out of it, DTM can certainly diversify into other books and periodicals that make sense to digitize to DTM quality and for inclusion in the DTM database/archive. It&#039;s hard to know the future, really, but we should definitely be ready to diversify. I have some ideas, but they are still nascent so I won&#039;t describe them here.&lt;/p&gt;&lt;/li&gt;

&lt;/ol&gt;

Finally, DTM is not intended to replace Distributed Proofreaders, which we fully support in its task to digitize the large number of public domain books without regard to &quot;canon.&quot; In fact, I see some sort of working relationship develop between the two organizations (even if DTM proofing ends up not using DP or its system), and with PG which appears to be evolving to a general text archive from many sources.

For example, since DTM is intended to be a formal, more heavily-funded organization (while DP will necessarily always be more of a grassroots, volunteer-driven group even though it does have 501c3 status), I see DTM advancing the various technologies which could be shared with DP and with PG. So in some respects DTM might become the &quot;technology development&quot; arm of the large, multi-organizational community to digitize the public domain texts.

Something to think about, at least. If DTM becomes as successful at generating sustaining revenue as I believe it can, I envision DTM donating funds to DP and PG, preferably in a matching donation sense in order to spur others to donate to those organizations.

Well, I am getting well ahead of myself, since we haven&#039;t yet organized, have no funds, nor produced anything! But I wanted to share how I see DTM relate to DP and PG, and of course to share some long-term goals should DTM get launched.

If nothing else, this exercise provides one vision of the future of the effort to digitize public domain texts, and it will add to the &quot;public domain idea database&quot; that others may draw from.

A related point I think is important to mention which I haven&#039;t yet:

I observe that the public mindshare these days is on the scanning of public domain texts (which is great to see happen!) The downside of this is that in various quarters this quite public focus on scanning is hiding the advantages to users of having structured digital texts. Even though we know the advantages, we are so far not getting the message out &#8212; many see having scanned images plus raw, unproofed OCR text as more than sufficient to use these texts.

I see DTM as helping to get the message out that there&#039;s significant benefit to society to create proofed and structured digital texts of many if not all the public domain works. Focusing on the great classics, though they be few in number compared to the entire corpus of public domain texts, helps in explaining the benefits of structured digital texts. It&#039;s a little tougher when the texts being showcased are really obscure, arcane, and in some cases truly bizarre works. But talk about Mark Twain, or the Brontë sisters, and everyone recognizes them.</description>
		<content:encoded><![CDATA[<p>Thanks, Jeroen, for bringing up some important points.</p>
<p>I agree that among the huge corpus of &#8220;non-canon&#8221; public domain books (in whatever language), there are a significant number which are truly gems that people should know about, and should warrant special digital mastering. They should be added to the &#8220;canon.&#8221;</p>
<p>Two further comments on this point:</p>
<ol>
<li>
<p>The proposed &#8220;Digital Text Masters&#8221; will not etch in stone the &#8220;canon&#8221;, but rather will be quite flexible as to what works it encompasses. The emphasis, though, especially in the early years, will be on those public domain books oft-used in education (both K-12 and post-secondary) and regarded by experts and lay enthusiasts to represent the best or the most influential of the public domain.</p>
<p>If someone believes a particular book should be added to the DTM collection, when it might otherwise not be considered, they may make their case. If they are willing to share in some of the costs (if any) and human effort of the digital mastering, they may get the go-ahead to include that book in the DTM collection. The details of the whole selection process still need to be shaken out, and will probably evolve over time.</p>
</li>
<li>
<p>Once DTM is established and most of the bugs shaken out of it, DTM can certainly diversify into other books and periodicals that make sense to digitize to DTM quality and for inclusion in the DTM database/archive. It&#8217;s hard to know the future, really, but we should definitely be ready to diversify. I have some ideas, but they are still nascent so I won&#8217;t describe them here.</p>
</li>
</ol>
<p>Finally, DTM is not intended to replace Distributed Proofreaders, which we fully support in its task to digitize the large number of public domain books without regard to &#8220;canon.&#8221; In fact, I see some sort of working relationship develop between the two organizations (even if DTM proofing ends up not using DP or its system), and with PG which appears to be evolving to a general text archive from many sources.</p>
<p>For example, since DTM is intended to be a formal, more heavily-funded organization (while DP will necessarily always be more of a grassroots, volunteer-driven group even though it does have 501c3 status), I see DTM advancing the various technologies which could be shared with DP and with PG. So in some respects DTM might become the &#8220;technology development&#8221; arm of the large, multi-organizational community to digitize the public domain texts.</p>
<p>Something to think about, at least. If DTM becomes as successful at generating sustaining revenue as I believe it can, I envision DTM donating funds to DP and PG, preferably in a matching donation sense in order to spur others to donate to those organizations.</p>
<p>Well, I am getting well ahead of myself, since we haven&#8217;t yet organized, have no funds, nor produced anything! But I wanted to share how I see DTM relate to DP and PG, and of course to share some long-term goals should DTM get launched.</p>
<p>If nothing else, this exercise provides one vision of the future of the effort to digitize public domain texts, and it will add to the &#8220;public domain idea database&#8221; that others may draw from.</p>
<p>A related point I think is important to mention which I haven&#8217;t yet:</p>
<p>I observe that the public mindshare these days is on the scanning of public domain texts (which is great to see happen!) The downside of this is that in various quarters this quite public focus on scanning is hiding the advantages to users of having structured digital texts. Even though we know the advantages, we are so far not getting the message out &mdash; many see having scanned images plus raw, unproofed OCR text as more than sufficient to use these texts.</p>
<p>I see DTM as helping to get the message out that there&#8217;s significant benefit to society to create proofed and structured digital texts of many if not all the public domain works. Focusing on the great classics, though they be few in number compared to the entire corpus of public domain texts, helps in explaining the benefits of structured digital texts. It&#8217;s a little tougher when the texts being showcased are really obscure, arcane, and in some cases truly bizarre works. But talk about Mark Twain, or the Brontë sisters, and everyone recognizes them.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jeroen Hellingman</title>
		<link>http://www.teleread.com/ebooks/digital-text-masters-digitizing-the-classic-public-domain-books/comment-page-1/#comment-225805</link>
		<dc:creator>Jeroen Hellingman</dc:creator>
		<pubDate>Thu, 15 Feb 2007 08:37:16 +0000</pubDate>
		<guid isPermaLink="false">http://www.teleread.org/blog/?p=6174#comment-225805</guid>
		<description>Project Gutenberg and DP quality standards are improving, and so are the tools and procedures in use, and although I believe there will be a market for re-done, certified very high-quality versions of what some may consider the canon of English literature, I also think that a lot of other works warrant attention, in particular those works with considerable added value, such as reference works, or old magazine runs, which are often much more difficult to get access to. The canon of English literature is relative easy to get hold of, but if you could scan and process a set of early magazines, you would add materials that or currently difficult to access.

Projects like Google Books are changing the playground as well, a tremendous number of books are now becoming available, which means our focus can move from scanning to harvesting and adding value (by proofreading and propper tagging) to what is already available.

Another point where PG is gradually improving is metadata. We now have a working catalog. It would also be very nice to build a working &#039;reading room&#039; application around the correction, where people can read, annotate works, and share these annotations with friends or the community.

For languages other than English, we are where PG was 10 to 20 years ago. For both Dutch and Philippine related works, where I have been very active to grow the collection, we are still not through the canon of literature. That is, if such a thing can be demarcated, I have my reservations on an elitist view of literature, and would just as happy an obscure penny novel. Sometimes, these things can be hidden gems, and sometimes they have rightly been delegated to the realm of obscurity. Especially in developing countries, like the Philippines, works have become very difficult to obtain, due to low printing volumes, low quality paper, two devastating wars, and adverse climate conditions. Similarly, Dutch language literature from what is now Indonesia is extremely difficult to obtain, as they were often printed in low volumes on cheap paper, and very few copies have ever reached Europe.

The current long copyright terms are also very harmful for the promotion of literature. In the time-span between commercial non-profitability and entering the public domain, where they are again free to build upon, works get so far out-dated and out-of-touch with living culture, that they have effectively died... One of PG&#039;s purposes, in my opinion, is reviving such works from oblivion.</description>
		<content:encoded><![CDATA[<p>Project Gutenberg and DP quality standards are improving, and so are the tools and procedures in use, and although I believe there will be a market for re-done, certified very high-quality versions of what some may consider the canon of English literature, I also think that a lot of other works warrant attention, in particular those works with considerable added value, such as reference works, or old magazine runs, which are often much more difficult to get access to. The canon of English literature is relative easy to get hold of, but if you could scan and process a set of early magazines, you would add materials that or currently difficult to access.</p>
<p>Projects like Google Books are changing the playground as well, a tremendous number of books are now becoming available, which means our focus can move from scanning to harvesting and adding value (by proofreading and propper tagging) to what is already available.</p>
<p>Another point where PG is gradually improving is metadata. We now have a working catalog. It would also be very nice to build a working &#8216;reading room&#8217; application around the correction, where people can read, annotate works, and share these annotations with friends or the community.</p>
<p>For languages other than English, we are where PG was 10 to 20 years ago. For both Dutch and Philippine related works, where I have been very active to grow the collection, we are still not through the canon of literature. That is, if such a thing can be demarcated, I have my reservations on an elitist view of literature, and would just as happy an obscure penny novel. Sometimes, these things can be hidden gems, and sometimes they have rightly been delegated to the realm of obscurity. Especially in developing countries, like the Philippines, works have become very difficult to obtain, due to low printing volumes, low quality paper, two devastating wars, and adverse climate conditions. Similarly, Dutch language literature from what is now Indonesia is extremely difficult to obtain, as they were often printed in low volumes on cheap paper, and very few copies have ever reached Europe.</p>
<p>The current long copyright terms are also very harmful for the promotion of literature. In the time-span between commercial non-profitability and entering the public domain, where they are again free to build upon, works get so far out-dated and out-of-touch with living culture, that they have effectively died&#8230; One of PG&#8217;s purposes, in my opinion, is reviving such works from oblivion.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Michael Lockey (Vasa)</title>
		<link>http://www.teleread.com/ebooks/digital-text-masters-digitizing-the-classic-public-domain-books/comment-page-1/#comment-225111</link>
		<dc:creator>Michael Lockey (Vasa)</dc:creator>
		<pubDate>Wed, 14 Feb 2007 00:27:32 +0000</pubDate>
		<guid isPermaLink="false">http://www.teleread.org/blog/?p=6174#comment-225111</guid>
		<description>First of all, I am no part of any cabel; just another foot soldier at DP. By and large, this whole argument seems rather futile: was something done &quot;wrong&quot; by anyone?  New tools and methodologies evolve; and that which was available  years ago, now is antiquated.  It does NOT mean that work is wasted: if nothing else, it served as the inspiration to demand better. How did the first, poor, translations of Gilgamesh make it into print?  OTOH, should those first translators feel their work has been discounted?  Egos, egos.

We throw words like &#039;bowlderized&#039; around as insults- but has anyone ever read his introduction?  Poor Tom did his best, and is treated as the worst.  We each fall somewhat shy of others expectations.

Right now, I&#039;m trying to get my head around the Dublin Protocols which, I feel, would solve a lot of these issues.  Shall we all, then, stop work on everything, until we know we&#039;re working on the ultimate edition?

At least book burning is easier in this day:

pip *.*/del

Come on: we&#039;re supposed to be the good guys!  There&#039;s lots of real enemies.  We don&#039;t have to emulate a Canadian Prime Minister, who described his party as like the early settlers who would find themselves attacked by Indians, circle their wagons, and start firing inwards...</description>
		<content:encoded><![CDATA[<p>First of all, I am no part of any cabel; just another foot soldier at DP. By and large, this whole argument seems rather futile: was something done &#8220;wrong&#8221; by anyone?  New tools and methodologies evolve; and that which was available  years ago, now is antiquated.  It does NOT mean that work is wasted: if nothing else, it served as the inspiration to demand better. How did the first, poor, translations of Gilgamesh make it into print?  OTOH, should those first translators feel their work has been discounted?  Egos, egos.</p>
<p>We throw words like &#8216;bowlderized&#8217; around as insults- but has anyone ever read his introduction?  Poor Tom did his best, and is treated as the worst.  We each fall somewhat shy of others expectations.</p>
<p>Right now, I&#8217;m trying to get my head around the Dublin Protocols which, I feel, would solve a lot of these issues.  Shall we all, then, stop work on everything, until we know we&#8217;re working on the ultimate edition?</p>
<p>At least book burning is easier in this day:</p>
<p>pip *.*/del</p>
<p>Come on: we&#8217;re supposed to be the good guys!  There&#8217;s lots of real enemies.  We don&#8217;t have to emulate a Canadian Prime Minister, who described his party as like the early settlers who would find themselves attacked by Indians, circle their wagons, and start firing inwards&#8230;</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jon Noring</title>
		<link>http://www.teleread.com/ebooks/digital-text-masters-digitizing-the-classic-public-domain-books/comment-page-1/#comment-224983</link>
		<dc:creator>Jon Noring</dc:creator>
		<pubDate>Tue, 13 Feb 2007 20:32:53 +0000</pubDate>
		<guid isPermaLink="false">http://www.teleread.org/blog/?p=6174#comment-224983</guid>
		<description>Thanks, Karen, for clarifying.

We&#039;ll see how this unfolds. Already I&#039;ve gotten a couple private messages from &quot;high placed people&quot; with &quot;well-known organizations&quot;. We&#039;ll see if there&#039;s any legs to these and other inquiries.

No matter what happens, I think the idea of &quot;digital text masters&quot;, no matter how it is implemented, is striking a positive chord with a lot of people. I believe a lot of people do care about the faithfulness and provenance of the public domain texts they read, especially when it is brought to their attention. The typical reaction appears to be &#8220;I hadn&#039;t thought about it before, but yeah, maybe I should be more concerned.&#8221;</description>
		<content:encoded><![CDATA[<p>Thanks, Karen, for clarifying.</p>
<p>We&#8217;ll see how this unfolds. Already I&#8217;ve gotten a couple private messages from &#8220;high placed people&#8221; with &#8220;well-known organizations&#8221;. We&#8217;ll see if there&#8217;s any legs to these and other inquiries.</p>
<p>No matter what happens, I think the idea of &#8220;digital text masters&#8221;, no matter how it is implemented, is striking a positive chord with a lot of people. I believe a lot of people do care about the faithfulness and provenance of the public domain texts they read, especially when it is brought to their attention. The typical reaction appears to be &ldquo;I hadn&#8217;t thought about it before, but yeah, maybe I should be more concerned.&rdquo;</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Karen Lofstrom</title>
		<link>http://www.teleread.com/ebooks/digital-text-masters-digitizing-the-classic-public-domain-books/comment-page-1/#comment-224921</link>
		<dc:creator>Karen Lofstrom</dc:creator>
		<pubDate>Tue, 13 Feb 2007 18:56:02 +0000</pubDate>
		<guid isPermaLink="false">http://www.teleread.org/blog/?p=6174#comment-224921</guid>
		<description>It&#039;s three proofreading rounds, then two formatting rounds -- that&#039;s five -- and then a post-processor pulls everything together, makes sure that it&#039;s consistent, and then sends it on to someone who checks and uploads the text. 

As for redoing old texts -- someone commented on it in the forums, but I don&#039;t think there&#039;s been any announcement. I&quot;m not sure what&#039;s been tackled other than Robert Louis Stevenson. We&#039;re doing a multi-volume collection of his complete works. 

The question of displaying the images as well as the etext is a vexing one. Many people in DP would like to do so and I understand that we&#039;re holding the scans that we made and we control. However, we&#039;ve done a fair number of books based on  borrowed scans from Gallica, Google Books, and other such sites. Pictures of two-dimensional texts or pictures that are out of copyright can&#039;t be copyrighted and if we were to display the &quot;borrowed&quot; pictures, we&#039;d probably be OK but ... it seems better not to push it. 

I hope that I haven&#039;t gotten myself in hot water here, by seeming to speak for DP. I&#039;ve been volunteering there for four years, but I&#039;m just a foot-soldier, not one of the generals.</description>
		<content:encoded><![CDATA[<p>It&#8217;s three proofreading rounds, then two formatting rounds &#8212; that&#8217;s five &#8212; and then a post-processor pulls everything together, makes sure that it&#8217;s consistent, and then sends it on to someone who checks and uploads the text. </p>
<p>As for redoing old texts &#8212; someone commented on it in the forums, but I don&#8217;t think there&#8217;s been any announcement. I&#8221;m not sure what&#8217;s been tackled other than Robert Louis Stevenson. We&#8217;re doing a multi-volume collection of his complete works. </p>
<p>The question of displaying the images as well as the etext is a vexing one. Many people in DP would like to do so and I understand that we&#8217;re holding the scans that we made and we control. However, we&#8217;ve done a fair number of books based on  borrowed scans from Gallica, Google Books, and other such sites. Pictures of two-dimensional texts or pictures that are out of copyright can&#8217;t be copyrighted and if we were to display the &#8220;borrowed&#8221; pictures, we&#8217;d probably be OK but &#8230; it seems better not to push it. </p>
<p>I hope that I haven&#8217;t gotten myself in hot water here, by seeming to speak for DP. I&#8217;ve been volunteering there for four years, but I&#8217;m just a foot-soldier, not one of the generals.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jon Noring</title>
		<link>http://www.teleread.com/ebooks/digital-text-masters-digitizing-the-classic-public-domain-books/comment-page-1/#comment-224843</link>
		<dc:creator>Jon Noring</dc:creator>
		<pubDate>Tue, 13 Feb 2007 16:16:27 +0000</pubDate>
		<guid isPermaLink="false">http://www.teleread.org/blog/?p=6174#comment-224843</guid>
		<description>Karen, that is very good news about DP remastering some of the older PG works! It&#8217;s something several of us, for a long time, have encouraged them doing. I&#8217;ve mentioned it at least two times in the TeleBlog, and several times on &lt;a href=&quot;http://lists.pglaf.org/listinfo.cgi/gutvol-d&quot; rel=&quot;nofollow&quot;&gt;gutvol-d&lt;/a&gt; over the last few years. With the new PG policy of allowing provenance information in the new texts (bravo to Michael Hart and Greg Newby), this is definitely a great development where the winners are the readers.

And I&#8217;ve not heard about the five round system. Does this mean five proofreading rounds? Looking at the DP home page, it still shows only three. One thing that interests me is how accurate is the three proofreading system? That&#8217;s one thing I hope DTM will be able to answer.

For others reading this, DP is focused on the production side of the texts. As far a I know they still only donate their work product to PG, and make no other use of the texts. In addition, they have not (as far as I know) instituted an archival scanning requirement or recommendation, although individuals scanning books could do so if they want. (For one book I submitted to them, an original of Burton&#8217;s &lt;em&gt;Kama Sutra of Vatsyayana&lt;/em&gt; which still needs to be proofed, the scans were done at archival quality and then downsampled to meet DP&#8217;s preferences.)

I&#8217;ve even toyed for a while with the idea of starting a &lt;a href=&quot;http://groups.yahoo.com/group/distscan/&quot; rel=&quot;nofollow&quot;&gt;&#8220;Distributed Scanners&#8221;&lt;/a&gt; group (which could still be launched to support the &#8220;Digital Text Masters&#8221; project) &#8212; just find a bunch of &lt;a href=&quot;http://www.m-w.com/dictionary/persnickety&quot; rel=&quot;nofollow&quot;&gt;persnickety&lt;/a&gt; people (maybe with a touch of obsessive-compulsiveness to assure things are done right &lt;laugh/&gt;) with suitable scanners (e.g., high-grade sheet feed scanners and the Plustek OpticBook) and steer them to scan the great classics according to a set of requirements and guidelines. DS would, of course, build a database for archiving the scan sets along with donating copies of the originals to the Internet Archive (the scan sets, at archival quality, will take up a &lt;em&gt;lot&lt;/em&gt; of space, like five or more gigs per book, but with terabyte drives getting dirt cheap, and burning DVD&#8217;s also getting cheap, space is no longer an issue.)

So our vision for DTM, should it get launched, is much more comprehensive and wide-ranging. It certainly could take advantage of the DP system and if DP wants to be involved. (I don&#8217;t want to be presumptuous here &#8212; Juliet will understandably require that there be meat and real potential in the DTM proposal for her to commit any official DP mindshare to it.) A copy would go to PG, which is important, and we would get a copy as well. DTM would, of course, take the lead in securing the archival-quality and QC&#8217;d scans.

And DTM would work to start building the text annotation project (either alone, or preferably as part of David Rothman&#8217;s LibraryCity venture should he succeed at getting that funded in time) &#8212; important to make the texts much more useful in the educational market. Btw, since DTM will probably do a similar approach to mastering as what DP will do in the future with PGTEI, I see the works done by DP, which DTM won&#8217;t do, to augment the DTM works in the database we set up &#8212; again more synergies between the projects.

Since DP and PG are closely linked, PG will benefit from this as well. DTM is not intended to compete, but to complement PG and DP, and possibly open up doors the other projects will have more difficulty in opening. Visualize the association as a Venn diagram, with three circles intersecting with each other &#8212; DTM will have some of its circle outside of the others. The result is that the projects in toto will cover a bigger area.</description>
		<content:encoded><![CDATA[<p>Karen, that is very good news about DP remastering some of the older PG works! It&rsquo;s something several of us, for a long time, have encouraged them doing. I&rsquo;ve mentioned it at least two times in the TeleBlog, and several times on <a href="http://lists.pglaf.org/listinfo.cgi/gutvol-d" rel="nofollow">gutvol-d</a> over the last few years. With the new PG policy of allowing provenance information in the new texts (bravo to Michael Hart and Greg Newby), this is definitely a great development where the winners are the readers.</p>
<p>And I&rsquo;ve not heard about the five round system. Does this mean five proofreading rounds? Looking at the DP home page, it still shows only three. One thing that interests me is how accurate is the three proofreading system? That&rsquo;s one thing I hope DTM will be able to answer.</p>
<p>For others reading this, DP is focused on the production side of the texts. As far a I know they still only donate their work product to PG, and make no other use of the texts. In addition, they have not (as far as I know) instituted an archival scanning requirement or recommendation, although individuals scanning books could do so if they want. (For one book I submitted to them, an original of Burton&rsquo;s <em>Kama Sutra of Vatsyayana</em> which still needs to be proofed, the scans were done at archival quality and then downsampled to meet DP&rsquo;s preferences.)</p>
<p>I&rsquo;ve even toyed for a while with the idea of starting a <a href="http://groups.yahoo.com/group/distscan/" rel="nofollow">&ldquo;Distributed Scanners&rdquo;</a> group (which could still be launched to support the &ldquo;Digital Text Masters&rdquo; project) &mdash; just find a bunch of <a href="http://www.m-w.com/dictionary/persnickety" rel="nofollow">persnickety</a> people (maybe with a touch of obsessive-compulsiveness to assure things are done right &lt;laugh/&gt;) with suitable scanners (e.g., high-grade sheet feed scanners and the Plustek OpticBook) and steer them to scan the great classics according to a set of requirements and guidelines. DS would, of course, build a database for archiving the scan sets along with donating copies of the originals to the Internet Archive (the scan sets, at archival quality, will take up a <em>lot</em> of space, like five or more gigs per book, but with terabyte drives getting dirt cheap, and burning DVD&rsquo;s also getting cheap, space is no longer an issue.)</p>
<p>So our vision for DTM, should it get launched, is much more comprehensive and wide-ranging. It certainly could take advantage of the DP system and if DP wants to be involved. (I don&rsquo;t want to be presumptuous here &mdash; Juliet will understandably require that there be meat and real potential in the DTM proposal for her to commit any official DP mindshare to it.) A copy would go to PG, which is important, and we would get a copy as well. DTM would, of course, take the lead in securing the archival-quality and QC&rsquo;d scans.</p>
<p>And DTM would work to start building the text annotation project (either alone, or preferably as part of David Rothman&rsquo;s LibraryCity venture should he succeed at getting that funded in time) &mdash; important to make the texts much more useful in the educational market. Btw, since DTM will probably do a similar approach to mastering as what DP will do in the future with PGTEI, I see the works done by DP, which DTM won&rsquo;t do, to augment the DTM works in the database we set up &mdash; again more synergies between the projects.</p>
<p>Since DP and PG are closely linked, PG will benefit from this as well. DTM is not intended to compete, but to complement PG and DP, and possibly open up doors the other projects will have more difficulty in opening. Visualize the association as a Venn diagram, with three circles intersecting with each other &mdash; DTM will have some of its circle outside of the others. The result is that the projects in toto will cover a bigger area.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Karen Lofstrom</title>
		<link>http://www.teleread.com/ebooks/digital-text-masters-digitizing-the-classic-public-domain-books/comment-page-1/#comment-224529</link>
		<dc:creator>Karen Lofstrom</dc:creator>
		<pubDate>Tue, 13 Feb 2007 06:51:14 +0000</pubDate>
		<guid isPermaLink="false">http://www.teleread.org/blog/?p=6174#comment-224529</guid>
		<description>Quietly, without much fanfare, DP has already started redoing some works that were done before DP joined Project Gutenberg. I don&#039;t think that TPTB (The Powers That Be in DP) are opposed to this in principle; they&#039;re just moving carefully. 

I should think that DP would want to participate in the project. We already have the infrastructure and now, a five round system that is producing extremely high-quality works.  Indeed, a Dickens novel would be a welcome break from working on The Diseases of the Horse and other such odd works meandering through the system. 

We aren&#039;t limited to offering works to PG, so far as I know.  I would guess that TPTB would want to make any results available both to you and to PG.  But that&#039;s just a guess. 

I can&#039;t speak for TPTB. However, you shouldn&#039;t fear to approach them.</description>
		<content:encoded><![CDATA[<p>Quietly, without much fanfare, DP has already started redoing some works that were done before DP joined Project Gutenberg. I don&#8217;t think that TPTB (The Powers That Be in DP) are opposed to this in principle; they&#8217;re just moving carefully. </p>
<p>I should think that DP would want to participate in the project. We already have the infrastructure and now, a five round system that is producing extremely high-quality works.  Indeed, a Dickens novel would be a welcome break from working on The Diseases of the Horse and other such odd works meandering through the system. </p>
<p>We aren&#8217;t limited to offering works to PG, so far as I know.  I would guess that TPTB would want to make any results available both to you and to PG.  But that&#8217;s just a guess. </p>
<p>I can&#8217;t speak for TPTB. However, you shouldn&#8217;t fear to approach them.</p>
]]></content:encoded>
	</item>
</channel>
</rss>

<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Page Caching using disk: enhanced
Database Caching using disk: basic
Object Caching 469/499 objects using disk: basic

Served from: www.teleread.com @ 2012-02-15 06:56:47 -->
