<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Google&#8217;s digitizing 10M books a year, says the Economist&#8212;so what does this mean to the public domain community?</title>
	<atom:link href="http://www.teleread.com/2007/03/23/googles-digitizing-10m-books-a-year-says-the-economist-so-what-does-this-mean-to-the-public-domain-community/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.teleread.com/ebooks/googles-digitizing-10m-books-a-year-says-the-economist-so-what-does-this-mean-to-the-public-domain-community/</link>
	<description>News &#38; views on e-books, libraries, publishing and related topics</description>
	<lastBuildDate>Wed, 15 Feb 2012 20:23:39 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
	<item>
		<title>By: Garson O'Toole</title>
		<link>http://www.teleread.com/ebooks/googles-digitizing-10m-books-a-year-says-the-economist-so-what-does-this-mean-to-the-public-domain-community/comment-page-1/#comment-278247</link>
		<dc:creator>Garson O'Toole</dc:creator>
		<pubDate>Sun, 25 Mar 2007 18:49:30 +0000</pubDate>
		<guid isPermaLink="false">http://www.teleread.org/blog/?p=6324#comment-278247</guid>
		<description>I have been using Google Book search to conduct a simple fun research project that I hope to write about on this blog. It is a remarkably powerful and useful tool and Google in my opinion deserves praise for building it and letting people use it without charge. Government and non-profit groups have so far spectacularly failed in constructing a comprehensive e-book library. (Certainly the Internet Archive, Distributed Proofreaders, Gutenberg and other groups also deserve praise.)

The Google tool has exasperating flaws as Google Doubter and Eric Wilson mention above. The database contains too many blurry unreadable scan images together with improperly cropped images and half-images. In addition there are sometimes fingers visible in scans and inaccurate optical character recognition (OCR) results. The snippets are too small and the graphical image companion for each snippet sometimes is cropped too tightly or shows the wrong text. (Google is under ferocious legal attack by publishers and author guilds and perhaps it has deliberately chosen a tiny snippet size.) The main problem hampering my research diversion is inaccurate/misleading publication dates.</description>
		<content:encoded><![CDATA[<p>I have been using Google Book search to conduct a simple fun research project that I hope to write about on this blog. It is a remarkably powerful and useful tool and Google in my opinion deserves praise for building it and letting people use it without charge. Government and non-profit groups have so far spectacularly failed in constructing a comprehensive e-book library. (Certainly the Internet Archive, Distributed Proofreaders, Gutenberg and other groups also deserve praise.)</p>
<p>The Google tool has exasperating flaws as Google Doubter and Eric Wilson mention above. The database contains too many blurry unreadable scan images together with improperly cropped images and half-images. In addition there are sometimes fingers visible in scans and inaccurate optical character recognition (OCR) results. The snippets are too small and the graphical image companion for each snippet sometimes is cropped too tightly or shows the wrong text. (Google is under ferocious legal attack by publishers and author guilds and perhaps it has deliberately chosen a tiny snippet size.) The main problem hampering my research diversion is inaccurate/misleading publication dates.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Eric Wilson</title>
		<link>http://www.teleread.com/ebooks/googles-digitizing-10m-books-a-year-says-the-economist-so-what-does-this-mean-to-the-public-domain-community/comment-page-1/#comment-278097</link>
		<dc:creator>Eric Wilson</dc:creator>
		<pubDate>Sun, 25 Mar 2007 16:34:14 +0000</pubDate>
		<guid isPermaLink="false">http://www.teleread.org/blog/?p=6324#comment-278097</guid>
		<description>Google Doubter makes an interesting point re: the scan quality...putting aside the fact the bloated file size is sometimes due to color reproductions of tan pages(!), some of the p.d. Google Books I tried reading were either missing pages or had them out of order.  I still needed Project Gutenberg copies to be sure I got everything that was supposed to be there. 

I&#039;m a big booster for digital facsimiles of classic editions, so in that respect I don&#039;t mind PDF so much, but Google Books has a bit of a way to go in the quality control department before they&#039;re a proper threat.</description>
		<content:encoded><![CDATA[<p>Google Doubter makes an interesting point re: the scan quality&#8230;putting aside the fact the bloated file size is sometimes due to color reproductions of tan pages(!), some of the p.d. Google Books I tried reading were either missing pages or had them out of order.  I still needed Project Gutenberg copies to be sure I got everything that was supposed to be there. </p>
<p>I&#8217;m a big booster for digital facsimiles of classic editions, so in that respect I don&#8217;t mind PDF so much, but Google Books has a bit of a way to go in the quality control department before they&#8217;re a proper threat.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: David Rothman</title>
		<link>http://www.teleread.com/ebooks/googles-digitizing-10m-books-a-year-says-the-economist-so-what-does-this-mean-to-the-public-domain-community/comment-page-1/#comment-276331</link>
		<dc:creator>David Rothman</dc:creator>
		<pubDate>Sat, 24 Mar 2007 05:02:58 +0000</pubDate>
		<guid isPermaLink="false">http://www.teleread.org/blog/?p=6324#comment-276331</guid>
		<description>Thanks for your thoughts, Brian. Remember, Google is already watermarking public domain content with a corporate logo, and that just might be the opening for much more. I&#039;m all in favor of all kinds of biz models, including Google&#039;s. But I do worry about it preempting libraries and other institutions and Gutenberg-style groups in various respects. - David</description>
		<content:encoded><![CDATA[<p>Thanks for your thoughts, Brian. Remember, Google is already watermarking public domain content with a corporate logo, and that just might be the opening for much more. I&#8217;m all in favor of all kinds of biz models, including Google&#8217;s. But I do worry about it preempting libraries and other institutions and Gutenberg-style groups in various respects. &#8211; David</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Brian Carnell</title>
		<link>http://www.teleread.com/ebooks/googles-digitizing-10m-books-a-year-says-the-economist-so-what-does-this-mean-to-the-public-domain-community/comment-page-1/#comment-275866</link>
		<dc:creator>Brian Carnell</dc:creator>
		<pubDate>Fri, 23 Mar 2007 22:44:08 +0000</pubDate>
		<guid isPermaLink="false">http://www.teleread.org/blog/?p=6324#comment-275866</guid>
		<description>I guess I&#039;m missing something -- why should I care if Google supplants PG and IA for putting public domain books online. They are still in the public domain, so nothing will prevent me from grabbing the book and making my own version or redistributing it in text form. 

Now obviously if Google tries to assert some sort of ownership of derivatives of its public domain scans, that&#039;s a problem, but assuming nothing like that happens and they don&#039;t care if I grab the public domain PDF and do my own OCR and convert it to the ebook format flavor of the week, what&#039;s the problem?</description>
		<content:encoded><![CDATA[<p>I guess I&#8217;m missing something &#8212; why should I care if Google supplants PG and IA for putting public domain books online. They are still in the public domain, so nothing will prevent me from grabbing the book and making my own version or redistributing it in text form. </p>
<p>Now obviously if Google tries to assert some sort of ownership of derivatives of its public domain scans, that&#8217;s a problem, but assuming nothing like that happens and they don&#8217;t care if I grab the public domain PDF and do my own OCR and convert it to the ebook format flavor of the week, what&#8217;s the problem?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: google doubter</title>
		<link>http://www.teleread.com/ebooks/googles-digitizing-10m-books-a-year-says-the-economist-so-what-does-this-mean-to-the-public-domain-community/comment-page-1/#comment-275814</link>
		<dc:creator>google doubter</dc:creator>
		<pubDate>Fri, 23 Mar 2007 21:50:19 +0000</pubDate>
		<guid isPermaLink="false">http://www.teleread.org/blog/?p=6324#comment-275814</guid>
		<description>The Google books exercise is of enormous value for me but it is like searching a garbage can. The quality of Google books is appaling, OCR is no better and their policies to show you only snippets of books more then 100 years old that are no longer available in bookshops or libraries is very frustrating. What is more, the snippets are in at least 50 per cent not relevant to my search.
As an individual I can do much better scans/photos with OCR and indexing. Thanks and shame for Google.</description>
		<content:encoded><![CDATA[<p>The Google books exercise is of enormous value for me but it is like searching a garbage can. The quality of Google books is appaling, OCR is no better and their policies to show you only snippets of books more then 100 years old that are no longer available in bookshops or libraries is very frustrating. What is more, the snippets are in at least 50 per cent not relevant to my search.<br />
As an individual I can do much better scans/photos with OCR and indexing. Thanks and shame for Google.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: SCWeaver</title>
		<link>http://www.teleread.com/ebooks/googles-digitizing-10m-books-a-year-says-the-economist-so-what-does-this-mean-to-the-public-domain-community/comment-page-1/#comment-275491</link>
		<dc:creator>SCWeaver</dc:creator>
		<pubDate>Fri, 23 Mar 2007 17:44:31 +0000</pubDate>
		<guid isPermaLink="false">http://www.teleread.org/blog/?p=6324#comment-275491</guid>
		<description>David, regarding a previous post, the Internet Arichive has a scanned and ORC copy of &quot;Free Air&quot; available for download.</description>
		<content:encoded><![CDATA[<p>David, regarding a previous post, the Internet Arichive has a scanned and ORC copy of &#8220;Free Air&#8221; available for download.</p>
]]></content:encoded>
	</item>
</channel>
</rss>

<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Page Caching using disk: enhanced
Database Caching using disk: basic
Object Caching 374/399 objects using disk: basic

Served from: www.teleread.com @ 2012-02-15 17:12:47 -->
