<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: FamousPlagiarists.com&#8211;and an e-book angle</title>
	<atom:link href="http://www.teleread.com/2005/06/23/famous-plargiarists-and-an-e-book-angle/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.teleread.com/uncategorized/famous-plargiarists-and-an-e-book-angle/</link>
	<description>News &#38; views on e-books, libraries, publishing and related topics</description>
	<lastBuildDate>Tue, 14 Feb 2012 21:55:20 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
	<item>
		<title>By: Garson Poole</title>
		<link>http://www.teleread.com/uncategorized/famous-plargiarists-and-an-e-book-angle/comment-page-1/#comment-966</link>
		<dc:creator>Garson Poole</dc:creator>
		<pubDate>Sat, 25 Jun 2005 10:37:52 +0000</pubDate>
		<guid isPermaLink="false">http://www.teleread.org/blog/?p=3101#comment-966</guid>
		<description>I agree that word-crunching all of Project Gutenberg to look for plagiarism would be interesting. Current plagiarists are apparently blissfully unaware that the probability of detection is growing rapidly, and that detection will be nearly unavoidable in the future. The corpus of documents available in electronic form is enormous and rapidly expanding. Even primitive tools such as &quot;Google&quot; give a preview of the speed and power of search on huge databases. First-generation companies specializing in detecting plagiarism, such as turnitin.com, already exist.

Intellectuals have been remarkably tardy in demanding that the Library of Congress electronically scan its entire collection for easy and universal access. Certainly, this should be done immediately for all documents in the public domain. For full-text searching and indexing purposes, it should also be done for all documents, even those that are copyrighted.

Optical character recognition is an imperfect technology, but it allows the extraction of searchable text from scanned printed documents with 98 or 99 percent accuracy. This is adequate for finding verbatim and near-verbatim plagiarism of text passages through the use of flexible approximate matching algorithms. (It will not, however, catch extensive paraphrasing.) 

New automated tools will help shame thieving miscreants by comparing candidate documents against all the documents in a super-corpus that includes the current web, archives such as the &quot;Wayback Machine&quot; at archive.org, and the printed texts in the Library of Congress. Historians will be able to judge the &quot;originality&quot; of historical figures and previous historians, with perhaps fascinating revelations.

(Note: I did not plagiarize this comment. I wrote it back in January 2003 at another website and thought it would be appropriate here also.)</description>
		<content:encoded><![CDATA[<p>I agree that word-crunching all of Project Gutenberg to look for plagiarism would be interesting. Current plagiarists are apparently blissfully unaware that the probability of detection is growing rapidly, and that detection will be nearly unavoidable in the future. The corpus of documents available in electronic form is enormous and rapidly expanding. Even primitive tools such as &#8220;Google&#8221; give a preview of the speed and power of search on huge databases. First-generation companies specializing in detecting plagiarism, such as turnitin.com, already exist.</p>
<p>Intellectuals have been remarkably tardy in demanding that the Library of Congress electronically scan its entire collection for easy and universal access. Certainly, this should be done immediately for all documents in the public domain. For full-text searching and indexing purposes, it should also be done for all documents, even those that are copyrighted.</p>
<p>Optical character recognition is an imperfect technology, but it allows the extraction of searchable text from scanned printed documents with 98 or 99 percent accuracy. This is adequate for finding verbatim and near-verbatim plagiarism of text passages through the use of flexible approximate matching algorithms. (It will not, however, catch extensive paraphrasing.) </p>
<p>New automated tools will help shame thieving miscreants by comparing candidate documents against all the documents in a super-corpus that includes the current web, archives such as the &#8220;Wayback Machine&#8221; at archive.org, and the printed texts in the Library of Congress. Historians will be able to judge the &#8220;originality&#8221; of historical figures and previous historians, with perhaps fascinating revelations.</p>
<p>(Note: I did not plagiarize this comment. I wrote it back in January 2003 at another website and thought it would be appropriate here also.)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: David Rothman</title>
		<link>http://www.teleread.com/uncategorized/famous-plargiarists-and-an-e-book-angle/comment-page-1/#comment-918</link>
		<dc:creator>David Rothman</dc:creator>
		<pubDate>Fri, 24 Jun 2005 06:48:40 +0000</pubDate>
		<guid isPermaLink="false">http://www.teleread.org/blog/?p=3101#comment-918</guid>
		<description>Oh, but that is one of my favorite movies! Clearly Frank Abagnale&#039;s plagiarism of his victims&#039; signatures fell outside fair use guidelines.</description>
		<content:encoded><![CDATA[<p>Oh, but that is one of my favorite movies! Clearly Frank Abagnale&#8217;s plagiarism of his victims&#8217; signatures fell outside fair use guidelines.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Albert Z</title>
		<link>http://www.teleread.com/uncategorized/famous-plargiarists-and-an-e-book-angle/comment-page-1/#comment-903</link>
		<dc:creator>Albert Z</dc:creator>
		<pubDate>Thu, 23 Jun 2005 21:48:13 +0000</pubDate>
		<guid isPermaLink="false">http://www.teleread.org/blog/?p=3101#comment-903</guid>
		<description>And let&#039;s not forget about Frank Abagnale, the subject of the recent &quot;Catch Me If You Can&quot; with DiCaprio and Tom Hanks. I really liked that movie because it clearly showed what a clever man can do with his plagiarist skills :)</description>
		<content:encoded><![CDATA[<p>And let&#8217;s not forget about Frank Abagnale, the subject of the recent &#8220;Catch Me If You Can&#8221; with DiCaprio and Tom Hanks. I really liked that movie because it clearly showed what a clever man can do with his plagiarist skills <img src='http://www.teleread.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
]]></content:encoded>
	</item>
</channel>
</rss>

