<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Nongraphical.com &#187; python</title>
	<atom:link href="http://nongraphical.com/tag/python/feed/" rel="self" type="application/rss+xml" />
	<link>http://nongraphical.com</link>
	<description>A dynamic trapeze of web design, programming, and randomness.</description>
	<lastBuildDate>Wed, 09 Jun 2010 18:15:29 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>A tool for removing duplicate files</title>
		<link>http://nongraphical.com/2008/10/a-tool-for-removing-duplicate-files/</link>
		<comments>http://nongraphical.com/2008/10/a-tool-for-removing-duplicate-files/#comments</comments>
		<pubDate>Thu, 02 Oct 2008 21:47:08 +0000</pubDate>
		<dc:creator>fyhuang</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[code]]></category>
		<category><![CDATA[computers]]></category>
		<category><![CDATA[linux]]></category>
		<category><![CDATA[python]]></category>

		<guid isPermaLink="false">http://blog.nongraphical.com/?p=85</guid>
		<description><![CDATA[Down&#173;load Re&#173;move&#173;Du&#173;pli&#173;cates.py

One of the prob&#173;lems with us&#173;ing hy&#173;brid Win&#173;dows and Lin&#173;ux en&#173;vi&#173;ron&#173;ments is that one needs to watch close&#173;ly for filesys&#173;tem and file anoma&#173;lies and in&#173;con&#173;sis&#173;ten&#173;cies. Dif&#173;fer&#173;ing end-of-line mark&#173;ers, for ex&#173;am&#173;ple, cause many prob&#173;lems when shar&#173;ing files be&#173;tween the two op&#173;er&#173;at&#173;ing sys&#173;tems. One par&#173;tic&#173;u&#173;lar prob&#173;lem I&#8217;ve run in&#173;to is that of hav&#173;ing du&#173;pli&#173;cate files, or [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://nongraphical.com/media/uploads/RemoveDuplicates.py">Down&shy;load Re&shy;moveDu&shy;pli&shy;cates.py</a></p>

<p>One of the prob&shy;lems with us&shy;ing hy&shy;brid Win&shy;dows and Lin&shy;ux en&shy;vi&shy;ron&shy;ments is that one needs to watch close&shy;ly for filesys&shy;tem and file anoma&shy;lies and in&shy;con&shy;sis&shy;ten&shy;cies. Dif&shy;fer&shy;ing end-of-line mark&shy;ers, for ex&shy;am&shy;ple, cause many prob&shy;lems when shar&shy;ing files be&shy;tween the two op&shy;er&shy;at&shy;ing sys&shy;tems. One par&shy;tic&shy;u&shy;lar prob&shy;lem I&#8217;ve run in&shy;to is that of hav&shy;ing du&shy;pli&shy;cate files, or in other words, mul&shy;ti&shy;ple files with the same file&shy;name. This can hap&shy;pen if, say, you copy a di&shy;rec&shy;to&shy;ry some&shy;where in Win&shy;dows, then switch to Lin&shy;ux and use a tool such as rsync to copy that same di&shy;rec&shy;to&shy;ry over again. If the cap&shy;i&shy;tal&shy;iza&shy;tion is dif&shy;fer&shy;ent, Lin&shy;ux will not re&shy;place the old files, be&shy;cause Lin&shy;ux, un&shy;like Win&shy;dows, is case-sen&shy;si&shy;tive. This will even hap&shy;pen, and is tech&shy;ni&shy;cal&shy;ly ac&shy;cept&shy;able, on NTFS filesys&shy;tems.</p>

<p>The so&shy;lu&shy;tion I&#8217;ve come up with is this sim&shy;ple script, called Re&shy;moveDu&shy;pli&shy;cates.py. Ob&shy;vi&shy;ous&shy;ly, you need <a href="http://www.python.org/">Python</a> in&shy;stalled to run it, but it has no ad&shy;di&shy;tion&shy;al de&shy;pen&shy;den&shy;cies. Sim&shy;ply run it <em>in the di&shy;rec&shy;to&shy;ry you wish to clean</em>, and it should do the rest. Note that you shouldn&#8217;t use this for en&shy;tire filesys&shy;tems (yet), be&shy;cause it will use ridicu&shy;lous amounts of mem&shy;o&shy;ry if it is given a high num&shy;ber of files. <a href="http://nongraphical.com/media/uploads/RemoveDuplicates.py">Down&shy;load it here</a>!</p>

<p>P.S. Al&shy;so, I can&shy;not guar&shy;an&shy;tee that this tool will work as in&shy;tend&shy;ed or will be bug-free. Use wise&shy;ly.</p>
]]></content:encoded>
			<wfw:commentRss>http://nongraphical.com/2008/10/a-tool-for-removing-duplicate-files/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>

<!-- Dynamic Page Served (once) in 0.287 seconds -->
