<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: xargs parallel execution</title>
	<atom:link href="http://matpalm.com/blog/2009/11/06/xargs-parallel-execution/feed/" rel="self" type="application/rss+xml" />
	<link>http://matpalm.com/blog/2009/11/06/xargs-parallel-execution/</link>
	<description>thoughts from a data scientist wannabe</description>
	<lastBuildDate>Sun, 06 Jun 2010 19:38:46 -0400</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: matpalm</title>
		<link>http://matpalm.com/blog/2009/11/06/xargs-parallel-execution/comment-page-1/#comment-52</link>
		<dc:creator>matpalm</dc:creator>
		<pubDate>Thu, 28 Jan 2010 08:45:12 +0000</pubDate>
		<guid isPermaLink="false">http://matpalm.com/blog/?p=217#comment-52</guid>
		<description>thanks ole, 
had never heard of parallel before
will have to add it to my must-install list
mat</description>
		<content:encoded><![CDATA[<p>thanks ole,<br />
had never heard of parallel before<br />
will have to add it to my must-install list<br />
mat</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Ole Tange</title>
		<link>http://matpalm.com/blog/2009/11/06/xargs-parallel-execution/comment-page-1/#comment-51</link>
		<dc:creator>Ole Tange</dc:creator>
		<pubDate>Thu, 28 Jan 2010 07:23:28 +0000</pubDate>
		<guid isPermaLink="false">http://matpalm.com/blog/?p=217#comment-51</guid>
		<description>As you have discovered you need to make temporary files to avoid mixing xargs&#039; output for different jobs.

Parallel  &lt;a href=&quot;https://savannah.nongnu.org/projects/parallel/&quot; rel=&quot;nofollow&quot;&gt;https://savannah.nongnu.org/projects/parallel/&lt;/a&gt; does not suffer from this:

ls sample*gz &#124; parallel -j+0 -k &#039;zcat {} &#124; ./script.rb&#039; &gt; output

-k makes sure the output is in the same order as input.

-j+0 runs number_of_cores jobs in parallel.</description>
		<content:encoded><![CDATA[<p>As you have discovered you need to make temporary files to avoid mixing xargs&#8217; output for different jobs.</p>
<p>Parallel  <a href="https://savannah.nongnu.org/projects/parallel/" rel="nofollow">https://savannah.nongnu.org/projects/parallel/</a> does not suffer from this:</p>
<p>ls sample*gz | parallel -j+0 -k &#8216;zcat {} | ./script.rb&#8217; &gt; output</p>
<p>-k makes sure the output is in the same order as input.</p>
<p>-j+0 runs number_of_cores jobs in parallel.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
