<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments for joe crobak&#039;s website</title>
	<atom:link href="http://www.crobak.org/comments/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.crobak.org</link>
	<description>sharing what I find</description>
	<lastBuildDate>Fri, 10 May 2013 18:21:40 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.5.1</generator>
	<item>
		<title>Comment on Workflow Engines for Hadoop by Alvin</title>
		<link>http://www.crobak.org/2012/07/workflow-engines-for-hadoop/#comment-619</link>
		<dc:creator>Alvin</dc:creator>
		<pubDate>Fri, 10 May 2013 18:21:40 +0000</pubDate>
		<guid isPermaLink="false">http://www.crobak.org/?p=111#comment-619</guid>
		<description><![CDATA[Oozie does support rerun w/o making every action a sub-workflow.  Specify oozie.wf.rerun.failnodes=true in the job.properties file and rerun the same workflow with the -rerun option and the previous job number as an argument.]]></description>
		<content:encoded><![CDATA[<p>Oozie does support rerun w/o making every action a sub-workflow.  Specify oozie.wf.rerun.failnodes=true in the job.properties file and rerun the same workflow with the -rerun option and the previous job number as an argument.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Workflow Engines for Hadoop by ivan provalov</title>
		<link>http://www.crobak.org/2012/07/workflow-engines-for-hadoop/#comment-617</link>
		<dc:creator>ivan provalov</dc:creator>
		<pubDate>Thu, 07 Feb 2013 23:54:35 +0000</pubDate>
		<guid isPermaLink="false">http://www.crobak.org/?p=111#comment-617</guid>
		<description><![CDATA[Joe, 

Great comparison.   For Oozie workflows visualization, there is also a VizOozie.  It parses a workflow xml file and generates a dot file.  You can use graphviz&#039;s dot to convert it to any format:

https://github.com/iprovalo/vizoozie

See this blog for more info:

http://info.lucidworks.com/blog/bid/266880/LucidWorks-Big-Data-Oozie-Workflow-With-VizOozie

Regards,

Ivan Provalov]]></description>
		<content:encoded><![CDATA[<p>Joe, </p>
<p>Great comparison.   For Oozie workflows visualization, there is also a VizOozie.  It parses a workflow xml file and generates a dot file.  You can use graphviz&#8217;s dot to convert it to any format:</p>
<p><a href="https://github.com/iprovalo/vizoozie" rel="nofollow">https://github.com/iprovalo/vizoozie</a></p>
<p>See this blog for more info:</p>
<p><a href="http://info.lucidworks.com/blog/bid/266880/LucidWorks-Big-Data-Oozie-Workflow-With-VizOozie" rel="nofollow">http://info.lucidworks.com/blog/bid/266880/LucidWorks-Big-Data-Oozie-Workflow-With-VizOozie</a></p>
<p>Regards,</p>
<p>Ivan Provalov</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Moving wordpress blog to lighttpd by raspberry pi, lighttpd and wordpress pretty permalink &#124; Rants, grunts and chants</title>
		<link>http://www.crobak.org/2011/01/moving-wordpress-blog-to-lighttpd/#comment-615</link>
		<dc:creator>raspberry pi, lighttpd and wordpress pretty permalink &#124; Rants, grunts and chants</dc:creator>
		<pubDate>Fri, 28 Dec 2012 14:03:59 +0000</pubDate>
		<guid isPermaLink="false">http://www-test.crobak.org/?p=3#comment-615</guid>
		<description><![CDATA[[...] finds these links useful: http://www.crobak.org/2011/01/moving-wordpress-blog-to-lighttpd/ http://longspine.com/how-to/wordpress-pretty-permalinks-on-lighttpd/ [...]]]></description>
		<content:encoded><![CDATA[<p>[...] finds these links useful: <a href="http://www.crobak.org/2011/01/moving-wordpress-blog-to-lighttpd/" rel="nofollow">http://www.crobak.org/2011/01/moving-wordpress-blog-to-lighttpd/</a> <a href="http://longspine.com/how-to/wordpress-pretty-permalinks-on-lighttpd/" rel="nofollow">http://longspine.com/how-to/wordpress-pretty-permalinks-on-lighttpd/</a> [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Getting Started with Apache Hadoop 0.23.0 by Keith Wiley</title>
		<link>http://www.crobak.org/2011/12/getting-started-with-apache-hadoop-0-23-0/#comment-613</link>
		<dc:creator>Keith Wiley</dc:creator>
		<pubDate>Fri, 02 Nov 2012 18:36:51 +0000</pubDate>
		<guid isPermaLink="false">http://www.crobak.org/?p=85#comment-613</guid>
		<description><![CDATA[Gah!  Even though a comment pointed out that you forgot to mention formatting the namenode, and even though you replied that you would update the article...the omission is still there.  I spent quite a while trying to figure out why the datanode would start but not the namenode (I figured it out by investigating the namenode error log in logs/).

You should really put that update in the article.  :-)

Cheers!]]></description>
		<content:encoded><![CDATA[<p>Gah!  Even though a comment pointed out that you forgot to mention formatting the namenode, and even though you replied that you would update the article&#8230;the omission is still there.  I spent quite a while trying to figure out why the datanode would start but not the namenode (I figured it out by investigating the namenode error log in logs/).</p>
<p>You should really put that update in the article.  <img src='http://www.crobak.org/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> </p>
<p>Cheers!</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Getting Started with Apache Hadoop 0.23.0 by Hardik</title>
		<link>http://www.crobak.org/2011/12/getting-started-with-apache-hadoop-0-23-0/#comment-611</link>
		<dc:creator>Hardik</dc:creator>
		<pubDate>Fri, 07 Sep 2012 00:12:17 +0000</pubDate>
		<guid isPermaLink="false">http://www.crobak.org/?p=85#comment-611</guid>
		<description><![CDATA[I get the same FileNotFoundException running &quot;pi&quot; example,  anyone with some idea pleas help

 bin/hadoop jar share/hadoop/mapreduce/hadoop-mapreduce-examples-0.23.1.jar  pi -Dmapreduce.clientfactory.class.name=org.apache.hadoop.mapred.YarnClientFactory -libjars share/hadoop/mapreduce/hadoop-mapreduce-client-jobclient-0.23.1.jar 16 10000
12/09/06 19:56:10 WARN conf.Configuration: mapred.used.genericoptionsparser is deprecated. Instead, use mapreduce.client.genericoptionsparser.used
Number of Maps  = 16
Samples per Map = 10000
Wrote input for Map #0
Wrote input for Map #1
Wrote input for Map #2
Wrote input for Map #3
Wrote input for Map #4
Wrote input for Map #5
Wrote input for Map #6
Wrote input for Map #7
Wrote input for Map #8
Wrote input for Map #9
Wrote input for Map #10
Wrote input for Map #11
Wrote input for Map #12
Wrote input for Map #13
Wrote input for Map #14
Wrote input for Map #15
Starting Job
12/09/06 19:56:22 WARN conf.Configuration: fs.default.name is deprecated. Instead, use fs.defaultFS
12/09/06 19:56:22 INFO input.FileInputFormat: Total input paths to process : 16
12/09/06 19:56:23 INFO mapreduce.JobSubmitter: number of splits:16
12/09/06 19:56:25 INFO mapred.ResourceMgrDelegate: Submitted application application_1346972652940_0002 to ResourceManager at /0.0.0.0:8040
12/09/06 19:56:27 INFO mapreduce.Job: The url to track the job: http://10.215.12.11:8088/proxy/application_1346972652940_0002/
12/09/06 19:56:27 INFO mapreduce.Job: Running job: job_1346972652940_0002
12/09/06 19:56:55 INFO mapreduce.Job: Job job_1346972652940_0002 running in uber mode : false
12/09/06 19:56:55 INFO mapreduce.Job:  map 0% reduce 0%
12/09/06 19:56:56 INFO mapreduce.Job: Job job_1346972652940_0002 failed with state FAILED due to: Application application_1346972652940_0002 failed 1 times due to AM Container for appattempt_1346972652940_0002_000001 exited with  exitCode: 1 due to: 
.Failing this attempt.. Failing the application.
12/09/06 19:56:56 INFO mapreduce.Job: Counters: 0
Job Finished in 35.048 seconds
java.io.FileNotFoundException: File does not exist: hdfs://localhost:9000/user/hardikpandya/QuasiMonteCarlo_TMP_3_141592654/out/reduce-out
	at org.apache.hadoop.hdfs.DistributedFileSystem.getFileStatus(DistributedFileSystem.java:729)
	at org.apache.hadoop.io.SequenceFile$Reader.(SequenceFile.java:1685)
	at org.apache.hadoop.io.SequenceFile$Reader.(SequenceFile.java:1709)
	at org.apache.hadoop.examples.QuasiMonteCarlo.estimatePi(QuasiMonteCarlo.java:314)
	at org.apache.hadoop.examples.QuasiMonteCarlo.run(QuasiMonteCarlo.java:351)
	at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:69)
	at org.apache.hadoop.examples.QuasiMonteCarlo.main(QuasiMonteCarlo.java:360)
	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
	at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
	at java.lang.reflect.Method.invoke(Method.java:597)
	at org.apache.hadoop.util.ProgramDriver$ProgramDescription.invoke(ProgramDriver.java:72)
	at org.apache.hadoop.util.ProgramDriver.driver(ProgramDriver.java:144)
	at org.apache.hadoop.examples.ExampleDriver.main(ExampleDriver.java:68)
	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
	at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
	at java.lang.reflect.Method.invoke(Method.java:597)
	at org.apache.hadoop.util.RunJar.main(RunJar.java:200)]]></description>
		<content:encoded><![CDATA[<p>I get the same FileNotFoundException running &#8220;pi&#8221; example,  anyone with some idea pleas help</p>
<p> bin/hadoop jar share/hadoop/mapreduce/hadoop-mapreduce-examples-0.23.1.jar  pi -Dmapreduce.clientfactory.class.name=org.apache.hadoop.mapred.YarnClientFactory -libjars share/hadoop/mapreduce/hadoop-mapreduce-client-jobclient-0.23.1.jar 16 10000<br />
12/09/06 19:56:10 WARN conf.Configuration: mapred.used.genericoptionsparser is deprecated. Instead, use mapreduce.client.genericoptionsparser.used<br />
Number of Maps  = 16<br />
Samples per Map = 10000<br />
Wrote input for Map #0<br />
Wrote input for Map #1<br />
Wrote input for Map #2<br />
Wrote input for Map #3<br />
Wrote input for Map #4<br />
Wrote input for Map #5<br />
Wrote input for Map #6<br />
Wrote input for Map #7<br />
Wrote input for Map #8<br />
Wrote input for Map #9<br />
Wrote input for Map #10<br />
Wrote input for Map #11<br />
Wrote input for Map #12<br />
Wrote input for Map #13<br />
Wrote input for Map #14<br />
Wrote input for Map #15<br />
Starting Job<br />
12/09/06 19:56:22 WARN conf.Configuration: fs.default.name is deprecated. Instead, use fs.defaultFS<br />
12/09/06 19:56:22 INFO input.FileInputFormat: Total input paths to process : 16<br />
12/09/06 19:56:23 INFO mapreduce.JobSubmitter: number of splits:16<br />
12/09/06 19:56:25 INFO mapred.ResourceMgrDelegate: Submitted application application_1346972652940_0002 to ResourceManager at /0.0.0.0:8040<br />
12/09/06 19:56:27 INFO mapreduce.Job: The url to track the job: <a href="http://10.215.12.11:8088/proxy/application_1346972652940_0002/" rel="nofollow">http://10.215.12.11:8088/proxy/application_1346972652940_0002/</a><br />
12/09/06 19:56:27 INFO mapreduce.Job: Running job: job_1346972652940_0002<br />
12/09/06 19:56:55 INFO mapreduce.Job: Job job_1346972652940_0002 running in uber mode : false<br />
12/09/06 19:56:55 INFO mapreduce.Job:  map 0% reduce 0%<br />
12/09/06 19:56:56 INFO mapreduce.Job: Job job_1346972652940_0002 failed with state FAILED due to: Application application_1346972652940_0002 failed 1 times due to AM Container for appattempt_1346972652940_0002_000001 exited with  exitCode: 1 due to:<br />
.Failing this attempt.. Failing the application.<br />
12/09/06 19:56:56 INFO mapreduce.Job: Counters: 0<br />
Job Finished in 35.048 seconds<br />
java.io.FileNotFoundException: File does not exist: hdfs://localhost:9000/user/hardikpandya/QuasiMonteCarlo_TMP_3_141592654/out/reduce-out<br />
	at org.apache.hadoop.hdfs.DistributedFileSystem.getFileStatus(DistributedFileSystem.java:729)<br />
	at org.apache.hadoop.io.SequenceFile$Reader.(SequenceFile.java:1685)<br />
	at org.apache.hadoop.io.SequenceFile$Reader.(SequenceFile.java:1709)<br />
	at org.apache.hadoop.examples.QuasiMonteCarlo.estimatePi(QuasiMonteCarlo.java:314)<br />
	at org.apache.hadoop.examples.QuasiMonteCarlo.run(QuasiMonteCarlo.java:351)<br />
	at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:69)<br />
	at org.apache.hadoop.examples.QuasiMonteCarlo.main(QuasiMonteCarlo.java:360)<br />
	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)<br />
	at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)<br />
	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)<br />
	at java.lang.reflect.Method.invoke(Method.java:597)<br />
	at org.apache.hadoop.util.ProgramDriver$ProgramDescription.invoke(ProgramDriver.java:72)<br />
	at org.apache.hadoop.util.ProgramDriver.driver(ProgramDriver.java:144)<br />
	at org.apache.hadoop.examples.ExampleDriver.main(ExampleDriver.java:68)<br />
	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)<br />
	at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)<br />
	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)<br />
	at java.lang.reflect.Method.invoke(Method.java:597)<br />
	at org.apache.hadoop.util.RunJar.main(RunJar.java:200)</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Getting Started with Apache Hadoop 0.23.0 by dheeren@yahoo.com</title>
		<link>http://www.crobak.org/2011/12/getting-started-with-apache-hadoop-0-23-0/#comment-610</link>
		<dc:creator>dheeren@yahoo.com</dc:creator>
		<pubDate>Tue, 04 Sep 2012 23:54:43 +0000</pubDate>
		<guid isPermaLink="false">http://www.crobak.org/?p=85#comment-610</guid>
		<description><![CDATA[To start history server use
$ sbin/mr-jobhistory-daemon.sh start

With cdh401 I was unable to start history server using
$ ysbin/arn-daemon.sh start historyserver

starting historyserver, logging to /tmp/yarn-hhhhuser-historyserver-rd1-nn1-1-sfm.ops.sfdc.net.out
Exception in thread &quot;main&quot; java.lang.NoClassDefFoundError: historyserver
Caused by: java.lang.ClassNotFoundException: historyserver
	at java.net.URLClassLoader$1.run(URLClassLoader.java:202)
	at java.security.AccessController.doPrivileged(Native Method)
	at java.net.URLClassLoader.findClass(URLClassLoader.java:190)
	at java.lang.ClassLoader.loadClass(ClassLoader.java:306)
	at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:301)
	at java.lang.ClassLoader.loadClass(ClassLoader.java:247)
Could not find the main class: historyserver.  Program will exit.]]></description>
		<content:encoded><![CDATA[<p>To start history server use<br />
$ sbin/mr-jobhistory-daemon.sh start</p>
<p>With cdh401 I was unable to start history server using<br />
$ ysbin/arn-daemon.sh start historyserver</p>
<p>starting historyserver, logging to /tmp/yarn-hhhhuser-historyserver-rd1-nn1-1-sfm.ops.sfdc.net.out<br />
Exception in thread &#8220;main&#8221; java.lang.NoClassDefFoundError: historyserver<br />
Caused by: java.lang.ClassNotFoundException: historyserver<br />
	at java.net.URLClassLoader$1.run(URLClassLoader.java:202)<br />
	at java.security.AccessController.doPrivileged(Native Method)<br />
	at java.net.URLClassLoader.findClass(URLClassLoader.java:190)<br />
	at java.lang.ClassLoader.loadClass(ClassLoader.java:306)<br />
	at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:301)<br />
	at java.lang.ClassLoader.loadClass(ClassLoader.java:247)<br />
Could not find the main class: historyserver.  Program will exit.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Workflow Engines for Hadoop by Manoj</title>
		<link>http://www.crobak.org/2012/07/workflow-engines-for-hadoop/#comment-606</link>
		<dc:creator>Manoj</dc:creator>
		<pubDate>Tue, 07 Aug 2012 18:17:43 +0000</pubDate>
		<guid isPermaLink="false">http://www.crobak.org/?p=111#comment-606</guid>
		<description><![CDATA[Good stuff!]]></description>
		<content:encoded><![CDATA[<p>Good stuff!</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Getting Started with Apache Hadoop 0.23.0 by rashmi</title>
		<link>http://www.crobak.org/2011/12/getting-started-with-apache-hadoop-0-23-0/#comment-534</link>
		<dc:creator>rashmi</dc:creator>
		<pubDate>Wed, 18 Jul 2012 19:06:35 +0000</pubDate>
		<guid isPermaLink="false">http://www.crobak.org/?p=85#comment-534</guid>
		<description><![CDATA[Hi,

For hadoop-2.0.0 installation on two linux machines, what should be values of fs.defaultFS and dfs.name.dir and dfs.data.dir properties on both name nodes????

one machine hostname is rsi-nod-nsn1 and another one is rsi-nod-nsn2...

i want to make both federated namenodes.. and both should be used as datanodes too..

what should be configuration changes for the same? i am not finding masters, mapred-site.xml, and hadoop-env.sh files in hadoopHome/etc/hadoop folder... how do i make changes for these files?]]></description>
		<content:encoded><![CDATA[<p>Hi,</p>
<p>For hadoop-2.0.0 installation on two linux machines, what should be values of fs.defaultFS and dfs.name.dir and dfs.data.dir properties on both name nodes????</p>
<p>one machine hostname is rsi-nod-nsn1 and another one is rsi-nod-nsn2&#8230;</p>
<p>i want to make both federated namenodes.. and both should be used as datanodes too..</p>
<p>what should be configuration changes for the same? i am not finding masters, mapred-site.xml, and hadoop-env.sh files in hadoopHome/etc/hadoop folder&#8230; how do i make changes for these files?</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on two puppet tricks: combining arrays and local tests by Callum</title>
		<link>http://www.crobak.org/2011/02/two-puppet-tricks-combining-arrays-and-local-tests/#comment-520</link>
		<dc:creator>Callum</dc:creator>
		<pubDate>Tue, 10 Jul 2012 13:24:45 +0000</pubDate>
		<guid isPermaLink="false">http://www.crobak.org/?p=58#comment-520</guid>
		<description><![CDATA[A simpler option was posted on etherized.com which uses:
&lt;code&gt;(array1+array2+arrayX).flatten.join(&#039;,&#039;)&lt;/code&gt;

Thanks for posting this, helped me out in a bind, the solution of using inline_template works well in my case. :-)]]></description>
		<content:encoded><![CDATA[<p>A simpler option was posted on etherized.com which uses:<br />
<code>(array1+array2+arrayX).flatten.join(',')</code></p>
<p>Thanks for posting this, helped me out in a bind, the solution of using inline_template works well in my case. <img src='http://www.crobak.org/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> </p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Workflow Engines for Hadoop by Nathan Bijnens</title>
		<link>http://www.crobak.org/2012/07/workflow-engines-for-hadoop/#comment-507</link>
		<dc:creator>Nathan Bijnens</dc:creator>
		<pubDate>Thu, 05 Jul 2012 18:09:21 +0000</pubDate>
		<guid isPermaLink="false">http://www.crobak.org/?p=111#comment-507</guid>
		<description><![CDATA[Nice comparison. One addition, we use our internal Jenkins server on our test cluster as a job control engine. Quite flexible, and already completely integrated with the rest of the workflow. Of-course it lacks in certain aspects.

Nathan]]></description>
		<content:encoded><![CDATA[<p>Nice comparison. One addition, we use our internal Jenkins server on our test cluster as a job control engine. Quite flexible, and already completely integrated with the rest of the workflow. Of-course it lacks in certain aspects.</p>
<p>Nathan</p>
]]></content:encoded>
	</item>
</channel>
</rss>
