Stochastic Nonsense

Put something smart here.

Useful Tweaks for Hadoop on EMR

more ram for the workers: modify mapred-site.xml and add

1
<property><name>mapred.child.java.opts</name><value>-Xmx3192m</value></property>

To push the changes to all the machines, use the script to modify mapper or reducer count on a running emr cluster.