Hadoop Tutorials and Extra Features
Use the commands and tutorials below to test I/O speed, debug and profile jobs, analyze job logs for performance problems, and more.
How-tos
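Test I/O speed
Use the following command to measure HDFS write throughput. It runs the TestDFSIO benchmark, which here writes one file (-nrFiles 1) of 2000 MB (-fileSize 2000).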
hadoop jar /usr/lib/hadoop/hadoop-test-0.20.2+737.jar TestDFSIO -write -nrFiles 1 -fileSize 2000
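A write test is usually followed by a read test over the same files and then a cleanup of the benchmark data. The sketch below only prints the three command lines so they can be reviewed first. The helper name dfsio_cmd and the TEST_JAR variable are illustrative assumptions, not part of Hadoop; the jar path is the one from the command above and may differ on your installation.

```shell
#!/bin/sh
# Path to the Hadoop test jar, copied from the command above.
# Override TEST_JAR if your installation keeps the jar elsewhere.
TEST_JAR="${TEST_JAR:-/usr/lib/hadoop/hadoop-test-0.20.2+737.jar}"

# Hypothetical helper: print the TestDFSIO command line for one mode.
# Arguments: mode (write, read, or clean), number of files, file size in MB.
dfsio_cmd() {
    dfsio_mode="$1"
    nr_files="${2:-1}"
    file_size_mb="${3:-2000}"
    case "$dfsio_mode" in
        write|read)
            echo "hadoop jar $TEST_JAR TestDFSIO -$dfsio_mode -nrFiles $nr_files -fileSize $file_size_mb"
            ;;
        clean)
            # -clean removes the benchmark files and takes no size arguments.
            echo "hadoop jar $TEST_JAR TestDFSIO -clean"
            ;;
        *)
            echo "unknown mode: $dfsio_mode" >&2
            return 1
            ;;
    esac
}

# Print the full cycle: write, read the same files back, then clean up.
for op in write read clean; do
    dfsio_cmd "$op" 1 2000
done
```

To run a step rather than print it, pipe the output to a shell, for example dfsio_cmd read 1 2000 | sh. Run the read test only after a write test with the same -nrFiles and -fileSize values, since it reads back the files the write test created.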
Tutorials
Debugging and profiling with IsolationRunner
See: https://hadoop.apache.org/docs/r1.2.1/mapred_tutorial.html#IsolationRunner
Vaidya log file performance analysis
See: https://hadoop.apache.org/docs/r1.2.1/vaidya.html
Profiling
See: http://hadoop.apache.org/docs/r1.2.1/mapred_tutorial.html#Profiling
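As a rough illustration of what the linked section describes, profiling can be switched on for a single run by passing the mapred.task.profile properties on the command line. This is a minimal sketch that only prints the command. It assumes the job's main class parses generic options (for example through ToolRunner); the helper name profile_opts and the names my-job.jar, MyJob, input, and output are placeholders, not real artifacts.

```shell
#!/bin/sh
# Hypothetical helper: build the -D options that enable task profiling.
# Arguments: range of map task IDs, range of reduce task IDs to profile.
profile_opts() {
    maps="${1:-0-2}"
    reduces="${2:-0-2}"
    printf '%s\n' "-D mapred.task.profile=true -D mapred.task.profile.maps=$maps -D mapred.task.profile.reduces=$reduces"
}

# my-job.jar, MyJob, input, and output are placeholders for your own job.
# Generic -D options go before the job's own arguments.
echo "hadoop jar my-job.jar MyJob $(profile_opts 0-1 0) input output"
```

Keeping the task ranges small limits the profiling overhead to a few tasks per job.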
Debug scripts
See: http://hadoop.apache.org/docs/r1.2.1/mapred_tutorial.html#Debugging
Features
The distributed cache
See: http://hadoop.apache.org/docs/r1.2.1/mapred_tutorial.html#DistributedCache
The HDFS rebalancing feature
See: http://hadoop.apache.org/docs/r1.2.1/hdfs_user_guide.html#Rebalancer
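For orientation, the rebalancer in this Hadoop line is started with hadoop balancer and an optional -threshold percentage. The sketch below prints that command after a basic sanity check on the threshold. The helper name balancer_cmd is an illustrative assumption, and restricting the value to whole numbers from 1 to 100 is a conservative choice made here, not a rule taken from the Hadoop documentation.

```shell
#!/bin/sh
# Hypothetical helper: print the balancer command for a given threshold.
# Argument: threshold in percent (defaults to 10).
balancer_cmd() {
    threshold="${1:-10}"
    # Reject anything that is not a whole number (assumption for this sketch).
    case "$threshold" in
        ''|*[!0-9]*)
            echo "threshold must be a whole number" >&2
            return 1
            ;;
    esac
    if [ "$threshold" -lt 1 ] || [ "$threshold" -gt 100 ]; then
        echo "threshold must be between 1 and 100" >&2
        return 1
    fi
    echo "hadoop balancer -threshold $threshold"
}

# Print the command for a 5 percent threshold.
balancer_cmd 5
```

A lower threshold asks for a more even spread of blocks across DataNodes, at the cost of a longer balancing run.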