BigBench
This Big Data Benchmark for BigBench Specification ("Software") is maintained by Intel and Cloudera
About BigBench
The [[https://github.com/intel-hadoop/Big-Data-Benchmark-for-Big-Bench|Big Data Benchmark for Big Bench]] specification based benchmark.
Run BigBench
You can run the testsuite [[https://github.com/intel-hadoop/Big-Data-Benchmark-for-Big-Bench/wiki/HowToRunBigBench|BigBench]] referring documentation web page.
You need to edit the conf/userSettings.conf file and do the changes according to your installation E.g. Change the variable's content from
$ -export BIG_BENCH_HADOOP_CONF="/etc/hadoop/conf.cloudera.hdfs" $ -export BIG_BENCH_HADOOP_LIBS_NATIVE="/opt/cloudera/parcels/CDH/lib/hadoop/lib/native"
To the below lines
$ +export BIG_BENCH_HADOOP_CONF="/usr/local/hadoop/etc/hadoop" $ +export BIG_BENCH_HADOOP_LIBS_NATIVE="/usr/local/hadoop/lib/native"
A complete benchmark run with all stages, can be done by running (e.g., 4 map tasks, scale factor 100, 2 streams):
$ Big-Bench$ ./bin/bigBench runBenchmark -m 4 -f 100 -s 2
Issues
On Ubuntu, errors: “line 46: pssh: command not found”.
Tried installing pssh package using the command
$ sudo apt-get install pssh
and created the below softlinks under /usr/bin/
parallel-ssh is pssh parallel-scp is pscp parallel-rsync is prsync parallel-nuke is pnuke parallel-slurp is pslurp
It looks like couple of error’s or issues need tobe fixed
#1 Benchmark run terminated Reason: An error occured while running a command in phase CLEAN_ALL #2 A warning WARN util.NativeCodeLoader: Unable to load native-hadoop library for your platform... using builtin-java classes where applicable