# Big Data Lab 1

A few changes are needed for the Lab 1 code to work properly.

<br>

## Make sure `jps` shows 6 processes when executed

<br>

## Install JDK 8

1. Update packages

   ```bash
   sudo apt update -y
   sudo apt upgrade -y
   ```

2. Uninstall all other JDK versions (the patterns are quoted so the shell does not expand them against local file names)

   ```bash
   sudo apt remove --purge 'openjdk*' 'default-jdk*'
   ```

3. Install JDK 8

   ```bash
   sudo apt install -y openjdk-8-jdk
   ```

4. Check the Java version

   ```bash
   java -version
   ```

<br>

## Install JUnit

```bash
sudo apt install -y junit
```

<br>

## Edit mapred-site.xml

```bash
sudo nano $HADOOP_HOME/etc/hadoop/mapred-site.xml
```

Add the following inside the `<configuration>` tags:

```xml
<property>
  <name>yarn.app.mapreduce.am.env</name>
  <value>HADOOP_MAPRED_HOME=$HADOOP_HOME</value>
</property>
<property>
  <name>mapreduce.map.env</name>
  <value>HADOOP_MAPRED_HOME=$HADOOP_HOME</value>
</property>
<property>
  <name>mapreduce.reduce.env</name>
  <value>HADOOP_MAPRED_HOME=$HADOOP_HOME</value>
</property>
```

<br>

## Task 2

- Delete the 100MB file each time before executing the block size command

  ```bash
  hdfs dfs -rm /L1/100MB.txt
  ```

- Use the following block sizes (in bytes):
  1. 1048576 (1 MiB)
  2. 2097152 (2 MiB)
  3. 3145728 (3 MiB)
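
As a rough sanity check for Task 2, the number of HDFS blocks to expect for each block size can be estimated as the file size divided by the block size, rounded up. The sketch below assumes the file is exactly 100 MiB (104857600 bytes), which the lab notes do not state; `FILE_SIZE` and `block_count` are illustrative names, not part of the lab code.

```bash
#!/usr/bin/env bash
# Assumption: the test file is exactly 100 MiB; check the real size with `ls -l`.
FILE_SIZE=$((100 * 1024 * 1024))

# Expected number of blocks = ceil(file_size / block_size), in integer arithmetic.
block_count() {
  local file_size=$1 block_size=$2
  echo $(( (file_size + block_size - 1) / block_size ))
}

# The three block sizes listed in Task 2.
for bs in 1048576 2097152 3145728; do
  echo "block size $bs -> $(block_count "$FILE_SIZE" "$bs") blocks"
done
```

Under that assumption the three sizes give 100, 50, and 34 blocks. After uploading the file, the actual count can be compared with the output of `hdfs fsck /L1/100MB.txt -files -blocks`.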