Welcome to the course of Big Data. This document will describe the list of software you will need in this course. Straight ahead!
Processor: Virtualization Enabled (Check Virtualization on Windows, On Linux: lscpu
). Sometimes it is disabled in BIOS.
Memory: We recommend at least 8Gb of RAM
Storage: To complete the first half of the course, you will need roughly 20Gb of free storage on your hard disk.
OS: macOS or Linux (latest Ubuntu LTS) are fine. You will have some additional issues with Hadoop and Spark on Windows, but it will work. If you want to use Linux - install it on hardware, do not use nested virtualization.
If you cannot meet these hardware requirements, please consult with your TA.
Refresh your knowledge of bash.
For the first lab, install Virtual Box, Vagrant, and Docker.
Download Vagrant Box Image from Vagrant Repository.