# Hadoop practice note 1-1 * Win7 * VirtualBox 6.1.0 * HDP 2.5 ## :memo: Where do I start? ### Step 1: Download HDP ova * [VirtualBox - HDP 2.5](https://archive.cloudera.com/hwx-sandbox/hdp/hdp-2.5.0/HDP_2.5_virtualbox.ova) * [VirtualBox - HDP 2.6.5](https://archive.cloudera.com/hwx-sandbox/hdp/hdp-2.6.5/HDP_2.6.5_virtualbox_180626.ova) * [VirtualBox_Extension_Pack-6.1.0](https://www.virtualbox.org/wiki/Downloads) ### Step 2: Import application to virtualBox  ### Step 3: 啟動VM  ### Step 4: 啟動HDP * 開起網頁: http://127.0.0.1:8888 * 帳號: maria_dev/ maria_dev * Admin: admin/ hadoop   ### step 5: 上傳Data * 利用Hive上傳u.data和u.item,並命名為ratings和movie_id。  * 檢查上傳是否成功 * 裡用SQL下指令 ``` SQL= SELECT movie_id, count(movie_id) as ratingCount FROM ratings GROUP BY movie_id ORDER BY ratingCount DESC; ```  * 使用vitualization  ## :interrobang: Error msg 1. 啟動VM時的錯誤訊息 * 內容:Unable to allocate and lock memory. The virtual machine will be paused. Please close applications to free up memory or close the VM. 錯誤 ID:HostMemoryLow 嚴重性:非致命錯誤 * 解決方式:升級virtualBox到6.1.0 2. 使用vitualization出現以下訊息畫面  * 解決方式:SQL重新run一變。 ###### tags: `DE`,`Hadoop`
×
Sign in
Email
Password
Forgot password
or
By clicking below, you agree to our
terms of service
.
Sign in via Facebook
Sign in via Twitter
Sign in via GitHub
Sign in via Dropbox
Sign in with Wallet
Wallet (
)
Connect another wallet
New to HackMD?
Sign up