House Sales in King County, USA
===

Predict house prices using kNN in R

**[English](https://hackmd.io/s/SJ2hlMQz7)** **[中文版](https://hackmd.io/s/BJGOjXWRM)**

## Dataset

![](https://i.imgur.com/XJC8itL.png)

Source: [Kaggle](https://www.kaggle.com/harlfoxem/housesalesprediction)

## Why choose the King County dataset from Kaggle?

:::info
Ans: Taiwan's government open data platform lacks sufficiently complete data on this topic.
Also, because this is an open source project, I hope to establish a **complete forecasting paradigm**.
:::

## Program environment

> **R** language
> **LINUX**

:::warning
**I originally wanted to use RStudio, but after discussing it with the TA, I decided that developing in R directly on Linux would be the better choice:**
- R has many packages that are only available on Linux
- This project is meant to be an open source project
- I do not know whether anyone will use this project in the future, so keeping it open source under Linux is better :smiley:
- Building the environment under Linux lets me explore more unfamiliar areas :100:
:::

## Schedule

```mermaid
gantt
    title A Gantt Diagram
    section Build
    build R :a0, 2018-05-05, 18d
    section data_preprocessing
    database :2018-05-13, 10d
    preprocess :2018-05-23, 10d
    section Train/Test
    select feature :2018-06-02, 10d
    Improve accuracy :2018-06-12, 12d
```

## Build R on Linux

Reference: [Technical documents](https://hackmd.io/s/r1R3MRkgQ)

## Preprocess

Reference: [Technical documents](https://hackmd.io/s/r1R3MRkgQ)

## Reference

[R語言進行機器學習方法及實例(一)](https://hk.saowen.com/a/57cac19c62d7cc640dae3471360106c1d0475e09d4c9032621124c9242335c86) (Machine Learning Methods and Examples in R, Part 1)

---