# Data Exploration - Features Available - Missing Values - House prices kernel estimation per geographic coordinates - Price distribution per building area - Average house price per year built - House price distribution per region # Feature Eng - Pegar no Address e separar por Address Name - Date, separar por Day, Month, Year. # Data Preparation # Modeling - Regression Trees - Bagging - Random Forest - Boosting - Kernel Approaches (SVM) # Dataset (https://www.kaggle.com/anthonypino/melbourne-housing-market) - Suburb: Suburb - Address: Address (eliminar) - Rooms: Number of rooms - Price: Price in Australian dollars - Method: - S - property sold; - SP - property sold prior; - PI - property passed in; - PN - sold prior not disclosed; - SN - sold not disclosed; - NB - no bid; - VB - vendor bid; - W - withdrawn prior to auction; - SA - sold after auction; - SS - sold after auction price not disclosed. - N/A - price or highest bid not available. - Type: - br - bedroom(s); - h - house,cottage,villa, semi,terrace; - u - unit, duplex; - t - townhouse; - dev site - development site; - o res - other residential. - SellerG: Real Estate Agent - Date: Date sold - Distance: Distance from CBD in Kilometres - Regionname: General Region (West, North West, North, North east …etc) - Propertycount: Number of properties that exist in the suburb. - Bedroom2 : Scraped # of Bedrooms (from different source) - Bathroom: Number of Bathrooms - Car: Number of carspots - Landsize: Land Size in Metres - BuildingArea: Building Size in Metres - YearBuilt: Year the house was built - CouncilArea: Governing council for the area - Lattitude: Self explanitory - Longtitude: Self explanitory