Skip to main content

Housing Data Analysis

ยท One min read
Amir Afshari

density2

This dataset is based on data from the 1990 California census. Our dataset consist of 26k samples with 10 features.

features = [longitude, latitude, housing_median_age, total_rooms, total_bedrooms, population, households, median_income, median_house_value, ocean_proximity]

Ocean Proximity Count

Ocean Proximity is our only categorical data in this dataset.

ocean_proximity ocean_proximity-pie

Where are the most populated areas?

Population Density Recongnition

density density1 density2

Correlations

Median Income vs Median House Value (Strongest Correlation)

correlations

Distribution of Features

feature-distribution

Outliers

outliers