This project requires Python 3.X and the following Python libraries installed:
To run this project, the Anaconda manager was used, in a Jupyter notebook environment.
- train_test_split -> This library is used to separate test and training data.
- DecisionTreeRegressor -> This library was used to predict each of the attributes based on all the others, to see which attributes are easier to predict.
- PCA -> This library was used to find the main components that direct the data in this dataset.
- LabelEncoder -> This library was used to code the categorical columns in a readable format for training models.
- ProfileReport -> This library was used to generate a report of the attributes present in the dataset, with the purpose of helping in cleaning the data.
- Real Estate Ads Report.pdf -> Report created from the data set, taking into account the attributes present.