diff --git a/slides.html b/slides.html
new file mode 100644
index 0000000..c2ecd0e
--- /dev/null
+++ b/slides.html
@@ -0,0 +1,129 @@
+
PreFer data challenge
\ No newline at end of file
diff --git a/slides.md b/slides.md
new file mode 100755
index 0000000..b566982
--- /dev/null
+++ b/slides.md
@@ -0,0 +1,137 @@
+---
+marp: true
+author: Simone Meneghello, Alessio Piraccini, Gianluca Tori
+size: 4:4
+theme: gaia
+title: PreFer data challenge
+---
+
+# First slide :)
+
+---
+
+# Initial considerations
+
+- **Recent Data Priority**: Key insights for predicting fertility (2021-2023) likely stem from surveys around 2021. Older data may have weaker predictive power. Focus on recent surveys, supplemented by relevant past data.
+
+- **Feature Selection**: Despite numerous survey questions, only a few features may be crucial. Identifying and selecting these key features is essential to streamline the dataset.
+
+- **Handling Missing Values**: Missing data might not be random and could hold valuable information. Some missing responses result from survey branching logic. Longitudinal data may allow filling gaps with past information.
+
+- **Key Predictors**: Likely predictors include age, relationship status, economic situation, and existing children. However, the dataset's breadth may reveal unexpected patterns.
+
+---
+
+# Data exploration
+
+- **Task Overview**: Classify if a person will have a child within 2021-2023 using 2020 data.
+
+- **Dataset Details**:
+ - Initial dataset: 6,418 rows, 31,634 columns.
+ - Cleaned dataset: 987 rows, 25,868 columns (removed rows with missing outcomes and columns with all missing values).
+
+- **Background Dataset**:
+ - Longitudinal format with repeated entries per subject across years.
+ - Filtered to include subjects with available outcomes.
+ - Contains key predictive features (e.g., age, income, civil status).
+
+---
+
+# Tree selection
+
+- Used a univariate decision tree with stratified 5-fold cross-validation for feature evaluation.
+- Decision tree chosen for handling missing data and categorical features efficiently.
+- Quick iteration over features due to optimized implementations and manageable data size.
+
+![gas](./saved/tree_selection_big_year.png)
+
+- Results show F1 scores by survey year, with a random predictor as a baseline.
+- Performance decreases over time, supporting the relevance of recent surveys.
+
+---
+
+## Tree Selection: 2020 Focus
+
+- Further explored 2020 variables using the univariate tree method.
+
+![gas](./saved/tree_selection_2020.png)
+
+- Most surveys, except "Family & Household," have few features with strong predictive power.
+- Confirms that only a few features are crucial for prediction.
+
+---
+
+# Missing values
+
+Hi there!
+
+---
+
+# Submission history
+
+Hi there!
+
+---
+
+# Exploratory data analysis
+
+Hi there!
+
+![gas](./saved/correlation_matrix.png)
+
+---
+
+# Exploratory data analysis
+
+Hi there!
+
+![gas](./saved/eda_feature_plots.png)
+
+---
+
+# Exploratory data analysis
+
+Hi there!
+
+![gas](./saved/eda_feature_plots.png)
+
+---
+
+# Modeling
+
+Hi there!
+
+---
+
+# Model interpretation
+
+Hi there!
+
+![gas](./saved/shap_bar.png)
+
+---
+
+# Model interpretation
+
+Hi there!
+
+![gas](./saved/shap_scatter_plots.png)
+
+---
+
+# Final considerations
+
+Hi there!
+
+---
+
+# Appendix: Big five
+
+Hi there!
+
+---
+
+# Appendix: Big five
+
+Hi there!
+