Summary of projects built for my portfolio. See each repo/folder for more details.
Better UI version here.
- Causal impact analysis of marriage on mental health - Analysis of how marriage affects the chances of having depression, based on NLS (National Longitudinal Survey) data. The analysis used traditional methods (simple logistic regression, propensity score matching (PSM), inverse probability weighting (IPW)) and newer methods (T-learner, double machine learning) with DoWhy & EconML.
- Dashboarding in various tools - Created the same kind of dashboard in different tools to compare & contrast them. The pros & cons are written up on Medium. GitHub stores the dashboard code for Panel and Plotly, both of which were deployed on GCP.
- Applying ML techniques to classify ASD writers using only part-of-speech information - Project to classify Reddit posts written by people with ASD (Autism Spectrum Disorder) using only POS tags (e.g. noun, verb), without revealing the content of the posts. Tested classical classifiers (logistic regression, LightGBM, random forest, etc.) as well as a neural network-based model (RNN).
- Correlation analysis of World Happiness Report data with other open data - A hypothetical project that runs a deeper analysis of the World Happiness Report data alongside other open data from the World Bank (e.g. income, health expenditure) to see which KPIs/signals correlate with people's happiness.
- Quantitative analysis of words frequently used by ChatGPT-as-an-editor - Prompted ChatGPT to edit data science articles on Medium and analyzed the differences in word frequency before and after the ChatGPT edits. Also analyzed the change in word frequency between 2022 (pre-ChatGPT) and 2023 (post-ChatGPT) to check for similar trends.
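
As a rough illustration of the before/after word-frequency comparison in the ChatGPT-as-an-editor project, here is a minimal sketch using only the Python standard library. It is not the code from the repo: the function names, the per-1,000-words normalization, and the two sample sentences are assumptions made up for this example.

```python
import re
from collections import Counter


def word_counts(text):
    """Lowercase the text and count word tokens (letters and apostrophes)."""
    return Counter(re.findall(r"[a-z']+", text.lower()))


def frequency_change(before, after):
    """Return {word: change in rate per 1,000 words} from `before` to `after`.

    Hypothetical helper: the normalization is an assumption, not
    necessarily the metric used in the actual project.
    """
    before_counts = word_counts(before)
    after_counts = word_counts(after)
    # Guard against empty input so the division below never fails.
    before_total = sum(before_counts.values()) or 1
    after_total = sum(after_counts.values()) or 1

    changes = {}
    for word in set(before_counts) | set(after_counts):
        before_rate = 1000 * before_counts[word] / before_total
        after_rate = 1000 * after_counts[word] / after_total
        changes[word] = after_rate - before_rate
    return changes


# Made-up sample texts, not taken from the project's data.
original = "We use a simple model to look at the data."
edited = "We leverage a simple model to delve into the data."

changes = frequency_change(original, edited)
for word in sorted(changes, key=changes.get, reverse=True)[:3]:
    print(word, round(changes[word], 1))
```

Words with a positive change became more frequent after editing, and words with a negative change became less frequent. Normalizing by text length keeps the comparison fair when the edited version is longer or shorter than the original.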