Skip to content
View HNStaggs's full-sized avatar

Highlights

  • Pro

Block or report HNStaggs

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
HNStaggs/README.md

Hi there, I'm Halee Staggs, welcome to my Git 👋

About Me

I'm a full stack AI developer, data scientist, and project manager. I am passionate about asking creative questions and using data to solve problems and reveal insights. I find it intriguing that human behavior and processes can be explained with algorithms. I am currently passionate about the new frontier of implementing AI solutions like LLMs in healthcare and research!

Skills

  • Programming Languages: Python, R, SQL
  • AI Techniques: LLM/SLM Fine Tuning, RAG, Synthetic Data Generation (VAE), Recommender Systems, Machine Learning
  • Libraries: PyTorch, Keras, TensorFlow, ScikitLearn, SQAlchemy, NLTK, Tidyverse, Caret, Car, e1071
  • Tools: Git, MySQL Workbench, R Studio, Jupyter Notebook
  • Statistics: SAS, SPSS
  • Data Vis/Dashboards: Tableau, Power BI, R Shiny
  • Front End: Streamlit, PyQt5
  • Cloud: AWS, GCP, Microsoft Azure
  • Back End: Database Engineering, ETL, APIs
  • Replicability & Full Stack Integration: Docker, Kubernetes

Projects

  • Long COVID - Developed and evaluated models using 50 biopsychosocial features to classify long COVID diagnosis and predict cognitive complaint severity. Analyzed long COVID biotypes using k-means clustering of symptom profiles.
  • Gender Gaps in Clinical Trials - Use natural language processing (NLP) of registered clinical trial descriptions (clinicaltrials.gov API) to assess differences in sex representation using classification model coefficients.
  • Participant Dropout Factors - Created a decision tree classifier to identify demographics of participants likely to drop out of online research.
  • COVID Economic Database - Created an ETL pipeline of covid-related, global, macroeconomic data.
  • Pharmaceutical Drug Surveillance - Created an ETL pipeline of adverse drug event information to create a database that supports machine learning objectives, dashboarding, and model deployment via Streamlit application.
  • Political Discourse Sentiment Analysis - Leveraged AWS cloud computing and natural language processing to capture public sentiment of presidential candidates from Twitter tweets and New York Times article comments.

Education

  • Master of Science in Applied Data Science, University of San Diego
  • Bachelor of Science in Psychology/Biological Science, San Jose State University
  • Associate of Arts in Social and Behavioral Sciences, Moorpark College

Contact Me

Interests

  • Professional Interests: Machine learning, data engineering
  • Hobbies: Weight lifting, pilates, traveling, spending time with my dog, Luna :)

Let's Connect!

Feel free to reach out if you want to collaborate on a project or just want to connect!


"You cannot get clean in a dirty shower."

Pinned Loading

  1. COVID_Economic_Database COVID_Economic_Database Public

    Jupyter Notebook

  2. Participant-Dropout-Classification Participant-Dropout-Classification Public

    Decision tree to classifiy participant dropout.

  3. Political-Discourse-NLP-AWS Political-Discourse-NLP-AWS Public

    Jupyter Notebook 1

  4. Clinical-Trial-Gender-Gaps Clinical-Trial-Gender-Gaps Public

    Jupyter Notebook 1 1

  5. teamlunarlanding/Pharma-Drug-Surveillance teamlunarlanding/Pharma-Drug-Surveillance Public

    Jupyter Notebook