Skip to content

gt-sse-center/PatientX.AI

Repository files navigation

PatientX.AI

PatientX.AI is a tool designed to analyze and visualize patient experiences along specific treatment pathways. By scraping patient forums, running topic modeling algorithms, and constructing a "Journey map" based on these topics, PatientX.AI provides insights into common challenges, symptoms, and milestones that patients may encounter during their treatment. This tool aims to enhance both patient understanding of treatment processes and doctors' ability to provide timely support and resources.

Table of Contents

Overview

PatientX.AI offers:

  • Insight for Patients: A visual map of common experiences to help patients prepare for different stages of their treatment.
  • Support for Medical Providers: Analysis to help doctors identify critical stages where patients might benefit from additional resources, advice, or interventions.

Features

  • Topic Modeling: Uses NLP algorithms to identify common themes and group them into topics related to the treatment journey.
  • Topic Visualization: Generates visualizations to help experts interpret topics identified through topic modeling
  • Journey Map Visualization: Generates a user-friendly journey map, visually outlining typical patient experiences and symptom patterns.

Project Structure

PatientX.AI/
├── data/                                     # folder to hold data to be used by pipelines
├── existing_code/                            # folder to hold code prior to work with GT CSSE
│   ├──legacy_Data_sempling_topic_modeling.r  # R script for running pipeline
└── README.md     

Installation

  1. Clone the repository:
    git clone https://github.com/your-username/PatientX.AI.git
    

Running Pipelines

To run the analysis pipelines, use one of the following commands:

Python Pipeline

To run the Python pipeline for data sampling and topic modeling, execute:

$> python3 data_sampling_topic_modeling_2.py

Legacy R Pipeline

To run the legacy R pipeline, navigate to the existing_code folder and execute:

$> Rscript legacy_Data_sempling_topic_modeling.r

Note: Before running the R pipeline, ensure that the required R packages are installed. Refer to the comments in the legacy_Data_sempling_topic_modeling.r file for package installation instructions.