Description

Purpose of this project is to leverage reviews about major delivery companies that are operating in the UK, and perform NLP tasks to analyze different aspects of the reviews like the sentiment, most common words, probability distributions across word sequences, and more.

Project Roadmap

graph   LR
    A[Build a tool to connect to web sources APIs] -->|Get reviews from web| B[Clean reviews]
    B --> D[Knowledge Graphs]
    B --> F[Unsupervised Clustering]
    B --> C(Sentiment Analysis)
    B --> |Identify topic of review| E[Topic Extraction]
    E -->  |Train Model| I[Assign Topic to new instances]
    C --> |Train Model| J[Sentiment Classifier]
    I --> K[Build UI]
    J --> K[Build UI]

Data Retrieval API

To get reviews from the TrustPilot website, we are leveraging a custom made web scraping tool. This tool is iterating across different pages of the website and collects the reviews and any other relevant information, with the output being stored in CSV files.

Running Guide

Set-up the appropriate configurations in config.json. The config needs to get populated with the following metadata:
- source_url: Main domain URL
- starting_page: Domain subpath to a specific reviews page
- steps: Defines number of pages to iterate over
- company: Company/Service of interest
Execute the python retriever script
python data_retriever.py

Name		Name	Last commit message	Last commit date
Latest commit History 226 Commits
.vscode		.vscode
__pycache__		__pycache__
helpers		helpers
img		img
jupyter_notebook		jupyter_notebook
processing		processing
.gitattributes		.gitattributes
.gitignore		.gitignore
LDAModellerClass.py		LDAModellerClass.py
README.md		README.md
config.json		config.json
data_retriever.py		data_retriever.py
processed_pages.txt		processed_pages.txt
reviews.csv		reviews.csv
reviews.py		reviews.py
texteda.py		texteda.py
trustplt.py		trustplt.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Description

Project Roadmap

Data Retrieval API

Running Guide

About

Releases

Packages

Languages

Mohsanaliac/Text_Analysis_of_Consumer_Reviews

Folders and files

Latest commit

History

Repository files navigation

Description

Project Roadmap

Data Retrieval API

Running Guide

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages