In this project we trained a pre-built model (BERT) with the Hugging Face transformers library on a dataset of tweets labeled for sarcasm (sarcastic/not sarcastic). See link for details on the training data. We then used this model to predict the class (sarcastic/not sarcastic) of a set of provided unlabeled tweets and compared the results to a competitive baseline score, which our model beat. See link for a full report.
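For illustration, here is a minimal sketch of this kind of fine-tuning pipeline using the transformers Trainer API. The checkpoint name (bert-base-uncased), the hyperparameters, the column names, and the toy in-memory examples are assumptions made for the sketch, not the exact settings used in Bert.ipynb; the real training and test data live under ./data/.

```python
# Minimal sketch of fine-tuning BERT for binary sarcasm classification.
# Checkpoint, hyperparameters, and the toy dataset are illustrative assumptions.
import numpy as np
from datasets import Dataset
from transformers import (
    AutoModelForSequenceClassification,
    AutoTokenizer,
    Trainer,
    TrainingArguments,
)

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=2  # 0 = not sarcastic, 1 = sarcastic
)

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, max_length=128, padding="max_length")

# Toy in-memory examples standing in for the labeled tweets in ./data/.
train_ds = Dataset.from_dict(
    {"text": ["oh sure, flawless plan...", "lovely weather today"], "label": [1, 0]}
).map(tokenize, batched=True)

trainer = Trainer(
    model=model,
    args=TrainingArguments(
        output_dir="bert-sarcasm",
        num_train_epochs=3,
        per_device_train_batch_size=16,
    ),
    train_dataset=train_ds,
)
trainer.train()

# Predict classes for unlabeled tweets, as done to produce answer.txt.
test_ds = Dataset.from_dict({"text": ["what a totally normal day"]}).map(tokenize, batched=True)
predicted_labels = np.argmax(trainer.predict(test_ds).predictions, axis=-1)
```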
./Alternative Methods & Models/
: Contains additional models that we built and trained but which were unsuccessful at beating the baseline score.

./data/
: Contains the test and train data provided for the competition.

./Project Documentation/
: Contains the final report, demo, and other project deliverables.

answer.txt
: Our final file containing the classifications of the test tweets, which outperformed the baseline's F1 score.

Bert.ipynb
: Our notebook file in which we build, train, and test the model.

TEXT_PREPROCESSING.py
: A dependency of Bert.ipynb, used to preprocess the text for tokenization; a sketch of this kind of preprocessing follows below.
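TEXT_PREPROCESSING.py itself is not reproduced here; the following is a hypothetical sketch of the kind of tweet cleanup such a preprocessing step might perform before tokenization. The function name and the specific cleanup rules are assumptions, not the file's actual contents.

```python
# Hypothetical tweet preprocessing of the kind TEXT_PREPROCESSING.py performs;
# the real function names and cleanup rules in that file may differ.
import re

def preprocess(tweet: str) -> str:
    """Normalize a raw tweet before it is passed to the BERT tokenizer."""
    text = tweet.lower()
    text = re.sub(r"https?://\S+", "", text)  # strip URLs
    text = re.sub(r"@\w+", "@user", text)     # anonymize user mentions
    text = re.sub(r"#(\w+)", r"\1", text)     # keep hashtag words, drop '#'
    text = re.sub(r"\s+", " ", text).strip()  # collapse whitespace
    return text

print(preprocess("Wow, @BobSmith, GREAT plan... #sarcasm https://t.co/xyz"))
# -> "wow, @user, great plan... sarcasm"
```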