Text Classification with Neural Networks

This project demonstrates text classification with neural networks. The task is to classify movie reviews from the IMDB dataset as positive or negative, using only the review text.

Dataset

The project uses the IMDB dataset, which contains 50,000 movie reviews. Each record has two fields:

review: The textual content of the review.
sentiment: The label indicating the sentiment of the review (positive or negative).

You can download the dataset from Kaggle.
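
As a rough illustration of the two-field layout described above, the sketch below reads a CSV with `review` and `sentiment` columns using Pandas and adds a numeric label. The sample rows, the `load_reviews` helper, and the `label` column name are hypothetical and exist only for this example; the notebook may load and encode the data differently.

```python
import io

import pandas as pd

# Hypothetical two-row sample in the same two-column layout as the dataset.
SAMPLE_CSV = io.StringIO(
    "review,sentiment\n"
    '"A wonderful, moving film.",positive\n'
    '"Dull plot and flat acting.",negative\n'
)


def load_reviews(source):
    """Read the review CSV and add a numeric label (1 = positive, 0 = negative)."""
    df = pd.read_csv(source)
    df["label"] = df["sentiment"].map({"positive": 1, "negative": 0})
    return df


df = load_reviews(SAMPLE_CSV)
print(df[["sentiment", "label"]])
```

For the real data, `load_reviews` would be called with the path to the downloaded CSV file instead of the in-memory sample.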

Project Structure

Data Preprocessing:
- Remove special characters and HTML tags.
- Convert text to lowercase.
- Remove stop words using NLTK.
- Tokenize and pad sequences.
Model Building:
- Utilize embedding layers for word representation.
- Build a sequential neural network with layers like LSTM and Dense.
Evaluation:
- Split data into training and testing sets.
- Train the model using the training set and evaluate performance on the test set.
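
The preprocessing and splitting steps above can be sketched in plain Python. This is a minimal illustration and not the notebook's code: a small hard-coded stop-word set stands in for NLTK's list so the example runs without a download, and the helper names, the `max_len` value, the 80/20 split, and the left-padding choice are all assumptions.

```python
import random
import re
from collections import Counter

# Assumption: a tiny stand-in stop-word set; the project itself uses NLTK's list.
STOP_WORDS = {"a", "an", "the", "is", "was", "it", "and", "of", "this", "to", "but"}


def clean_text(text):
    """Strip HTML tags and special characters, lowercase, drop stop words."""
    text = re.sub(r"<[^>]+>", " ", text)       # remove HTML tags such as <br />
    text = re.sub(r"[^a-zA-Z\s]", " ", text)   # keep letters and whitespace only
    words = text.lower().split()
    return [w for w in words if w not in STOP_WORDS]


def build_vocab(token_lists):
    """Map each word to an integer id; 0 is reserved for padding."""
    counts = Counter(w for tokens in token_lists for w in tokens)
    return {word: i + 1 for i, (word, _) in enumerate(counts.most_common())}


def encode_and_pad(tokens, vocab, max_len):
    """Convert tokens to ids, keep the first max_len, left-pad with zeros."""
    ids = [vocab[w] for w in tokens if w in vocab][:max_len]
    return [0] * (max_len - len(ids)) + ids


def train_test_split(rows, test_fraction=0.2, seed=42):
    """Shuffle rows reproducibly and split them into (train, test)."""
    rows = list(rows)
    random.Random(seed).shuffle(rows)
    n_test = int(len(rows) * test_fraction)
    return rows[n_test:], rows[:n_test]


# Hypothetical sample reviews, used only to exercise the helpers.
reviews = [
    "This movie was GREAT!<br />Loved it.",
    "Terrible pacing, and the ending is weak...",
]
tokens = [clean_text(r) for r in reviews]
vocab = build_vocab(tokens)
padded = [encode_and_pad(t, vocab, max_len=6) for t in tokens]
train_rows, test_rows = train_test_split(range(10))
```

Every padded sequence has the same length, which is what lets the reviews be stacked into a single array for the embedding layer.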

Technologies Used

Python
Keras (for deep learning models)
NLTK (for text preprocessing)
Pandas & NumPy (for data manipulation)
Google Colab (for execution environment)

Model Architecture

The model consists of the following components:

Embedding Layer: To convert words into dense vectors.
LSTM Layer: For capturing sequential dependencies in the text.
Dense Layers: For classification output.
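
A model with these three components might look like the Keras sketch below. The vocabulary size, sequence length, layer widths, optimizer, and loss are assumptions chosen for illustration; the notebook's actual hyperparameters may differ.

```python
from tensorflow import keras
from tensorflow.keras import layers

# Assumed values for illustration only; not taken from the notebook.
VOCAB_SIZE = 10000   # number of distinct word ids, including the padding id 0
MAX_LEN = 200        # length of each padded review
EMBED_DIM = 64       # size of each word vector


def build_model():
    """Embedding -> LSTM -> Dense stack for binary sentiment classification."""
    model = keras.Sequential([
        keras.Input(shape=(MAX_LEN,)),
        layers.Embedding(VOCAB_SIZE, EMBED_DIM),   # word ids -> dense vectors
        layers.LSTM(64),                           # summarizes the sequence
        layers.Dense(32, activation="relu"),
        layers.Dense(1, activation="sigmoid"),     # probability of "positive"
    ])
    model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])
    return model


model = build_model()
model.summary()
```

The single sigmoid output unit suits a two-class problem: values near 1 indicate a positive review and values near 0 a negative one.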

Usage

Clone this repository:

git clone https://github.com/thanghd1112/Text-Classification-with-Neural-Networks.git

Open the 07_textClassification.ipynb notebook.
Place the IMDB dataset file at the path the notebook expects.
Run the notebook cells sequentially.

Results

The model achieves high accuracy on the test set. The exact figures (accuracy and loss) appear in the notebook's output cells.
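
For readers unfamiliar with the two metrics, the sketch below shows how accuracy and loss can be computed from predicted probabilities. It assumes binary cross-entropy as the loss, which is a common choice for two-class problems but is not confirmed by this README, and the labels and probabilities are made-up values, not results from the notebook.

```python
import math


def accuracy(y_true, y_prob, threshold=0.5):
    """Fraction of examples whose thresholded prediction matches the label."""
    correct = sum(int(p >= threshold) == y for y, p in zip(y_true, y_prob))
    return correct / len(y_true)


def binary_cross_entropy(y_true, y_prob, eps=1e-7):
    """Mean negative log-likelihood of the true labels."""
    total = 0.0
    for y, p in zip(y_true, y_prob):
        p = min(max(p, eps), 1 - eps)   # clip to avoid log(0)
        total += -(y * math.log(p) + (1 - y) * math.log(1 - p))
    return total / len(y_true)


# Hypothetical labels and predicted probabilities, for illustration only.
y_true = [1, 0, 1, 0]
y_prob = [0.9, 0.2, 0.4, 0.1]

acc = accuracy(y_true, y_prob)
loss = binary_cross_entropy(y_true, y_prob)
print(f"accuracy={acc:.2f} loss={loss:.4f}")
```

Accuracy only counts whether each prediction lands on the right side of the threshold, while the loss also penalizes correct but unconfident predictions.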
