Skip to content

This tool is designed for comprehensive text data cleaning and pre-processing. Built to support data scientists and NLP practitioners, it improves the process of preparing text data for analysis and machine learning models.

License

Notifications You must be signed in to change notification settings

AbdullahHDev/NLPToolkit

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

NLPToolkit: Advanced Text Preprocessing App


About The Project

NLPToolkit is designed to simplify the preprocessing of text data for natural language processing (NLP) applications. It offers a wide range of text cleaning and preprocessing functionalities including tokenization, stopword removal, stemming, lemmatization, and other advanced cleaning techniques.

Built With

Python NLTK Streamlit

Getting Started

This section provides instructions to get you started with setting up and running the NLPToolkit app on your local machine.

Prerequisites

  • Python 3.x
  • Pip package manager

Installation

  1. Clone the repository: git clone https://github.com/AbdullahHDev/NLPToolkit.git
  2. Install required Python packages: pip install -r requirements.txt
  3. Run the Streamlit application: streamlit run app.py

Usage

  1. Through the app's UI, upload your CSV file containing text data.
  2. Select the text column you wish to preprocess.
  3. Choose the preprocessing steps you want to apply.
  4. Download the preprocessed text data for further analysis or model training.

License

Distributed under the MIT License. See LICENSE for more information.

Contact

Abdullah Hussain - [email protected]

Project Link: https://github.com/AbdullahHDev/NLPToolkit

Acknowledgments

About

This tool is designed for comprehensive text data cleaning and pre-processing. Built to support data scientists and NLP practitioners, it improves the process of preparing text data for analysis and machine learning models.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages