NLPToolkit is designed to simplify the preprocessing of text data for natural language processing (NLP) applications. It offers a wide range of text cleaning and preprocessing functionalities including tokenization, stopword removal, stemming, lemmatization, and other advanced cleaning techniques.
This section provides instructions to get you started with setting up and running the NLPToolkit app on your local machine.
- Python 3.x
- Pip package manager
- Clone the repository:
git clone https://github.com/AbdullahHDev/NLPToolkit.git
- Install required Python packages:
pip install -r requirements.txt
- Run the Streamlit application:
streamlit run app.py
- Through the app's UI, upload your CSV file containing text data.
- Select the text column you wish to preprocess.
- Choose the preprocessing steps you want to apply.
- Download the preprocessed text data for further analysis or model training.
Distributed under the MIT License. See LICENSE
for more information.
Abdullah Hussain - [email protected]
Project Link: https://github.com/AbdullahHDev/NLPToolkit