This repo created for show an example for text normalization on a dataset.
For use this code, you should install the nltk package with pip install nltk
This code, normalize raw text data with nltk package and tokenize a sample of data for show you an example. you can use this, for any other datasets with change the section of load dataset.