Skip to content

Latest commit

 

History

History
23 lines (16 loc) · 735 Bytes

README.md

File metadata and controls

23 lines (16 loc) · 735 Bytes

Python Word Count from txt File

Apps that counts the number of words in a text file and graphs the occurrences, in this particular case I have taken the draft document of the new constitution of Chile and I have proceeded to clean it to be able to process it and obtain the data

Requirements for process a PDF file into a txt file and plot results:

tika==1.24 (https://tika.apache.org/)
seaborn==0.11.2 (https://seaborn.pydata.org)
numpy==1.22.3 (https://numpy.org)
pandas==1.4.2 (https://pandas.pydata.org)

Install:

pip install tika
pip install seaborn
pip install numpy
pip install pandas

Convert PDF into Txt File

Check: convertpdf_to_txtplain.py

Process and plot Data

Check: read_words.py