GitHub - NickPrivate/Processing-Tokenized-Data: Custom Python Parser to tokenize any kind of Python Code

Contributors && Project Demo:

Ryan Haddadi
Nicholas Goulart
Anthony LaPan
https://streamable.com/qrkcvk

Requirements:

Python 3

Install Python here:

https://www.python.org/downloads/

Dependencies:

pip install tabulate
pip install nltk

Run program on Unix:

cd into the directory that contains main.py, then run:

python3 main.py

Run program on Windows:

cd into the directory that contains main.py, then run:

python main.py

Overview

main.py is the entry point. It imports the modules used to tokenize the data, and each module contains a function that handles one step of the processing. This modular approach makes the program easier to integrate and extend.

Functionality:

file_reader.py
- Responsible for reading input data. It ensures that data is correctly loaded into the program for processing.
preprocessor.py
- Preprocesses the input data, for example by removing comments, so that it is ready for tokenization.
tokenizer.py
- Analyzes the preprocessed data to identify and extract tokens. This module is central to the program's purpose: it converts raw text into a structured format for further analysis, then sends the tokenized data to the output formatter.
output_formatter.py
- Formats the tokenized data into a table for easy readability.
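
The stages above can be sketched as three small functions. This is a minimal illustration, not the project's actual code: the function names, token categories, and regular expressions are all assumptions, and the sketch swaps in the standard library (re, keyword, and plain string formatting) for nltk and tabulate so it runs without installing anything.

```python
import keyword
import re

# Hypothetical token categories; the project's real categories may differ.
TOKEN_SPEC = [
    ("literal", r"\d+(?:\.\d+)?|\"[^\"\n]*\"|'[^'\n]*'"),
    ("identifier", r"[A-Za-z_]\w*"),
    ("operator", r"==|!=|<=|>=|[+\-*/%=<>]"),
    ("delimiter", r"[()\[\]{},:.;]"),
]
TOKEN_RE = re.compile(
    "|".join(f"(?P<{name}>{pattern})" for name, pattern in TOKEN_SPEC)
)


def remove_comments(source):
    """Drop blank lines and '#' comments (naive: ignores '#' inside strings)."""
    lines = []
    for line in source.splitlines():
        code = line.split("#", 1)[0].rstrip()
        if code:
            lines.append(code)
    return "\n".join(lines)


def tokenize(source):
    """Group unique lexemes by category, in order of first appearance."""
    tokens = {"keyword": []}
    for name, _ in TOKEN_SPEC:
        tokens[name] = []
    for match in TOKEN_RE.finditer(source):
        category, lexeme = match.lastgroup, match.group()
        # Python keywords match the identifier pattern, so reclassify them.
        if category == "identifier" and keyword.iskeyword(lexeme):
            category = "keyword"
        if lexeme not in tokens[category]:
            tokens[category].append(lexeme)
    return tokens


def format_table(tokens):
    """Render the non-empty categories as a simple two-column text table."""
    rows = [
        f"{category:<12}{', '.join(lexemes)}"
        for category, lexemes in tokens.items()
        if lexemes
    ]
    return "\n".join(["Category    Tokens"] + rows)


if __name__ == "__main__":
    sample = 'x = 10  # set x\nif x >= 5:\n    print("big")\n'
    print(format_table(tokenize(remove_comments(sample))))
```

Keeping each stage as a function that takes and returns plain data mirrors the modular layout described above: any one stage can be replaced (for example, the formatter with tabulate) without touching the others.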

Test Cases

Included in the "tests" directory is "input.txt", a test case that is read automatically when the script runs. To try your own test case, put it in this file, save the file, and run main.py.
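
As a rough sketch of how the test input might be loaded, the function below reads a file and fails with a clear message if it is missing. The function name read_source and the default path are assumptions for illustration; the project's file_reader.py may work differently.

```python
from pathlib import Path


def read_source(path="tests/input.txt"):
    """Return the file's text, or raise a clear error if it is missing.

    The default path is an assumption, not taken from the project's code.
    """
    file_path = Path(path)
    if not file_path.is_file():
        raise FileNotFoundError(f"Test input not found: {file_path}")
    return file_path.read_text(encoding="utf-8")
```

Because the path is relative, this only finds the file when the script is run from the directory that contains main.py, which matches the run instructions above.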

