Skip to content

cloudbopper/perysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

33 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

periodicals-analysis

Description

This repository implements the optical character recognition (OCR), natural-language processing (NLP) and topic modeling of historical organic periodicals.

Requirements

I recommend using virtualenv/virtualenvwrapper to install/manage the python modules.

Install

Use Pip to install directly from GitHub.

pip install git+https://github.com/cloudbopper/perysis.git@master#egg=perysis

Add '-e' for an editable install.

Running

Preprocessing OCR-ed text

python -m perysis.preprocess -input_dir documents_raw -output_dir documents_processed

License

periodicals-analysis is free, open source software, released under the MIT license. See LICENSE for details.

Contact

Akshay Sood

About

Analysis of historical organic periodicals.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published