Building Blocks

Below is a list of the corpus tools we use at Mind Mimic Labs. They are intended to be building blocks for both general research in our lab as well as publication boilerplate. Each tool should be considered stand-alone and includes both code (~/code) and documentation (~/docs). There is a combined requirements.txt file for all the tools found in the root of the repo. The documentation will include both instructions as to what the code is for, how to run it, and what publication boilerplate to put in the Methods and Materials section.

Scripts

Unless otherwise noted, all scripts follow the same execution path.

Open a command prompt
Change into the ~/code folder.
Run python {{scriptname}}.py -in d:/corpus_in -out d:/corpus_out. You should change the input and output paths as desired.

The list of current scripts is below. In general, you want to first run documents-to-corpus, then other scripts. Individual papers/projects/repos will instruct on the exact order in their README.md's Tabula Rasa section.

Data Pre-Processing

Formatting

Deep Learning

Misc

Name		Name	Last commit message	Last commit date
Latest commit History 46 Commits
code		code
docs		docs
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Building Blocks

Scripts

About

Contributors 2

Languages

License

MindMimicLabs/building-blocks

Folders and files

Latest commit

History

Repository files navigation

Building Blocks

Scripts

About

Topics

Resources

License

Stars

Watchers

Forks

Contributors 2

Languages