Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Alternate cleaned up pipeline ready for integration #2

Open
wants to merge 54 commits into
base: master
Choose a base branch
from

Conversation

alecristia
Copy link
Collaborator

This version of the pipeline includes:

  • code optimization for running time (corpus extraction done once), improved dealing with foreign words
  • final version of cleaning to increase comparability across morphemes and words
  • final version of phonologization with clearer rule annotation
  • final version of cutting, occurring after phonologization
  • no real changes to analyses
  • simpler results collapse
  • analyses in knittable version

Remain to be sorted:

  • code for corpus extraction works in mac but not linux
  • cutting does not match across Chintang and Japanese for corpus length
  • analyses running so there could be bugs not yet spotted

alecristia and others added 30 commits April 13, 2018 17:10
…ne works; suspect syntax error due to my meddling
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants