Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[distsim] LIN-DEP (seems to) generate only ADJ rules (German, CONLL format) #292

Open
gilnoh opened this issue Nov 6, 2013 · 1 comment

Comments

@gilnoh
Copy link
Member

gilnoh commented Nov 6, 2013

After successful generation and redis-conversion; the lexical resource based on Lin dependency works for the German.

However, the resource has no (or almost no) nouns, or verbs. I tried with various common terms that shoud be existing; but couldn't generate any match.

For the moment, I do not know how I can iterate over all rules (or all entries), so this is just my guess: But it is quite likely that LIN-DEP resource generated with existing configuration only has ADJs. (Not even ADVs or Vs, it seems...)

Again, I can be wrong, since I don't know how to iterate over all the rules / elmements. Is this normal? (I guess not).

Also, ADJs were quite ... strange. For example, (much more common) ADJs like gut /schlete (good/ bad) does not existing, while some ADJs like recht (right) are there, etc ...

@gilnoh
Copy link
Member Author

gilnoh commented Nov 6, 2013

You can reproduce this with the following intermediate size corpus: (1/30th of SDEWAC)
http://www.cl.uni-heidelberg.de/~noh/sdewac_part01.mstparsed.utf8.conll.gz

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant