This SRL corpus has 5,000 annotated sentences, which is much smaller than SRL corpora of other languages. For example, the English PropBank contains about 50,000 sentences, which is ten times larger. While smaller in size, the Vietnamese PropBank has more semantic roles than the English PropBank has – 25 roles compared to 21 roles. This makes the unavoidable data sparseness problem more severe for Vienamese SRL than for English SRL.
Model | F1 | Paper/Source | Code |
---|---|---|---|
Pham et al. 2015 | 73.53 | Pham et al. 2015 | Official |