Multilingual Question Answering Project

Reader Phase

We used XLM-R-base to fine-tuning on 2 source datasets:

UIT-ViQuAD
MLQA (https://github.com/facebookresearch/MLQA)

We used augmentation technique to enhancing model performance. Specifically, we papraphased the questions in the data by:

    Vietnamese question --(Translate)-> Chinese question --(Translate)-> Vietnamese question
    For exmaple: 
      Origin : Qua đầu thế kỷ 21, Jackson bắt đầu hợp tác cùng các nhà soạn nhạc nổi tiếng nào?
      After paraphasing: Jackson bắt đầu hợp tác với những nhà soạn nhạc nổi tiếng nào vào đầu những năm 2000?

Deep Translator is used [https://github.com/nidhaloff/deep-translator] to call Google API.

Our model can work well in some questions type like What, When, Where in both English and Vietnamese.

Usage:

from transformers import pipeline

# My check-point 've already push to huggingface. 
model_checkpoint = "chieunq/XLM-R-base-finetuned-uit-vquad-1"
question_answerer = pipeline("question-answering", model=model_checkpoint)

context = """
Nhóm của chúng tôi là sinh viên năm 4 trường ĐH Công Nghệ - ĐHQG Hà Nội. Nhóm gồm 3 thành viên : Nguyễn Quang Chiều, Nguyễn Quang Huy và Nguyễn Trần Anh Đức . Đây là pha Reader trong dự án cuồi kì môn Các vấn đề hiện đại trong CNTT của nhóm . 
"""
question = "Who are the 3 members of the group?"
question_answerer(question=question, context=context)

Output

{'score': 0.998,
 'start': 98,
 'end': 158,
 'answer': 'Nguyễn Quang Chiều, Nguyễn Quang Huy và Nguyễn Trần Anh Đức.'}

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
data		data
Question_Answering_uit_vquad.ipynb		Question_Answering_uit_vquad.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Multilingual Question Answering Project

Reader Phase

Usage:

Output

About

Releases

Packages

Languages

nqchieutb01/Multilingual-Question-Anwsering

Folders and files

Latest commit

History

Repository files navigation

Multilingual Question Answering Project

Reader Phase

Usage:

Output

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages