
ZiyiTsang/Epipaca


The code implementation of Epipaca

Epipaca is a cross-language LLM adapter designed for epilepsy-care instruction, with support for both Mandarin and English.

Training steps

Generate the seed_task (~200 records)

In this step, we handwrite some of the tasks in seed_task, then ask the LLM to generate more seed_task records in both Mandarin and English. Each seed_task record is an epilepsy-care instruction. Afterwards, we review the generated records and remove the bad ones.
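The seed-expansion step above can be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: the record fields (`lang`, `instruction`), the example instructions, and the helper `build_generation_prompt` are all hypothetical, and the call to the LLM itself is left out.

```python
import random

# Hypothetical handwritten seeds; the real seed_task schema may differ.
HANDWRITTEN_SEEDS = [
    {"lang": "en", "instruction": "List common seizure triggers."},
    {"lang": "en", "instruction": "Describe first aid for a seizure."},
    {"lang": "zh", "instruction": "什么是癫痫？"},
]

LANG_NAMES = {"en": "English", "zh": "Mandarin"}


def build_generation_prompt(seeds, lang, k=2, rng=None):
    """Build a few-shot prompt asking an LLM for one new instruction in `lang`."""
    rng = rng or random.Random(0)
    # Only show examples written in the target language.
    pool = [s for s in seeds if s["lang"] == lang]
    examples = rng.sample(pool, min(k, len(pool)))
    lines = [
        f"Write one new epilepsy-care instruction in {LANG_NAMES[lang]}.",
        "Examples:",
    ]
    for i, example in enumerate(examples, 1):
        lines.append(f"{i}. {example['instruction']}")
    return "\n".join(lines)


if __name__ == "__main__":
    print(build_generation_prompt(HANDWRITTEN_SEEDS, "en"))
```

The prompt text would be sent to whichever LLM is used, and the returned records would then be reviewed by hand as described above.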

Generate the synthetic data (2k records)

In this step, we ask the LLM to generate more synthetic data in both Mandarin and English. Hand-written filter rules are applied to remove bad generated records. We also upload the Epilepsy_Synthetics dataset, for research purposes only.
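The README does not list the filter rules themselves, so the sketch below only illustrates what hand-written rules of this kind might look like. The specific rules (non-empty fields, no duplicate instructions, script matches the declared language), the record fields, and the function names are assumptions made for illustration.

```python
import re

# Matches common CJK ideographs; used as a rough language check (an assumption).
CJK = re.compile(r"[\u4e00-\u9fff]")


def passes_rules(record, seen):
    """Return True if a generated record survives the illustrative rules."""
    instruction = record.get("instruction", "").strip()
    output = record.get("output", "").strip()
    # Rule 1: both fields must be non-empty.
    if not instruction or not output:
        return False
    # Rule 2: drop case-insensitive duplicate instructions.
    key = instruction.lower()
    if key in seen:
        return False
    # Rule 3: the instruction's script must match the declared language.
    has_cjk = bool(CJK.search(instruction))
    if record.get("lang") == "zh" and not has_cjk:
        return False
    if record.get("lang") == "en" and has_cjk:
        return False
    seen.add(key)
    return True


def filter_records(records):
    """Keep only the records that pass every rule, preserving order."""
    seen = set()
    return [r for r in records if passes_rules(r, seen)]
```

Keeping each rule as a separate, commented check makes it easy to add or remove rules as bad generations are discovered.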

Fine-tune the LLM

In this step, we fine-tune the LLM on the seed_task and synthetic data.
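As a rough illustration of preparing the fine-tuning input, the sketch below merges the two record sets into prompt/completion pairs. The Alpaca-style template, the `instruction`/`output` field names, and both helper functions are assumptions; the README does not specify the training format or framework.

```python
import json

# Alpaca-style template, assumed here for illustration only.
PROMPT_TEMPLATE = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n### Response:\n"
)


def to_training_examples(seed_tasks, synthetic):
    """Merge seed_task and synthetic records into prompt/completion pairs."""
    examples = []
    for record in list(seed_tasks) + list(synthetic):
        examples.append(
            {
                "prompt": PROMPT_TEMPLATE.format(instruction=record["instruction"]),
                "completion": record["output"],
            }
        )
    return examples


def to_jsonl(examples):
    """Serialize examples as JSON Lines, keeping Mandarin text unescaped."""
    return "\n".join(json.dumps(e, ensure_ascii=False) for e in examples)
```

The resulting JSON Lines text could be written to a file and passed to whichever fine-tuning framework is used.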
