GEC-with-MoE-CoT-and-Rubrics

GEC System Architecture

Our GEC system architecture employs a novel combination of methodologies to enhance the accuracy and diversity of grammatical error corrections:

Mixture of Experts: Utilizes a specialized approach where each "student" model is an expert on one of four datasets—A, B, C, or N—achieved through fine-tuning on each specific dataset.
Chain-of-Thought Prompting: Utilizes structured reasoning prompts to guide the model towards more logical and contextually relevant corrections.
Rubric-Guided Evaluations: Incorporates specific evaluation criteria, ensuring that corrections adhere to predefined quality standards.
In-Context Learning: Leverages the capabilities of models with advanced reasoning abilities, notably GPT-3.5-turbo-1106, for dynamic learning within contextual boundaries.
Teacher and Student Model Integration: Combines the insights of teacher models in evaluating student corrections, fostering a rich pool of diverse and insightful corrections.

Clone the Repository:

git clone [email protected]:CrownKira/GEC-with-MoE-CoT-and-Rubrics.git

Create and Activate the Virtual Environment:

python3 -m venv venv
source venv/bin/activate

Install Required Python Packages:
```
pip3 install -r requirements.txt
```
Download the Necessary spaCy Language Model:
```
python3 -m spacy download en_core_web_sm
```

Configure Environment Variables: Obtain the .env file from me or set your own environment variables as needed.
Run the Main Script: Replace main.py with the actual name of your script.
```
python3 main.py
```
Check Outputs: Navigate to the outputs directory to access corrected text files and CSV outputs.

Name		Name	Last commit message	Last commit date
Latest commit History 132 Commits
backup		backup
cache		cache
clients		clients
commands		commands
corrected_m2		corrected_m2
corrected_output		corrected_output
demo/greco		demo/greco
fact-check		fact-check
gpt		gpt
reference_m2		reference_m2
reference_output		reference_output
splitters		splitters
systems		systems
test		test
together-ai		together-ai
.gitignore		.gitignore
README.md		README.md
main.py		main.py
main.py.bak		main.py.bak
main_line.py		main_line.py
requirements.txt		requirements.txt
setup.sh		setup.sh
test-chunking.py		test-chunking.py
test-command.py		test-command.py
test-csv.py		test-csv.py
test-local.py		test-local.py
test-spacy		test-spacy
test-spacy.py		test-spacy.py