⚙️ Fine-tuning Mistral 7B using QLoRA for Video Game Reviews Classification

This repository contains code for fine-tuning the Mistral 7B language model using QLoRA (Quantization-Based Low-Rank Adaptation) for the task of classifying video game reviews. The goal is to leverage self-supervised fine-tuning to improve the model's performance on a specific downstream task, namely, classifying video game reviews based on their meaning representation.

Introduction

The code in this repository fine-tunes the Mistral 7B language model using QLoRA for the task of classifying video game reviews. The following steps are performed in this codebase:

Accelerator Setup: Utilizes the Accelerate library to configure distributed training and offloading to CPUs, preparing the environment for efficient training.
Load Dataset: The video game reviews dataset is loaded using the Hugging Face Datasets library. This dataset consists of training, validation, and test splits.
Tokenization: Tokenizes the data using a predefined prompt and prepares it for fine-tuning. The data is tokenized to make the labels and input_ids match.
Base Model Inspection: The base model's initial performance on a test input is assessed to establish a baseline for comparison.
Set Up LoRA: The LoRA (Low-Rank Adaptation) framework is set up, and QLoRA adapters are applied to specific linear layers of the model.
Training: The fine-tuning process is executed, and the model is trained on the prepared dataset with QLoRA adapters.
Testing: After training, the best-performing model checkpoint is loaded, and the model is tested on a sample input to evaluate its performance.

Installation

To use this code, you need to install the required dependencies. You can install these dependencies using pip:

pip install -q -U bitsandbytes
pip install -q -U git+https://github.com/huggingface/transformers.git
pip install -q -U git+https://github.com/huggingface/peft.git
pip install -q -U git+https://github.com/huggingface/accelerate.git
pip install -q -U datasets scipy ipywidgets
pip install wandb -qU

Results

The code in this repository demonstrates the fine-tuning of Mistral 7B with QLoRA for video game reviews classification. The final model achieved improved performance on the specific downstream task of classifying video game reviews based on their meaning representation.

License

This code is provided under an open-source license. You can find the license information in the repository.

For any questions or inquiries, please feel free to contact the project maintainers.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
README.md		README.md
mistral_finetune.ipynb		mistral_finetune.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

⚙️ Fine-tuning Mistral 7B using QLoRA for Video Game Reviews Classification

Table of Contents

Introduction

Installation

Results

License

About

Releases

Packages

Languages

titi-devv/mistral7b-finetuned

Folders and files

Latest commit

History

Repository files navigation

⚙️ Fine-tuning Mistral 7B using QLoRA for Video Game Reviews Classification

Table of Contents

Introduction

Installation

Results

License

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages