This repository contains the PyTorch implementation that produces the evaluation results for the NeurIPS 2024 paper "Selective Generation for Controllable Language Models".
If you use conda,
conda create -n sg python=3.8
conda activate sg
pip install -r requirements.txt
Additional dependencies (e.g., PyTorch) may need to be installed if you want to load models yourself; see below.
If you are only going to use the models and datasets as provided in the paper, you do not need to load the models manually, as both log probabilities and entailment scores have been precomputed and stored in the dataset.
We used Alpaca7B and the GPT-3.5-Turbo API as generators, and DeBERTa-v2-xxlarge, fine-tuned on the MNLI dataset, as the entailment model.
- To use Alpaca7B, request access to LLaMA and load the Stanford Alpaca weights.
- To use GPT-3.5, set up the OpenAI API.
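For reference, below is a minimal, hedged sketch of querying GPT-3.5-Turbo with token log probabilities through the openai Python client. The prompt, model name, and decoding settings are placeholders, and the calls actually used for the paper live in the repository's generation code, which may target a different client version.

```python
# Hedged sketch: querying GPT-3.5-Turbo with token log probabilities via the
# openai>=1.0 Python client. Prompt and settings below are placeholders.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

response = client.chat.completions.create(
    model="gpt-3.5-turbo",
    messages=[{"role": "user", "content": "Who wrote Hamlet?"}],
    temperature=0.0,   # greedy-style decoding
    logprobs=True,     # return per-token log probabilities
)

answer = response.choices[0].message.content
token_logprobs = [t.logprob for t in response.choices[0].logprobs.content]
print(answer, sum(token_logprobs))
```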
If you wish to use a different entailment model, modify the EMDLPATH variable in the shell script accordingly.
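For reference, the sketch below shows one way to score entailment with an MNLI-fine-tuned DeBERTa checkpoint from Hugging Face. The checkpoint name, example sentences, and label handling are assumptions; check them against the model that EMDLPATH points to.

```python
# Hedged sketch: entailment scoring with an MNLI-fine-tuned DeBERTa model.
# The checkpoint name is an assumption; EMDLPATH in the scripts should point
# to whichever entailment model you actually use.
import torch
from transformers import AutoModelForSequenceClassification, AutoTokenizer

MODEL = "microsoft/deberta-v2-xxlarge-mnli"  # assumed checkpoint
tokenizer = AutoTokenizer.from_pretrained(MODEL)
model = AutoModelForSequenceClassification.from_pretrained(MODEL).eval()

premise = "Shakespeare wrote Hamlet in the early 17th century."
hypothesis = "Shakespeare wrote Hamlet."

inputs = tokenizer(premise, hypothesis, return_tensors="pt", truncation=True)
with torch.no_grad():
    probs = model(**inputs).logits.softmax(dim=-1).squeeze(0)

# Check model.config.id2label for the exact label order of your checkpoint.
label_id = {v.lower(): k for k, v in model.config.id2label.items()}["entailment"]
print("entailment score:", probs[label_id].item())
```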
This implementation supports greedy generation and obtaining log probabilities from other models on demand. However, if you want to use a different model, labeling must be done manually, and to use other APIs you must configure the dataset yourself (see /generation/ for reference).
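As a rough illustration of what that involves, the sketch below performs greedy decoding with a Hugging Face causal LM and recovers per-token log probabilities. The model name and prompt are placeholders; the code in /generation/ is the reference implementation and may differ.

```python
# Hedged sketch: greedy generation plus per-token log probabilities with a
# Hugging Face causal LM. The model name is a placeholder; the repository's
# /generation/ code is the reference implementation.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL = "gpt2"  # placeholder; substitute your generator checkpoint
tokenizer = AutoTokenizer.from_pretrained(MODEL)
model = AutoModelForCausalLM.from_pretrained(MODEL).eval()

inputs = tokenizer("Question: Who wrote Hamlet?\nAnswer:", return_tensors="pt")
with torch.no_grad():
    out = model.generate(
        **inputs,
        do_sample=False,            # greedy decoding
        max_new_tokens=32,
        return_dict_in_generate=True,
        output_scores=True,
    )

# Log probability of each generated token under the model.
scores = model.compute_transition_scores(
    out.sequences, out.scores, normalize_logits=True
)
generated = out.sequences[0, inputs["input_ids"].shape[1]:]
print(tokenizer.decode(generated, skip_special_tokens=True))
print("sequence log prob:", scores[0].sum().item())
```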
In this paper, the Natural Questions (NQ) dataset and the QA2D dataset, filtered to contain only SQuAD examples, are sampled and used. Since the NQ dataset has no transformed answers, we use the rule-based method from "Transforming Question Answering Datasets Into Natural Language Inference Datasets" to obtain transformed declarative sequences. The QA2D dataset, also available via this repository, contains human-annotated answers collected via Mechanical Turk.
The following commands generate the figures and tables presented in the paper.
To get results (in /snapshots/) for a given model and dataset, e.g., GPT-3.5 and the NQ dataset:
./scripts/run_nq_gpt3.5.sh
To draw box plots, e.g., for GPT-3.5 and the NQ dataset:
# This draws the box plots in Figure 4.
./scripts/run_nq_gpt3.5_plot.sh
To draw bar plots:
# This draws the bar plots over different numbers of unlabeled samples in Figure 3.
./scripts/run_nq_gpt3.5_quan_plot.sh
./scripts/run_nq_alpaca7B_quan_plot.sh