fairseq2: FAIR Sequence Modeling Toolkit 2

Documentation: Stable, Nightly | Install: Linux, macOS, Windows, From Source | Contribute: Guidelines

fairseq2 is a sequence modeling toolkit that allows researchers to train custom models for content generation tasks.

Who uses it?

Many FAIR teams utilize fairseq2 for a diverse set of projects, ranging from language model preference optimization to pretraining video diffusion models.

How is fairseq2 different from the original fairseq?

fairseq2 is a start-from-scratch project that can be considered a reboot of the original fairseq to provide a clean, modular API. Notably, it differs from its predecessor in its design philosophy, moving from a monolithic framework to an extensible, much less intrusive architecture allowing researchers to independently own their project code base.

As fairseq2 is a complete new project rather than an incremental update to the original fairseq, we intentionally avoided labeling it as fairseq version 2, reflecting its distinct and separate identity.

What's New?

February 2025: Instruction finetuning and preference optimization recipes with support for DPO, CPO, SimPO, and ORPO. Supports tensor parallelism and 70B+ scales.

Features

First-party recipes for language model instruction finetuning and preference optimization
Multi-GPU, multi-node training using DDP, FSDP, and tensor parallelism. Supports 70B+ models.
Native support for vLLM along with built-in sampling and beam search sequence generators
Extensible with setuptools extension mechanism. Easily register new models, optimizers, lr schedulers, trainer units without forking/branching the library.
Modern PyTorch tooling. Uses composability (i.e. torch.compile), PyTorch FSDP, and other relevant features
Streaming-based, high throughput data pipeline API written in C++ with support for speech and (soon) video decoding
Programmatic asset cards for version controlled access to models, datasets, and tokenizers
Flexible, but deterministic configuration based on the built-in structured API

Getting Started

Visit our documentation website to learn more about fairseq2.

Models

As of today, the following models are available in fairseq2 for use in training and evaluation recipes:

fairseq2 is also used by various external projects such as:

Installing on Linux

System Dependencies

fairseq2 depends on libsndfile, which can be installed via the system package manager on most Linux distributions. For Ubuntu-based systems, run:

sudo apt install libsndfile1

Similarly, on Fedora, run:

sudo dnf install libsndfile

For other Linux distributions, please consult its documentation on how to install packages.

pip

To install fairseq2 on Linux x86-64, run:

pip install fairseq2

This command will install a version of fairseq2 that is compatible with PyTorch hosted on PyPI.

At this time, we do not offer a pre-built package for ARM-based systems such as Raspberry PI or NVIDIA Jetson. Please refer to Install From Source to learn how to build and install fairseq2 on those systems.

Variants

Besides PyPI, fairseq2 also has pre-built packages available for different PyTorch and CUDA versions hosted on FAIR's package repository. The following matrix shows the supported combinations.

fairseq2	PyTorch	Python	Variant*	Arch
`HEAD`	`2.6.0`	`>=3.10`, `<=3.12`	`cpu`, `cu118`, `cu124`	`x86_64`
	`2.5.0`, `2.5.1`	`>=3.10`, `<=3.12`	`cpu`, `cu118`, `cu121`, `cu124`	`x86_64`
	`2.4.0`, `2.4.1`	`>=3.10`, `<=3.12`	`cpu`, `cu118`, `cu121`, `cu124`	`x86_64`
`0.4`	`2.6.0`	`>=3.10`, `<=3.12`	`cpu`, `cu118`, `cu124`	`x86_64`
	`2.5.0`, `2.5.1`	`>=3.10`, `<=3.12`	`cpu`, `cu118`, `cu121`, `cu124`	`x86_64`
	`2.4.0`, `2.4.1`	`>=3.10`, `<=3.12`	`cpu`, `cu118`, `cu121`, `cu124`	`x86_64`

* cuXYZ refers to CUDA XY.Z (e.g. cu118 means CUDA 11.8)

To install a specific combination, first follow the installation instructions on pytorch.org for the desired PyTorch version, and then use the following command (shown for PyTorch 2.6.0 and variant cu124):

pip install fairseq2\
  --extra-index-url https://fair.pkg.atmeta.com/fairseq2/whl/pt2.6.0/cu124

Warning

fairseq2 relies on the C++ API of PyTorch which has no API/ABI compatibility between releases. This means you have to install the fairseq2 variant that exactly matches your PyTorch version. Otherwise, you might experience issues like immediate process crashes or spurious segfaults. For the same reason, if you upgrade your PyTorch version, you must also upgrade your fairseq2 installation.

Nightlies

For Linux, we also host nightly builds on FAIR's package repository. The supported variants are identical to the ones listed in Variants above. Once you have installed the desired PyTorch version, you can use the following command to install the corresponding nightly package (shown for PyTorch 2.6.0 and variant cu124):

pip install fairseq2\
  --pre --extra-index-url https://fair.pkg.atmeta.com/fairseq2/whl/nightly/pt2.6.0/cu124

Installing on macOS

System Dependencies

fairseq2 depends on libsndfile, which can be installed via Homebrew:

brew install libsndfile

pip

To install fairseq2 on ARM64-based (i.e. Apple silicon) Mac computers, run:

pip install fairseq2

This command will install a version of fairseq2 that is compatible with PyTorch hosted on PyPI.

At this time, we do not offer a pre-built package for Intel-based Mac computers. Please refer to Install From Source to learn how to build and install fairseq2 on Intel machines.

Variants

Besides PyPI, fairseq2 also has pre-built packages available for different PyTorch versions hosted on FAIR's package repository. The following matrix shows the supported combinations.

fairseq2	PyTorch	Python	Arch
`0.4`	`2.6.0`	`>=3.10`, `<=3.12`	`arm64`

To install a specific combination, first follow the installation instructions on pytorch.org for the desired PyTorch version, and then use the following command (shown for PyTorch 2.6.0):

pip install fairseq2\
  --extra-index-url https://fair.pkg.atmeta.com/fairseq2/whl/pt2.6.0/cpu

Warning

fairseq2 relies on the C++ API of PyTorch which has no API/ABI compatibility between releases. This means you have to install the fairseq2 variant that exactly matches your PyTorch version. Otherwise, you might experience issues like immediate process crashes or spurious segfaults. For the same reason, if you upgrade your PyTorch version, you must also upgrade your fairseq2 installation.

Nightlies

For macOS, we also host nightly builds on FAIR's package repository. The supported variants are identical to the ones listed in Variants above. Once you have installed the desired PyTorch version, you can use the following command to install the corresponding nightly package (shown for PyTorch 2.6.0):

pip install fairseq2\
  --pre --extra-index-url https://fair.pkg.atmeta.com/fairseq2/whl/nightly/pt2.6.0/cpu

Installing on Windows

fairseq2 does not have native support for Windows and there are no plans to support it in the foreseeable future. However, you can use fairseq2 via the Windows Subsystem for Linux (a.k.a. WSL) along with full CUDA support introduced in WSL 2. Please follow the instructions in the Installing on Linux section for a WSL-based installation.

Installing from Source

See here.

Contributing

We always welcome contributions to fairseq2! Please refer to Contribution Guidelines to learn how to format, test, and submit your work.

Citing fairseq2

If you use fairseq2 in your research and wish to refer to it, please use the following BibTeX entry.

@software{balioglu2023fairseq2,
  author = {Can Balioglu and Martin Gleize and Artyom Kozhevnikov and Ilia Kulikov and Tuan Tran and Julien Yao},
  title = {fairseq2},
  url = {http://github.com/facebookresearch/fairseq2},
  year = {2023},
}

License

This project is MIT licensed, as found in the LICENSE file.

Name		Name	Last commit message	Last commit date
Latest commit History 1,202 Commits
.github		.github
ci		ci
doc		doc
native		native
src/fairseq2		src/fairseq2
tests		tests
tools		tools
.gitignore		.gitignore
.gitmodules		.gitmodules
CHANGELOG.md		CHANGELOG.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
INSTALL_FROM_SOURCE.md		INSTALL_FROM_SOURCE.md
LICENSE		LICENSE
README.md		README.md
VERSION		VERSION
bibliography.bib		bibliography.bib
pyproject.toml		pyproject.toml
requirements-devel.txt		requirements-devel.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

fairseq2: FAIR Sequence Modeling Toolkit 2

Who uses it?

How is fairseq2 different from the original fairseq?

What's New?

Features

Getting Started

Models

Installing on Linux

System Dependencies

pip

Variants

Nightlies

Installing on macOS

System Dependencies

pip

Variants

Nightlies

Installing on Windows

Installing from Source

Contributing

Citing fairseq2

License

About

Releases 5

Contributors 40

Languages

License

facebookresearch/fairseq2

Folders and files

Latest commit

History

Repository files navigation

fairseq2: FAIR Sequence Modeling Toolkit 2

Who uses it?

How is fairseq2 different from the original fairseq?

What's New?

Features

Getting Started

Models

Installing on Linux

System Dependencies

pip

Variants

Nightlies

Installing on macOS

System Dependencies

pip

Variants

Nightlies

Installing on Windows

Installing from Source

Contributing

Citing fairseq2

License

About

Topics

Resources

License

Code of conduct

Security policy

Stars

Watchers

Forks

Releases 5

Contributors 40

Languages