Skip to content

Commit

Permalink
MNTP training (#15)
Browse files Browse the repository at this point in the history
* first iteration of script ,TODO: save properly

* model class loading

* bf16 flash attn

* ignore wandb

* added dependencies for running mlm training

* revert custom local change

* peft loading and saving

* remove unused functions

* all configs

* minor formatting changes

* flash attention needs no-build-isolation
  • Loading branch information
vaibhavad authored Apr 17, 2024
1 parent c442e71 commit 0abdfcd
Show file tree
Hide file tree
Showing 9 changed files with 1,093 additions and 5 deletions.
4 changes: 3 additions & 1 deletion .gitignore
Original file line number Diff line number Diff line change
@@ -1,4 +1,6 @@
build/
dist/
*.egg-info
**/__pycache__
**/__pycache__
wandb/**
output/**
6 changes: 4 additions & 2 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -14,15 +14,17 @@ LLM2Vec is a simple recipe to convert decoder-only LLMs into text encoders. It c
</p>

## Installation
To use LLM2Vec, first install the llm2vec package from PyPI.
To use LLM2Vec, first install the llm2vec package from PyPI, followed by installing flash-attention:

```bash
pip install llm2vec
pip install flash-attn --no-build-isolation
```
You can also directly install it from our code by cloning the repository and:
You can also directly install the latest version of llm2vec by cloning the repository:

```bash
pip install -e .
pip install flash-attn --no-build-isolation
```

## Getting Started
Expand Down
Loading

0 comments on commit 0abdfcd

Please sign in to comment.