# simple-rwkv

RWKV LLM servicer for SimpleAI

## Description

This project wraps the RWKV-LM model in a gRPC service that can be used through SimpleAI.

RWKV is an RNN with Transformer-level language model performance that can be trained like a GPT transformer and is 100% attention-free. It combines the best of RNNs and transformers: great performance, fast inference, low VRAM use, fast training, "infinite" ctx_len, and free sentence embedding.

## Usage

Edit the MODEL variable in get_models.py to choose the model size and context.

Edit the STRATEGY variable in lib_raven.py to decide how the weights will be loaded. Experiment with this setting to optimise throughput for your system. See below for a graphic explanation, or check out ChatRWKV for more information.
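To make the shape of a STRATEGY value more concrete, here is a minimal sketch that splits a strategy string into its per-device stages. It assumes the format described in the ChatRWKV README (`device dtype [*N[+]]` stages joined by `->`, for example `cuda fp16 *10 -> cpu fp32`). The `Stage` tuple and `parse_strategy` helper are hypothetical names for illustration; this is not how the rwkv package itself parses the string.

```python
from typing import List, NamedTuple, Optional


class Stage(NamedTuple):
    device: str            # e.g. "cuda", "cuda:0", "cpu"
    dtype: str             # e.g. "fp16", "fp32", "fp16i8"
    layers: Optional[int]  # layer count for this stage; None means "all remaining"
    stream: bool           # True when the stage ends in "+" (e.g. "*10+")


def parse_strategy(strategy: str) -> List[Stage]:
    """Split a ChatRWKV-style strategy string into stages (illustrative only)."""
    stages = []
    for part in strategy.split("->"):
        tokens = part.split()
        if len(tokens) < 2:
            raise ValueError(f"expected 'device dtype', got {part!r}")
        device, dtype = tokens[0], tokens[1]
        layers, stream = None, False
        if len(tokens) > 2:
            spec = tokens[2]
            if not spec.startswith("*"):
                raise ValueError(f"expected '*N' layer spec, got {spec!r}")
            stream = spec.endswith("+")
            layers = int(spec[1:].rstrip("+"))
        stages.append(Stage(device, dtype, layers, stream))
    return stages


# Assumed examples: everything on GPU, split GPU/CPU, and int8 with streaming.
for candidate in ("cuda fp16", "cuda fp16 *10 -> cpu fp32", "cuda fp16i8 *10+"):
    print(candidate, "=>", parse_strategy(candidate))
```

Putting more layers on the GPU stage generally trades VRAM for speed, so trying a few values of N is a reasonable way to tune throughput.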

## Build

docker build . -t raven-rwkv-service:latest

## Start service

docker run -it --rm -p 50051:50051 --gpus all raven-rwkv-service:latest
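Once the container is running, a quick way to confirm that the published port is reachable is a plain TCP connection test. The sketch below uses only the standard library; `is_port_open` is a hypothetical helper, and it only shows that something is listening on the port, not that the gRPC service behind it is healthy.

```python
import socket


def is_port_open(host: str = "localhost", port: int = 50051, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within `timeout` seconds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers connection refused, timeouts, and name resolution failures.
        return False


if __name__ == "__main__":
    # 50051 is the port published by the docker run command above.
    print("service reachable:", is_port_open())
```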

## Add to models.toml

```toml
[raven]
    [raven.metadata]
        owned_by    = 'BlinkDL'
        permission  = []
        description = 'RWKV fine tuned for instruction answering'
    [raven.network]
        url = 'localhost:50051'
```

## Credits

Heavily borrowed from lhenault & BlinkDL

https://huggingface.co/spaces/BlinkDL/Raven-RWKV-7B

Nintorac/simple_rwkv