Releases: michaelfeil/infinity
Releases · michaelfeil/infinity
0.0.25
What's Changed
- add engine args similar to vllm by @michaelfeil in #102
- ct2 bump by @michaelfeil in #103
- Deps free by @michaelfeil in #104
- fix: cli start by @michaelfeil in #105
Full Changelog: 0.0.24...0.0.25
0.0.24
What's Changed
- Update README.md contribution guidelines by @michaelfeil in #91
- Update tensorrt, onnxruntime, cuda base by @michaelfeil in #93
- Update dependencies by @NirantK in #96
- Torch dynamic shapes by @michaelfeil in #97
- update poetry version + cache in ci by @michaelfeil in #99
- pydantic upgrade by @michaelfeil in #100
- pydantic-v1-backwards-fixes by @michaelfeil in #101
New Contributors
Full Changelog: 0.0.23...0.0.24
0.0.23
What's Changed
- support hf_transfer by @michaelfeil in #81
- update dstack support by @deep-diver in #79
- Update Dockerfile to python 3.11 + CI fix by @michaelfeil in #83
- adding revision by @michaelfeil in #84
- starting to deprecated fastembed and ctranslate2 by @michaelfeil in #86
New Contributors
- @deep-diver made their first contribution in #79 Thanks @deep-diver
Full Changelog: 0.0.22...0.0.23
0.0.22
0.0.21
What's Changed
- improvements optimum by @michaelfeil in #74
- bump sentence-transformers to 2.3.0 by @michaelfeil in #76
- update dockerfile and tensorrt by @michaelfeil in #75
Full Changelog: 0.0.20...0.0.21
0.0.20
What's Changed
- update arm docker by @michaelfeil in #73
- patch release: optimum tokenization issue
Full Changelog: 0.0.19...0.0.20
0.0.19 - yanked
0.0.18 - yanked
What's Changed
- support mps backend. by @ninehills in #59
- Add optimum[onnx] by @michaelfeil in #68
New Contributors
- @ninehills made their first contribution in #59 Thanks @ninehills for sharing this on twitter.
Full Changelog: 0.0.17...0.0.18
0.0.17
What's Changed
Breaking: Switched to Cuda 12.1 and torch 2.1.2
- Add rerank/predict endpoint in the API by @michaelfeil in #50
- update dockerfile (Cuda 12.1 and torch 2.1.2) and tests by @michaelfeil in #54
Full Changelog: 0.0.16...0.0.17
0.0.16
What's Changed
- fixing delayed warmup by @michaelfeil in #53
- expose
capabilities
by @michaelfeil in #53
Full Changelog: 0.0.15...0.0.16