v5.1.0
This release adds new model support for the translation pipeline, OpenAI Whisper support in the transcription pipeline and ARM Docker images. Topic modeling was also improved, including how BM25/TF-IDF indexes can be used to drive topic models.
See below for full details on the new features, improvements and bug fixes.
New Features
Improvements
- Read authentication parameters from storage task (#332)
- Update scoring algorithms (#351)
- Add config option for list of stopwords to ignore with topic generation (#352)
- Allow for setting custom translation model path (#355)
- Update caption pipeline to call image-to-text pipeline (#361)
- Update transcription pipeline to call automatic-speech-recognition pipeline (#362)
- Only pass tokenizer to pipeline when necessary (#363)
- Improve default max length logic for text generation (#364)
- Update transcription notebook (#365)
- Update translation notebook (#366)
- Move mkdocs dependencies from docs.yml to setup.py (#368)
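
As a rough illustration of the idea behind BM25-driven topic terms and the stopword option noted above, the sketch below ranks the terms of a group of documents by their summed BM25 weight against the full corpus. It is a standalone approximation in plain Python, not the library's implementation: the function name `bm25_topic_terms`, its parameters and the whitespace tokenizer are all assumptions made for this example.

```python
import math
from collections import Counter


def bm25_topic_terms(corpus, group, stopwords=(), k1=1.2, b=0.75, topn=3):
    """
    Hypothetical helper: returns the top terms for a group of documents.

    corpus    - list of document strings
    group     - list of indexes into corpus that form one topic
    stopwords - terms to ignore when generating topic terms
    """

    stop = set(stopwords)

    # Naive whitespace tokenization (assumption), dropping stopwords
    tokens = [[t for t in doc.lower().split() if t not in stop] for doc in corpus]

    total = len(tokens)
    avgdl = sum(len(doc) for doc in tokens) / total if total else 0.0
    if not avgdl:
        return []

    # Document frequency of each term across the full corpus
    df = Counter(term for doc in tokens for term in set(doc))

    scores = Counter()
    for index in group:
        doc = tokens[index]
        for term, freq in Counter(doc).items():
            # Standard BM25 idf and term frequency normalization
            idf = math.log(1 + (total - df[term] + 0.5) / (df[term] + 0.5))
            norm = freq + k1 * (1 - b + b * len(doc) / avgdl)
            scores[term] += idf * freq * (k1 + 1) / norm

    return [term for term, _ in scores.most_common(topn)]


corpus = [
    "the cat sat on the mat",
    "the cat chased the mouse",
    "stocks fell on the news",
    "the markets and stocks rallied",
]

# Label the first two documents as one topic, ignoring common words
terms = bm25_topic_terms(corpus, [0, 1], stopwords=["the", "on", "and"])
print(terms)
```

In this sketch, terms that appear in the stopword list never reach the scoring step, so they cannot surface as topic labels regardless of how frequent they are.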