Skip to content

v0.5.0

Compare
Choose a tag to compare
@shubhadeepd shubhadeepd released this 20 Mar 18:10
· 102 commits to main since this release
6de0008

This release adds new dedicated RAG examples showcasing state of the art usecases, switches to the latest API catalog endpoints from NVIDIA and also refactors the API interface of chain-server. This release also improves the developer experience by adding github pages based documentation and streamlining the example deployment flow using dedicated compose files.

Added

Changed

  • Switched from NVIDIA AI Foundation to NVIDIA API Catalog endpoints for accessing cloud hosted LLM models.
  • Refactored API schema of chain-server component to support runtime allocation of llm parameters like temperature, max tokens, chat history etc.
  • Renamed llm-playground service in compose files to rag-playground.
  • Switched base containers for all components to ubuntu instead of pytorch and optimized container build time as well as container size.
  • Deprecated yaml based configuration to avoid confusion, all configurations are now environment variable based.
  • Removed requirement of hardcoding NVIDIA_API_KEY in compose.env file.
  • Upgraded all python dependencies for chain-server and rag-playground services.

Fixed

  • Fixed a bug causing hallucinated answer when retriever fails to return any documents.
  • Fixed some accuracy issues for all the examples.