Skip to content

0.3.1

Compare
Choose a tag to compare
@b4rtaz b4rtaz released this 28 Apr 21:36
· 117 commits to main since this release
37fad6a
  • Changed order of QKV synchronization (details)
  • All tasks of Llama architecture are executed in parallel
  • Rope cache for Llama architecture