feat: sync llama cpp & support use jinja template / tool_calls #117

Merged: jhen0409 merged 30 commits into main from sync-llama-cpp on Feb 3, 2025

jhen0409 (Member) commented Jan 25, 2025

Sync to the latest llama.cpp commit at the time of writing, ggml-org/llama.cpp@6eecde3, to pick up:

  • Jinja template & universal tool call support
  • Metal: Use residency sets (included in changes)

TODO:

  • getFormattedChat
    • add jinja param (bool, default to false)
    • cache templates in llama_rn_context (the function is slow)
  • completion param
    • add jinja param (bool, default to false)
    • add tools param
  • context: add more info
    • whether the model supports tool_use
    • model chat_template details (taken from minja::chat_template)
  • Android: error handling
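
For reviewers, the option shapes and the template cache in the TODO list above could be sketched roughly as follows. This is only an illustration: `CompletionOptions`, `resolveCompletionOptions`, and `TemplateCache` are hypothetical names, the `tools` element type is an assumption, and the real cache is meant to live in the native `llama_rn_context` rather than in TypeScript.

```typescript
// Hypothetical option shape; field names follow the TODO list, not the merged API.
interface CompletionOptions {
  jinja?: boolean // TODO item: bool, default to false
  tools?: object[] // assumed shape: a list of tool definitions
}

// Fill in the defaults described in the TODO list.
function resolveCompletionOptions(
  opts: CompletionOptions = {},
): Required<CompletionOptions> {
  return { jinja: opts.jinja ?? false, tools: opts.tools ?? [] }
}

// Memoize an expensive template parse, keyed by the template source text,
// so repeated getFormattedChat-style calls do not re-parse the same template.
class TemplateCache<T> {
  private cache = new Map<string, T>()
  private parse: (source: string) => T
  parseCount = 0 // exposed only to make the caching observable

  constructor(parse: (source: string) => T) {
    this.parse = parse
  }

  get(source: string): T {
    if (!this.cache.has(source)) {
      this.parseCount += 1
      this.cache.set(source, this.parse(source))
    }
    return this.cache.get(source) as T
  }
}
```

Keying on the template source keeps the cache valid if a caller passes a custom template instead of the one bundled with the model.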

jhen0409 marked this pull request as ready for review on Feb 3, 2025, 05:46
jhen0409 changed the title from "feat: sync llama cpp & support use jinja template" to "feat: sync llama cpp & support use jinja template / tool_calls" on Feb 3, 2025
jhen0409 merged commit dd2179e into main on Feb 3, 2025
6 checks passed
jhen0409 deleted the sync-llama-cpp branch on Feb 3, 2025, 06:38