-
Notifications
You must be signed in to change notification settings - Fork 4
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Support for Gemini 2 Flash Thinking (#35)
- plus documentation enhancements
- Loading branch information
Showing
16 changed files
with
235 additions
and
57 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,19 +1,93 @@ | ||
# Supported models | ||
|
||
The following models have been [evaluated](evaluation.md) with `freeact`: | ||
For the following models, `freeact` provides model-specific prompt templates. | ||
|
||
- Claude 3.5 Sonnet (20241022) | ||
- Claude 3.5 Haiku (20241022) | ||
- Gemini 2.0 Flash (experimental) | ||
- Qwen 2.5 Coder 32B Instruct | ||
- DeepSeek V3 | ||
| Model | Release | [Evaluation](evaluation.md) | Prompt | | ||
|-----------------------------|------------|-----------|--------------| | ||
| Claude 3.5 Sonnet | 2024-10-22 | ✓ | optimized | | ||
| Claude 3.5 Haiku | 2024-10-22 | ✓ | optimized | | ||
| Gemini 2.0 Flash | 2024-12-11 | ✓ | experimental | | ||
| Gemini 2.0 Flash Thinking | 2025-01-21 | ✗ | experimental | | ||
| Qwen 2.5 Coder 32B Instruct | | ✓ | experimental | | ||
| DeepSeek V3 | | ✓ | experimental | | ||
|
||
For these models, `freeact` provides model-specific prompt templates. | ||
!!! Info | ||
|
||
!!! Note | ||
`freeact` additionally supports the [integration](integration.md) of new models from any provider that is compatible with the [OpenAI Python SDK](https://github.com/openai/openai-python), including open models deployed locally with [ollama](https://ollama.com/) or [TGI](https://huggingface.co/docs/text-generation-inference/index), for example. | ||
|
||
In addition to the models we evaluated, `freeact` also supports the [integration](integration.md) of new models from any provider that is compatible with the [OpenAI Python SDK](https://github.com/openai/openai-python), including open models deployed locally with [ollama](https://ollama.com/) or [TGI](https://huggingface.co/docs/text-generation-inference/index), for example. | ||
## Command line | ||
|
||
!!! Tip | ||
This section demonstrates how you can launch `freeact` agents with these models from the [command line](cli.md). All agents use the [predefined](https://gradion-ai.github.io/freeact-skills/) `freeact_skills.search.google.stream.api` skill module for generative Google search. The required [Gemini](https://aistudio.google.com/app/apikey) API key for that skill must be defined in a `.env` file in the current working directory: | ||
|
||
For best performance, we recommend Claude 3.5 Sonnet, with DeepSeek V3 as a close second. Support for Gemini 2.0 Flash, Qwen 2.5 Coder, and DeepSeek V3 remains experimental as we continue to optimize their prompt templates. | ||
```env title=".env" | ||
# Required for `freeact_skills.search.google.stream.api` | ||
GOOGLE_API_KEY=... | ||
``` | ||
|
||
API keys and base URLs for code action models are provided as `--api-key` and `--base-url` arguments, respectively. Code actions are executed in a Docker container created from the [prebuilt](environment.md#prebuilt-docker-images) `ghcr.io/gradion-ai/ipybox:basic` image, passed as `--ipybox-tag` argument. | ||
|
||
!!! Info | ||
|
||
The [CLI documentation](cli.md) covers more details how environment variables can be passed to `freeact` agent systems. | ||
|
||
### Claude 3.5 Sonnet | ||
|
||
```bash | ||
python -m freeact.cli \ | ||
--model-name=claude-3-5-sonnet-20241022 \ | ||
--ipybox-tag=ghcr.io/gradion-ai/ipybox:basic \ | ||
--skill-modules=freeact_skills.search.google.stream.api \ | ||
--api-key=$ANTHROPIC_API_KEY | ||
``` | ||
|
||
### Claude 3.5 Haiku | ||
|
||
```bash | ||
python -m freeact.cli \ | ||
--model-name=claude-3-5-haiku-20241022 \ | ||
--ipybox-tag=ghcr.io/gradion-ai/ipybox:basic \ | ||
--skill-modules=freeact_skills.search.google.stream.api \ | ||
--api-key=$ANTHROPIC_API_KEY | ||
``` | ||
|
||
### Gemini 2.0 Flash | ||
|
||
```bash | ||
python -m freeact.cli \ | ||
--model-name=gemini-2.0-flash-exp \ | ||
--ipybox-tag=ghcr.io/gradion-ai/ipybox:basic \ | ||
--skill-modules=freeact_skills.search.google.stream.api | ||
--api-key=$GOOGLE_API_KEY | ||
``` | ||
|
||
### Gemini 2.0 Flash Thinking | ||
|
||
```bash | ||
python -m freeact.cli \ | ||
--model-name=gemini-2.0-flash-thinking-exp-01-21 \ | ||
--ipybox-tag=ghcr.io/gradion-ai/ipybox:basic \ | ||
--skill-modules=freeact_skills.search.google.stream.api | ||
--api-key=$GOOGLE_API_KEY | ||
``` | ||
|
||
### Qwen 2.5 Coder 32B Instruct | ||
|
||
```bash | ||
python -m freeact.cli \ | ||
--model-name=Qwen/Qwen2.5-Coder-32B-Instruct \ | ||
--ipybox-tag=ghcr.io/gradion-ai/ipybox:basic \ | ||
--skill-modules=freeact_skills.search.google.stream.api \ | ||
--base-url=https://api-inference.huggingface.co/v1/ \ | ||
--api-key=$HF_TOKEN | ||
``` | ||
|
||
### DeepSeek V3 | ||
|
||
```bash | ||
python -m freeact.cli \ | ||
--model-name=accounts/fireworks/models/deepseek-v3 \ | ||
--ipybox-tag=ghcr.io/gradion-ai/ipybox:basic \ | ||
--skill-modules=freeact_skills.search.google.stream.api \ | ||
--base-url=https://api.fireworks.ai/inference/v1 \ | ||
--api-key=$FIREWORKS_API_KEY | ||
``` |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.