Skip to content

Commit

Permalink
Merge pull request eugeneyan#104 from kaikaidai/main
Browse files Browse the repository at this point in the history
Fix Selene Mini context length
  • Loading branch information
eugeneyan authored Feb 13, 2025
2 parents f806b2e + 9afcb33 commit 78a2524
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -69,7 +69,7 @@ These LLMs (Large Language Models) are all licensed for commercial use (e.g., Ap
| Phi-3 small/medium | 2024/05<!--21--> | [Phi-3-mini-4k-instruct](https://huggingface.co/microsoft/Phi-3-mini-4k-instruct), [Phi-3-mini-128k-instruct](https://huggingface.co/microsoft/Phi-3-mini-128k-instruct), [Phi-3-medium-4k-instruct](https://huggingface.co/microsoft/Phi-3-medium-4k-instruct), [Phi-3-medium-128k-instruct](https://huggingface.co/microsoft/Phi-3-medium-128k-instruct) | [New models added to the Phi-3 family, available on Microsoft Azure](https://azure.microsoft.com/en-us/blog/new-models-added-to-the-phi-3-family-available-on-microsoft-azure/), [Technical Report](https://arxiv.org/abs/2404.14219) | 7, 14 | [4096, 128k](https://arxiv.org/abs/2404.14219) | MIT | |
| Phi-4 | 2024/12 | [Phi-4](https://huggingface.co/microsoft/phi-4)| [Introducing Phi-4: Microsoft’s Newest Small Language Model Specializing in Complex Reasoning](https://techcommunity.microsoft.com/blog/aiplatformblog/introducing-phi-4-microsoft%E2%80%99s-newest-small-language-model-specializing-in-comple/4357090), [Technical Report](https://arxiv.org/pdf/2412.08905) | 14 | [4096](https://arxiv.org/pdf/2412.08905) | MIT | |
| YuLan-Mini | 2024/12 | [YuLan-Mini](https://huggingface.co/yulan-team/YuLan-Mini) | [YuLan-Mini: An Open Data-efficient Language Model](https://arxiv.org/abs/2412.17743), [GitHub](https://github.com/RUC-GSAI/YuLan-Mini) | 14 | [28672](https://github.com/RUC-GSAI/YuLan-Mini) | MIT | [YuLan-Mini](https://huggingface.co/yulan-team/YuLan-Mini) |
| Selene Mini | 2025/01 | [Selene Mini](https://huggingface.co/AtlaAI/Selene-1-Mini-Llama-3.1-8B) | [Atla Selene Mini: A General Purpose Evaluation Model](https://arxiv.org/abs/2501.17195v1), [GitHub](https://github.com/atla-ai/selene-mini) | 8 | [8192](https://github.com/atla-ai/selene-mini) | Apache 2.0 | [Hugging Face Space](https://huggingface.co/spaces/AtlaAI/selene) |
| Selene Mini | 2025/01 | [Selene Mini](https://huggingface.co/AtlaAI/Selene-1-Mini-Llama-3.1-8B) | [Atla Selene Mini: A General Purpose Evaluation Model](https://arxiv.org/abs/2501.17195v1), [GitHub](https://github.com/atla-ai/selene-mini) | 8 | [128K](https://github.com/atla-ai/selene-mini) | Apache 2.0 | [Hugging Face Space](https://huggingface.co/spaces/AtlaAI/selene) |

## Open LLMs for code

Expand Down

0 comments on commit 78a2524

Please sign in to comment.