epic: Model Converter Pipeline #22
Comments
Objectives
Quantization Strategy
Example Command
This is an example command to run a model with a tag
This concise command provides sufficient information for users.
Tasks
This approach will streamline model management and improve the user experience when working with cortex.cpp.
CI Pipelines for Model Conversion and Quantization
This PR introduces two CI pipelines to streamline the model processing workflow:
1. CI Convert and Quantization Pipeline
This pipeline automates the process of converting and quantizing models.
Inputs:
Process:
Result:
After successful processing, new tags will be added to the model repository. For example, see the llama3 repository:
2. CI Update model.yml Pipeline
This pipeline updates the model.yml file with new information.
Inputs:
Process:
These pipelines automate crucial steps in model processing and metadata management, streamlining the workflow for model updates and deployments.
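As a rough illustration of the first pipeline's shape (not the actual CI implementation), the sketch below plans one conversion job per requested quantization level and derives the tag that job would publish. Everything named here is an assumption made for illustration: the `QuantJob` type, the `plan_jobs` helper, the `example-org/example-model` source identifier, the rule that a tag is simply the lower-cased level name, and the sample level names.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass(frozen=True)
class QuantJob:
    """One planned convert-and-quantize run (hypothetical structure)."""
    source_model: str  # upstream model to convert (placeholder identifier)
    target_repo: str   # model repository that receives the new tag
    quant_level: str   # normalized quantization level label
    tag: str           # tag to publish; the naming rule is an assumption


def plan_jobs(source_model: str, target_repo: str,
              quant_levels: Iterable[str]) -> List[QuantJob]:
    """Return one job per distinct, non-empty quantization level."""
    seen = set()
    jobs = []
    for level in quant_levels:
        key = level.strip().lower()
        if not key or key in seen:
            continue  # skip blanks and duplicates so each tag is built once
        seen.add(key)
        # Assumed rule: the tag equals the normalized level name.
        jobs.append(QuantJob(source_model, target_repo, key, tag=key))
    return jobs


if __name__ == "__main__":
    # Illustrative inputs only; the real pipeline inputs are not shown here.
    for job in plan_jobs("example-org/example-model", "llama3",
                         ["Q4_K_M", "q8_0", "q4_k_m"]):
        print(f"{job.source_model} -> {job.target_repo}:{job.tag}")
```

Deduplicating before any work starts keeps a repeated level from triggering a second conversion for a tag that is already planned.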
@nguyenhoangthuan99 how do we use this pipeline?
The cortexso model repo must be created before running this pipeline (e.g.
Supported quantization levels:
To use this pipeline:
@nguyenhoangthuan99 I am refactoring the "Built-in Model Library" to a separate epic: #21
Infra:
I added the updated model converter pipeline to the janhq/models repo, and also added a pipeline that automatically updates the model.yml file on Hugging Face. cc @gabrielle-ong, we can now run the CI pipeline in this repo.
Guide for update
Please update with the format
Note that the
Marking as complete, successfully done for mistral-nemo and llama3.2
Goal
model.yaml with best parameters
User Story
Decisions
Tasklist
Model Compilation Pipeline
Future Roadmap