consolidated.safetensors #9916

Open

CrispStrobe wants to merge 1 commit into master
Conversation

CrispStrobe
Contributor

Easier handling (as e.g. for Ministral)

@github-actions bot added the python (python script changes) label Oct 16, 2024
Comment on lines 452 to 456
     for filename in os.listdir(dir_model):
-        if filename.startswith(prefix) and filename.endswith(suffix):
+        if any(filename.startswith(prefix) for prefix in prefixes) and any(filename.endswith(suffix) for suffix in suffixes):
             part_names.append(filename)
+        elif filename == "consolidated.safetensors":
+            part_names.append(filename)
Collaborator


What if there are both model*.safetensors files and consolidated.safetensors in the same directory?

For example, https://huggingface.co/mistralai/Mamba-Codestral-7B-v0.1/ (which needs #9126) has both consolidated.safetensors and model-0000?-of-00003.safetensors.

Since git config --local lfs.fetchinclude <some_pattern> can be used to selectively download model files, I'm not sure how to handle that case if consolidated.safetensors is detected. I think the convert script should not use both at once (since duplicated tensor names are problematic), but how to choose?

What do you think?
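For illustration, a minimal sketch (not part of the PR) of the duplication being discussed: it reads only the safetensors file headers and compares the tensor names exposed by consolidated.safetensors with those of the split model-*.safetensors files. The directory name is a hypothetical local checkout of the Mamba-Codestral repo mentioned above.

```python
# Minimal sketch, not from the PR: compare tensor names across the two layouts.
# Only the headers are parsed (8-byte little-endian length + JSON header, per the
# safetensors format), so no tensor data is loaded. Paths are hypothetical.
import json
import struct
from pathlib import Path

def tensor_names(path: Path) -> set[str]:
    with path.open("rb") as f:
        (header_len,) = struct.unpack("<Q", f.read(8))
        header = json.loads(f.read(header_len))
    return {name for name in header if name != "__metadata__"}

dir_model = Path("Mamba-Codestral-7B-v0.1")  # assumed local checkout
consolidated = tensor_names(dir_model / "consolidated.safetensors")
split: set[str] = set()
for part in sorted(dir_model.glob("model-*.safetensors")):
    split |= tensor_names(part)

print(f"consolidated: {len(consolidated)} tensors, "
      f"split: {len(split)} tensors, "
      f"shared names: {len(consolidated & split)}")
```

Any shared names would be seen twice by a converter that reads both file sets, which is why the script should commit to one layout.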

Contributor Author

@CrispStrobe Oct 17, 2024


Indeed. Something like the following, then?

def get_model_part_names(dir_model: Path, prefixes: list[str], suffixes: list[str]) -> list[str]:
    """
    Retrieves the list of model part filenames from the model directory.
    Prioritizes 'model-XXXX-of-XXXX.safetensors' files over 'consolidated.safetensors'.

    Parameters:
    - dir_model (Path): Path to the model directory.
    - prefixes (list[str]): List of filename prefixes to match.
    - suffixes (list[str]): List of filename suffixes to match.

    Returns:
    - list[str]: Sorted list of model part filenames.
    """
    part_names: list[str] = []

    # Collect files matching the given prefixes and suffixes
    for filename in os.listdir(dir_model):
        if any(filename.startswith(prefix) for prefix in prefixes) and any(filename.endswith(suffix) for suffix in suffixes):
            part_names.append(filename)
        elif filename == "consolidated.safetensors":
            part_names.append(filename)

    # Sort the list for consistency
    part_names.sort()

    # Check if both split files and 'consolidated.safetensors' are present
    split_files = [f for f in part_names if f.startswith("model-") and f.endswith(".safetensors")]
    consolidated_present = "consolidated.safetensors" in part_names

    if split_files and consolidated_present:
        logger.debug("Both split model files and 'consolidated.safetensors' found. Ignoring 'consolidated.safetensors'.")
        # Remove 'consolidated.safetensors' from part_names
        part_names = [f for f in part_names if f != "consolidated.safetensors"]

    # Final sort after potential removal
    part_names.sort()

    if not part_names:
        logger.warning("No model weight files found in the directory.")

    return part_names
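For reference, a usage sketch of the proposed helper. It assumes the function sits in convert_hf_to_gguf.py, where os, Path, and logger are already available; the imports and logger setup below only make the snippet self-contained, and the model directory and argument values are illustrative.

```python
# Usage sketch: these imports and the logger mirror what convert_hf_to_gguf.py
# already provides at module top level (i.e. before the function definition);
# the directory name is hypothetical and the prefix/suffix lists follow the
# converter's existing "model" / ".safetensors" convention.
import logging
import os
from pathlib import Path

logging.basicConfig(level=logging.DEBUG)
logger = logging.getLogger(__name__)

part_names = get_model_part_names(Path("Mamba-Codestral-7B-v0.1"), ["model"], [".safetensors"])
# With both layouts on disk, the split parts are kept and consolidated.safetensors
# is dropped, e.g. ['model-00001-of-00003.safetensors', ..., 'model-00003-of-00003.safetensors']
print(part_names)
```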
