
Missing Transformers initializer for Falcon models #1988

Open
martin-gorner opened this issue Nov 19, 2024 · 1 comment

Comments

martin-gorner (Contributor) commented Nov 19, 2024

Repro code:
model5 = keras_hub.models.CausalLM.from_preset("hf://tiiuae/falcon-7b-instruct", dtype="bfloat16")
Result:
ValueError: KerasHub has no converter for huggingface/transformers models with model type 'falcon'

Now that the Falcon model family exists in KerasHub, this should work.

mehtamansi29 self-assigned this Dec 4, 2024

mehtamansi29 (Collaborator) commented:

Hi @martin-gorner -

Thanks for reporting the issue. You can initialize the falcon-7b-instruct model using the Transformers AutoTokenizer and AutoModelForCausalLM classes:

from transformers import AutoModelForCausalLM, AutoTokenizer
import torch

model_name = "tiiuae/falcon-7b-instruct"

tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype=torch.bfloat16, trust_remote_code=True, device_map="auto")

And to load a model from the Falcon family (falcon_refinedweb_1b_en) in KerasHub, you can do it like this:
model5 = keras_hub.models.CausalLM.from_preset("hf://keras/falcon_refinedweb_1b_en", dtype="bfloat16")

Attached a gist here for reference.
