Add ShieldGemma class to KerasHub #1974

RyanMullins · 2024-11-05T20:38:01Z

Draft PR to facilitate discussion around #1973.

mattdangerw · 2024-11-20T20:36:36Z

Sorry for the delay! I've been on leave, just getting back. Still thinking on this, not sure what to do...

One potential option would be to expose a more robust tool for classifying with a language model. Something like a TextClassifierLM task class, that takes in a token -> class idx mapping, and an optional prompt template. Could be both fit() and predict() friendly, using regular the causal "supervised fine-tuning" for training. So you could use it in pure inference mode for shield gemma (we'd have to resave the model including this task config including the 0 -> yes, 1 -> no map), or use it to DIY fine tune a gemma classifier for say for any classification problem via this predict a single token setup.

Or we could go with something like this, that is totally ad-hoc. But totally ad-hoc tasks might run us into hot water elsewhere. For example huggingface is trying to generate code snippets for user uploaded KerasHub models in a reliable fashion, and having a more consistent cross model task flow definitely helps with that aim.

github-actions bot added the Gemma Gemma model specific issues label Nov 5, 2024

RyanMullins force-pushed the shieldgemma branch from 0453a4a to 4c4571f Compare November 5, 2024 20:40

Skeleton ShieldGemma class

bc1af1f

RyanMullins force-pushed the shieldgemma branch from 4c4571f to bc1af1f Compare November 5, 2024 20:47

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add ShieldGemma class to KerasHub #1974

Add ShieldGemma class to KerasHub #1974

RyanMullins commented Nov 5, 2024

mattdangerw commented Nov 20, 2024

Add ShieldGemma class to KerasHub #1974

Are you sure you want to change the base?

Add ShieldGemma class to KerasHub #1974

Conversation

RyanMullins commented Nov 5, 2024

mattdangerw commented Nov 20, 2024