Add JAIS model(s) #8118
Conversation
convert-hf-to-gguf.py
Outdated
@@ -427,9 +427,6 @@ def get_vocab_base_pre(self, tokenizer) -> str:
        # NOTE: if you get an error here, you need to update the convert-hf-to-gguf-update.py script
        # or pull the latest version of the model from Huggingface
        # don't edit the hashes manually!
        if chkhsh == "0ef9807a4087ebef797fc749390439009c3b9eda9ad1a097abbe738f486c01e5":
Not really sure why this happened...
I ran convert-hf-to-gguf-update.py as instructed
Most likely you don't have access to the HF repos
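For context, the pre-tokenizer detection that produces these `chkhsh` values can be sketched roughly as follows. This is a simplified stand-in, not the real script: convert-hf-to-gguf-update.py uses a much longer probe string and the actual Hugging Face tokenizers, but the mechanism (hash the token IDs produced for a fixed probe text) is the same idea.

```python
import hashlib

def vocab_pre_hash(encode_fn, probe_text: str) -> str:
    # Encode a fixed probe string and hash the resulting token IDs.
    # A tokenizer the script has never seen yields a hash missing from
    # the chkhsh table, which triggers the NOTE in the diff above.
    token_ids = encode_fn(probe_text)
    return hashlib.sha256(str(token_ids).encode()).hexdigest()

# Toy "tokenizer" standing in for tokenizer.encode (illustrative only)
toy_encode = lambda s: [ord(c) for c in s]
h = vocab_pre_hash(toy_encode, "Hello, world!")
print(h)
```

Because the hash depends on the tokenizer's actual output, regenerating it requires access to the original HF repos, which is why a missing repo shows up as an unexpected hash.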
convert-hf-to-gguf.py
Outdated
        return tensors
too many new lines
examples/main/main.cpp
Outdated
@@ -733,7 +733,6 @@ int main(int argc, char ** argv) {

        // Console/Stream Output
        fprintf(stdout, "%s", token_str.c_str());
revert
ggml.c
Outdated
@@ -13516,13 +13516,13 @@ static void ggml_compute_forward_soft_max_f32(
                } else {
                    for (int i = 0; i < nc; ++i) {
                        wp[i] += slope*mp_f32[i];
revert changes in this file
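For reference, the `wp[i] += slope*mp_f32[i]` line above is where the ALiBi position bias gets folded into the attention scores before the softmax. A simplified sketch of the idea follows; it assumes `n_heads` is a power of two (llama.cpp handles the general case with a second interpolated slope sequence), and the function names here are illustrative, not llama.cpp's.

```python
def alibi_slopes(n_heads: int, max_bias: float = 8.0) -> list[float]:
    # Each head h gets a geometrically decaying slope 2^(-max_bias*(h+1)/n_heads);
    # max_bias is the value discussed in this PR (8.0 for JAIS).
    return [2.0 ** (-max_bias * (h + 1) / n_heads) for h in range(n_heads)]

def apply_alibi(scores: list[float], slope: float) -> list[float]:
    # Mirrors wp[i] += slope * mp_f32[i]: the bias grows linearly with distance
    # from the current (last) position, so nearer tokens are penalized less.
    n = len(scores)
    return [s + slope * -(n - 1 - i) for i, s in enumerate(scores)]
```

This is why JAIS needs no learned position embeddings: the per-head linear bias alone encodes relative position.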
llama.cpp
Outdated
@@ -6700,7 +6733,6 @@ static bool llm_load_tensors(
            case LLM_ARCH_BITNET:
                {
                    model.tok_embd = ml.create_tensor(ctx_input, tn(LLM_TENSOR_TOKEN_EMBD, "weight"), {n_embd, n_vocab});
revert
Force-pushed from 22a6497 to 1a51b36
@ggerganov @slaren Thanks in advance!
@slaren @ggerganov
Here is some sample output from Jais-13b-chat (ignoring proper system-prompting):

Instruction: Translate قهوة into English.
What is the English translation for the word قهوة? ("coffee")
Pronunciation: /kah-wah/
Definition: A dark, bitter fluid produced by the roasted seeds of several species of shrub of the genus Coffea. [end of text]

Instruction: جاوب باللغة الانجليزية: ما هو مشروبك المفضل؟ ("Answer in English: what is your favorite drink?")
إجابة ("Answer"): My favorite drink is tea. [end of text]
src/llama.cpp
Outdated
            // TODO: become GGUF KV parameter
            hparams.f_max_alibi_bias = 8.0f;
There is already a `<arch>.attention.max_alibi_bias` parameter. I know that this was copied from other archs that still hardcode this parameter, but I don't think we should do this for new archs. Instead, it should be added as metadata in convert-hf-to-gguf.py with `gguf_writer.add_max_alibi_bias`.
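A rough sketch of what that suggestion looks like on the converter side. The `FakeGGUFWriter` stub and `JaisModelSketch` class here are purely illustrative; the real script subclasses `Model` in convert-hf-to-gguf.py and uses `gguf.GGUFWriter`, whose `add_max_alibi_bias` call writes the `<arch>.attention.max_alibi_bias` key.

```python
class FakeGGUFWriter:
    """Stub standing in for gguf.GGUFWriter (illustration only)."""
    def __init__(self):
        self.kv = {}
    def add_max_alibi_bias(self, bias: float):
        # The real writer serializes this under <arch>.attention.max_alibi_bias
        self.kv["attention.max_alibi_bias"] = bias

class JaisModelSketch:
    """Hypothetical converter class mirroring a Model subclass in convert-hf-to-gguf.py."""
    def __init__(self, writer):
        self.gguf_writer = writer
    def set_gguf_parameters(self):
        # Emit the bias as model metadata instead of hardcoding it in llama.cpp
        self.gguf_writer.add_max_alibi_bias(8.0)

writer = FakeGGUFWriter()
JaisModelSketch(writer).set_gguf_parameters()
```

With the value in the GGUF metadata, llama.cpp can read it at load time instead of special-casing the architecture.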
Ah perfect
I'll update it shortly
@slaren @ggerganov Let me know if you have any more comments on this. If not, can you please merge it when you get the chance (I don't have permission)?
* Add `JAIS` model(s)
* cleanup
* address review comments
* remove hack
* un-hardcode max-alibi-bias
* minor tweaks

Co-authored-by: fmz <[email protected]>
Add support for Jais and Jais-chat: new state-of-the-art Arabic-centric foundation and instruction-tuned open generative large language models. https://arxiv.org/abs/2308.16149
The model is essentially GPT-2 with some modifications, notably ALiBi positional biases (see the max_alibi_bias discussion above) in place of learned position embeddings.
For this PR I only added support for the 13B and 1.3B Jais models.
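Once merged, converting and running one of these models looks roughly like this. This is a hedged usage sketch: the local checkout path, output file name, and generation flags are illustrative and depend on your setup.

```shell
# Convert a local HF checkout of JAIS to GGUF (f16); path and output name are illustrative
python convert-hf-to-gguf.py ./jais-13b-chat --outfile jais-13b-chat-f16.gguf --outtype f16

# Run a quick prompt through the converted model
./llama-cli -m jais-13b-chat-f16.gguf -p "Translate قهوة into English." -n 64
```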