llama : use unused n_embd_k_gqa in k_shift
This also slightly reduces the diff from the master branch
compilade committed Sep 2, 2024
1 parent 5f62db7 commit 375de5b
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion src/llama.cpp
@@ -10806,7 +10806,7 @@ struct llm_build_context {
ggml_view_3d(ctx0, kv_self.k_l[il],
n_embd_head_k, n_head_kv, n_ctx,
ggml_row_size(kv_self.k_l[il]->type, n_embd_head_k),
- ggml_row_size(kv_self.k_l[il]->type, hparams.n_embd_k_gqa(il)),
+ ggml_row_size(kv_self.k_l[il]->type, n_embd_k_gqa),
0),
lctx.inp_K_shift, rope_factors, n_rot, rope_type, n_ctx_orig, freq_base, freq_scale,
ext_factor, attn_factor, beta_fast, beta_slow);
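The two `ggml_row_size(...)` arguments in the hunk above are the byte strides of the 3D view over the K cache. The sketch below illustrates how such strides could be computed. It is not ggml code: `row_size_bytes` and `k_view_strides` are hypothetical helpers, the element type is assumed to be non-quantized with a fixed byte size, and `n_embd_k_gqa` is assumed to equal `n_embd_head_k * n_head_kv`.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical stand-in for ggml_row_size(), valid only under the assumption
// of a non-quantized type: bytes occupied by n_elements contiguous elements.
static size_t row_size_bytes(size_t elem_size, size_t n_elements) {
    return elem_size * n_elements;
}

// Byte strides of a [n_embd_head_k, n_head_kv, n_ctx] view over the K cache.
struct KViewStrides {
    size_t nb1; // bytes between consecutive KV heads within one position
    size_t nb2; // bytes between consecutive cache positions
};

static KViewStrides k_view_strides(size_t elem_size,
                                   size_t n_embd_head_k,
                                   size_t n_head_kv) {
    // Assumption: total K width per position is head size times KV head count.
    const size_t n_embd_k_gqa = n_embd_head_k * n_head_kv;

    KViewStrides s;
    s.nb1 = row_size_bytes(elem_size, n_embd_head_k);
    s.nb2 = row_size_bytes(elem_size, n_embd_k_gqa);
    return s;
}
```

With 2-byte elements, a head size of 128, and 8 KV heads, `k_view_strides(2, 128, 8)` gives `nb1 == 256` and `nb2 == 2048`. Under these assumptions the stride is the same whether `n_embd_k_gqa` is read from a local variable or recomputed through `hparams`, which fits the commit's description of reusing an otherwise unused local.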
