decrease batch for hbm
ahmeda14960 committed Apr 24, 2024
1 parent 61d4e96 commit c94e433
Showing 1 changed file with 4 additions and 3 deletions.
7 changes: 4 additions & 3 deletions config/llama2_1b_mark.yaml
@@ -237,10 +237,11 @@ trainer:
 tags: ["pile", "llama", "web_comparison"]
 
 mp: p=f32,c=bfloat16
-train_batch_size: 512
-num_train_steps: 5000
+train_batch_size: 256
+num_train_steps: 10000
 steps_per_eval: 1000
-per_device_eval_parallelism: 64
+max_eval_batches: 2
+per_device_eval_parallelism: -1
 tensor_parallel_axes: ["mlp", "heads"]
 fsdp_axis: "embed"
 batch_axis: "batch"
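
A quick check of the arithmetic behind this change, as a sketch rather than anything taken from the repository: the batch size is halved and the step count is doubled, so the number of training examples consumed over the run stays the same. The commit message only says "decrease batch for hbm", so reading the doubled `num_train_steps` as compensation for the smaller batch is an assumption. The helper `total_examples` is hypothetical, and only the four values shown in the diff are used.

```python
# Values copied from the diff; nothing else from the config is assumed.
old = {"train_batch_size": 512, "num_train_steps": 5000}
new = {"train_batch_size": 256, "num_train_steps": 10000}


def total_examples(cfg):
    """Hypothetical helper: examples consumed over the whole training run."""
    return cfg["train_batch_size"] * cfg["num_train_steps"]


old_total = total_examples(old)
new_total = total_examples(new)

# Both configurations consume the same number of examples.
print(old_total, new_total)  # 2560000 2560000
```

The sequence length is not visible in this hunk, so the sketch counts examples rather than tokens.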