
Add sequence packing for lm-eval-harness #850

Merged: 21 commits into main, Jan 8, 2025
Conversation

@dlwh (Member) commented Jan 7, 2025

Speedup was less than I hoped, but I think that can be improved with better packing strategies (and also by removing some batching overhead; the bottleneck is now data loading??)
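For readers unfamiliar with the idea: one common packing strategy is greedy first-fit-decreasing, where sequences are sorted by length and each is placed into the first row with enough remaining room. This is a hypothetical sketch, not the PR's actual implementation (`pack_sequences` and its signature are invented for illustration):

```python
def pack_sequences(lengths, max_len):
    """Greedy first-fit-decreasing packing.

    Returns a list of packs, each a list of sequence indices whose
    total length fits within max_len.
    """
    packs = []  # each entry is [remaining_room, list_of_indices]
    # Place longest sequences first; long items are hardest to fit.
    order = sorted(range(len(lengths)), key=lambda i: -lengths[i])
    for i in order:
        n = lengths[i]
        for pack in packs:
            if pack[0] >= n:        # first pack with enough room
                pack[0] -= n
                pack[1].append(i)
                break
        else:                        # no pack fits: open a new one
            packs.append([max_len - n, [i]])
    return [ids for _, ids in packs]

packs = pack_sequences([7, 3, 5, 2, 6, 1], max_len=8)
# e.g. [[0, 5], [4, 3], [2, 1]] — three full rows instead of six padded ones
```

First-fit-decreasing is a classic bin-packing heuristic; smarter strategies (e.g. optimal bin packing or length-aware batching) can squeeze out more utilization, which is presumably what the comment above alludes to.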

@dlwh dlwh changed the title Packing Add sequence packing for lm-eval-harness Jan 7, 2025
Contributor

Was this intended to be part of the PR? Just checking.

@dlwh (Member, Author)

nah, but i don't care enough to take it out :-)

@nikil-ravi (Contributor) commented Jan 8, 2025

Curious whether we know the rough ratio of padding to non-padding tokens with this packing strategy?

@dlwh (Member, Author) commented Jan 8, 2025

It's about 90% real tokens (as opposed to ~90% padding before).
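To make the metric concrete: the real-token fraction is total sequence tokens divided by the batch's total capacity (rows × max length). A toy illustration with invented lengths, not the PR's actual measurements:

```python
def real_token_fraction(lengths, rows, max_len):
    """Fraction of slots in a (rows x max_len) batch holding real tokens."""
    return sum(lengths) / (rows * max_len)

lengths = [100, 120, 80] * 10  # 30 short sequences, 3000 tokens total

# Unpacked: one sequence per row, each padded out to max_len.
unpacked = real_token_fraction(lengths, rows=30, max_len=1024)  # ≈ 0.10
# Packed: the same sequences concatenated into 3 rows of 1024.
packed = real_token_fraction(lengths, rows=3, max_len=1024)     # ≈ 0.98
```

With short sequences and a long context window, packing flips the ratio from mostly padding to mostly real tokens, consistent with the ~90% figures quoted above.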

@dlwh dlwh merged commit 93a8aa9 into main Jan 8, 2025
8 checks passed
@dlwh dlwh deleted the packing branch January 8, 2025 18:32