Skip to content

Commit

Permalink
actually defaulting to None is the right call, throughput suffers still
Browse files Browse the repository at this point in the history
  • Loading branch information
dlwh committed Nov 12, 2024
1 parent c8ea018 commit f8ab86b
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion src/levanter/models/lm_model.py
Original file line number Diff line number Diff line change
Expand Up @@ -71,7 +71,7 @@ def Pos(self) -> Axis:
def Embed(self) -> Axis:
pass

cross_entropy_block_size: Optional[int] = 64000
cross_entropy_block_size: Optional[int] = None
"""
The block size for computing cross-entropy loss. This is the number of tokens that are processed together
in a single block. This can be adjusted to fit within memory constraints. It's deliberately set to a large
Expand Down

0 comments on commit f8ab86b

Please sign in to comment.