Skip to content
This repository has been archived by the owner on Nov 1, 2024. It is now read-only.

Broadcast json index file rather than loading on all ranks #752

Open
wants to merge 1 commit into
base: cm3v2_7b_freeze
Choose a base branch
from

Conversation

sriniiyer
Copy link
Contributor

Patch Description
Broadcast jsonl index instead of loading on all ranks, to prevent slowdown by multiple ranks reading the same file

Testing steps
Tested on 64 nodes, loaded in 2 mins.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants