Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Implementation of Horovod backend in Mesh TensorFlow #3

Open
2 of 5 tasks
EiffL opened this issue May 12, 2021 · 2 comments
Open
2 of 5 tasks

Implementation of Horovod backend in Mesh TensorFlow #3

EiffL opened this issue May 12, 2021 · 2 comments
Assignees
Labels
Hackathon Goal High level goals for the hack week Mesh TensorFlow Issues related to Mesh TensorFlow

Comments

@EiffL
Copy link
Member

EiffL commented May 12, 2021

This issue is to track the developments needed to finalize and validate the Mesh TensorFlow implementation relying on horovod for the backend. This overarching goal will encapsulate several smaller issues.

Goal

By the end of the hackweek, submit a Pull Request to https://github.com/tensorflow/mesh with our new implemenation for GPU clusters

Participants

The main participants to this task are:

Tasks

Progress made on these subtasks can be reported here.

@EiffL EiffL added Mesh TensorFlow Issues related to Mesh TensorFlow Hackathon Goal High level goals for the hack week labels May 12, 2021
@EiffL
Copy link
Member Author

EiffL commented May 19, 2021

And we have identified another issue here, I'm adding it tot the list of things we need to resolve: DifferentiableUniverseInitiative/mesh#4

@EiffL
Copy link
Member Author

EiffL commented May 26, 2021

We have managed to mostly solve this the two first points of this issue, by the following:

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Hackathon Goal High level goals for the hack week Mesh TensorFlow Issues related to Mesh TensorFlow
Projects
None yet
Development

No branches or pull requests

3 participants