TTG device tasks with coroutines #265



@devreal devreal commented Sep 29, 2023

Supersedes #253

devreal and others added 17 commits August 28, 2023 22:23
Most functionality is an extension/copy of the CUDA support. Also ported
the bspmm example to HIP.

Signed-off-by: Joseph Schuchart <[email protected]>
Avoids CMake errors caused by LAPACK pulling in an OpenMP_C dependency that MADNESS does not know about.
@devreal devreal requested review from evaleev and therault September 29, 2023 20:13
@devreal devreal marked this pull request as draft September 29, 2023 20:13
devreal and others added 10 commits October 1, 2023 18:12
Signed-off-by: Joseph Schuchart <[email protected]>
Notable changes:
- Add ttg::device::current_device and ttg::device::current_stream to
  query the current device and stream
- Consistently start counting devices at 0
- The host has ID -2
- Fixes in the broadcast

Signed-off-by: Joseph Schuchart <[email protected]>
This fixes a segfault in multi-gpu operations. We should not destroy the
full data_t but only remove the host-side copy and release our reference
on the data. The data will be destroyed once all gpu copies have been
removed.

Signed-off-by: Joseph Schuchart <[email protected]>
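
The lifetime rule described in the commit message above can be illustrated with a minimal sketch. Everything here is hypothetical: `TrackedData`, `add_copy`, `remove_copy`, and `release_host_side` are illustrative names, not TTG or PaRSEC API, and the real data_t bookkeeping is certainly more involved. The sketch only shows the idea that releasing the host side drops one copy, and that the object goes away once the last copy is removed.

```cpp
#include <cassert>
#include <set>

// Hypothetical stand-in for a runtime data object that tracks which
// devices currently hold a copy. Not the actual data_t from the PR.
struct TrackedData {
  // The commit notes above say the host has ID -2; reused here for illustration.
  static constexpr int kHostDevice = -2;

  std::set<int> copies;    // device IDs that hold a copy
  bool destroyed = false;  // set once the last copy has been removed

  void add_copy(int device) {
    assert(!destroyed);
    copies.insert(device);
  }

  // Remove a single copy; the object is destroyed only when no copies remain.
  void remove_copy(int device) {
    assert(!destroyed);
    copies.erase(device);
    if (copies.empty()) destroyed = true;
  }
};

// Releasing the host side removes only the host copy instead of
// tearing down the whole object while device copies are still alive.
inline void release_host_side(TrackedData& d) {
  d.remove_copy(TrackedData::kHostDevice);
}
```

With a host copy and two GPU copies, calling `release_host_side` leaves the object alive; it is marked destroyed only after both GPU copies have been removed as well.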
@evaleev evaleev force-pushed the ttg-device-support-master-coro-with-stream-tasks branch from 2166d5e to f05530f Compare February 1, 2024 05:25
@evaleev evaleev force-pushed the ttg-device-support-master-coro-with-stream-tasks branch from 753ccb9 to 1ffaf6a Compare February 2, 2024 14:41
devreal and others added 20 commits February 2, 2024 09:44
Signed-off-by: Joseph Schuchart <[email protected]>
For small objects it is beneficial to pack the data into the
activation message instead of going through an extra GET.


Signed-off-by: Joseph Schuchart <[email protected]>
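
As a rough illustration of the trade-off in the commit message above, the sketch below chooses between inlining a payload and leaving it for a later GET. The `ActivationMsg` layout, the `make_activation` helper, and the 256-byte `kInlineThreshold` are assumptions made for this example; the PR does not state the actual message format or cutoff.

```cpp
#include <cassert>
#include <cstddef>
#include <cstring>
#include <vector>

// Hypothetical activation message: either carries the data inline or
// records where the receiver has to fetch it from with a separate GET.
struct ActivationMsg {
  bool inline_payload = false;         // true: data travels with the message
  std::vector<unsigned char> payload;  // filled only when inline_payload is true
  const void* remote_addr = nullptr;   // set only when a GET is still required
  std::size_t size = 0;                // size of the object in bytes
};

// Assumed cutoff in bytes; the real value (if any) is not given in the PR.
constexpr std::size_t kInlineThreshold = 256;

inline ActivationMsg make_activation(const void* data, std::size_t size) {
  assert(data != nullptr || size == 0);
  ActivationMsg msg;
  msg.size = size;
  if (size <= kInlineThreshold) {
    // Small object: copy it into the message and skip the extra round trip.
    msg.inline_payload = true;
    msg.payload.resize(size);
    if (size > 0) std::memcpy(msg.payload.data(), data, size);
  } else {
    // Large object: send only the address; the receiver issues a GET.
    msg.remote_addr = data;
  }
  return msg;
}
```

A single `double` would be inlined under this assumed threshold, while a 1 KiB buffer would be left in place for the receiver to fetch.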
Work may be performed by the owner of A, B, or C, or the default
honoring the R parameter.

Signed-off-by: Joseph Schuchart <[email protected]>
For matrix A, we make sure that tiles are sent out in column-order (left to right).
For matrix B, we want to make sure that tiles are sent out in row-order (top to bottom).

Signed-off-by: Joseph Schuchart <[email protected]>
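
The two send orders described above can be sketched as plain sorts over tile indices. `TileIdx`, `order_for_A`, and `order_for_B` are hypothetical names for this example, and the tie-breaking within a column or row is an assumption; the bspmm example in the PR may establish the order differently.

```cpp
#include <algorithm>
#include <cassert>
#include <vector>

// Hypothetical tile index: (row, col) position of a tile in a matrix.
struct TileIdx {
  int row;
  int col;
};

// Matrix A: send tiles column by column, left to right.
// Within a column, tiles are ordered by row (an assumption of this sketch).
inline void order_for_A(std::vector<TileIdx>& tiles) {
  std::sort(tiles.begin(), tiles.end(), [](const TileIdx& x, const TileIdx& y) {
    return x.col != y.col ? x.col < y.col : x.row < y.row;
  });
}

// Matrix B: send tiles row by row, top to bottom.
// Within a row, tiles are ordered by column (an assumption of this sketch).
inline void order_for_B(std::vector<TileIdx>& tiles) {
  std::sort(tiles.begin(), tiles.end(), [](const TileIdx& x, const TileIdx& y) {
    return x.row != y.row ? x.row < y.row : x.col < y.col;
  });
}
```

For a 2x2 tile grid, `order_for_A` yields (0,0), (1,0), (0,1), (1,1), and `order_for_B` yields (0,0), (0,1), (1,0), (1,1).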
…thub.com:devreal/ttg into ttg-device-support-master-coro-with-stream-tasks
Signed-off-by: Joseph Schuchart <[email protected]>
Signed-off-by: Joseph Schuchart <[email protected]>
… only produce if Catch's `verbosity` != `quiet`
@@ -17,7 +17,7 @@ if (NOT TARGET PaRSEC::parsec)

FetchContent_Declare(
PARSEC
- GIT_REPOSITORY https://github.com/ICLDisco/parsec.git
+ GIT_REPOSITORY https://github.com/devreal/parsec-1.git

is there a PR on https://github.com/ICLDisco/parsec.git that we can quote here?

@evaleev evaleev merged commit f89cf45 into TESSEorg:ttg-device-support-master-coro-with-stream-tasks Feb 24, 2024
4 checks passed