Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[WIP] CUDASTF implementation of the stencil example #642

Merged
merged 15 commits into from
Nov 7, 2024

Commits on Oct 29, 2024

  1. - Implement the stencil example with CUDASTF

    - Add an installation script for CUDASTF, and define rules common/make.defs.cuda
    caugonnet committed Oct 29, 2024
    Configuration menu
    Copy the full SHA
    69605b9 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    858d6a2 View commit details
    Browse the repository at this point in the history
  3. dgemm kernel with CUDASTF

    caugonnet committed Oct 29, 2024
    Configuration menu
    Copy the full SHA
    1630e35 View commit details
    Browse the repository at this point in the history
  4. Implement dgemm with CUBLAS

    caugonnet committed Oct 29, 2024
    Configuration menu
    Copy the full SHA
    2e6fc13 View commit details
    Browse the repository at this point in the history
  5. move CUDASTF flags up and add {}

    Signed-off-by: Jeff Hammond <[email protected]>
    jeffhammond committed Oct 29, 2024
    Configuration menu
    Copy the full SHA
    3ed90bf View commit details
    Browse the repository at this point in the history
  6. add boilerplate and cleanup

    add NVIDIA copyright
    add Cedric attribution
    print the right name
    remove PRAGMA_SIMD
    add CUDA header for consistency
    
    Signed-off-by: Jeff Hammond <[email protected]>
    jeffhammond committed Oct 29, 2024
    Configuration menu
    Copy the full SHA
    3119ddd View commit details
    Browse the repository at this point in the history
  7. add stf alias for cudastf

    Signed-off-by: Jeff Hammond <[email protected]>
    jeffhammond committed Oct 29, 2024
    Configuration menu
    Copy the full SHA
    785c19d View commit details
    Browse the repository at this point in the history
  8. p2p example in CUDASTF

    caugonnet committed Oct 29, 2024
    Configuration menu
    Copy the full SHA
    824e35d View commit details
    Browse the repository at this point in the history

Commits on Oct 30, 2024

  1. Add transpose-cudastf

    caugonnet committed Oct 30, 2024
    Configuration menu
    Copy the full SHA
    b83e823 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    dd5f07b View commit details
    Browse the repository at this point in the history

Commits on Nov 7, 2024

  1. Configuration menu
    Copy the full SHA
    206b241 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    4502b04 View commit details
    Browse the repository at this point in the history
  3. credit Cedric

    jeffhammond committed Nov 7, 2024
    Configuration menu
    Copy the full SHA
    3bfd0cb View commit details
    Browse the repository at this point in the history
  4. remove unused parameter

    jeffhammond committed Nov 7, 2024
    Configuration menu
    Copy the full SHA
    dcc0b3d View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    b1424cb View commit details
    Browse the repository at this point in the history