[WIP] CUDASTF implementation of the stencil example #642

Merged 15 commits on Nov 7, 2024

@caugonnet commented Oct 29, 2024

New PRK implementation checklist

CUDASTF implementation

Which kernels are implemented?

  • [ ] synch_p2p (p2p)
  • [x] stencil
  • [ ] transpose
  • [ ] nstream
  • [ ] dgemm
  • [ ] reduce
  • [ ] sparse
  • [ ] branch
  • [ ] random
  • [ ] refcount
  • [ ] synch_global
  • [ ] PIC
  • [ ] AMR

Documentation and build examples

install CUDASTF with ci/install-cudastf.sh
use common/make.defs.cuda
cd Cxx11/ (open question: do we need to define Cxx17?)
make stencil-cudastf
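
The steps above could be scripted roughly as follows. This is only a sketch under a few assumptions that the PR does not state: that it runs from the root of a PRK checkout, that `ci/install-cudastf.sh` needs no arguments, and that the build reads `common/make.defs`, so the CUDA template is copied there. The helper names `run` and `build_stencil_cudastf` and the `DRY_RUN` variable are hypothetical and exist only for this example.

```shell
#!/bin/sh
# Sketch of the build steps, meant to be run from the root of a PRK checkout.
# DRY_RUN=1 (the default here) only prints each command; DRY_RUN=0 executes it.

run() {
    # Print the command, then execute it unless this is a dry run.
    echo "+ $*"
    if [ "${DRY_RUN:-1}" = "0" ]; then
        "$@"
    fi
}

build_stencil_cudastf() {
    # Step 1: install CUDASTF (assumption: the script takes no arguments).
    run sh ci/install-cudastf.sh || return 1
    # Step 2: assumption: the build reads common/make.defs, so copy the CUDA template there.
    run cp common/make.defs.cuda common/make.defs || return 1
    # Steps 3 and 4: build the stencil target inside Cxx11/ (same as cd Cxx11 && make ...).
    run make -C Cxx11 stencil-cudastf || return 1
}

build_stencil_cudastf
```

Running it with the default dry-run setting only lists the commands, which makes it easy to check the sequence before executing it with `DRY_RUN=0`.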

We strongly recommend that you add the relevant features
to make.defs.${toolchain} where applicable.

Do you certify that your contribution is made in good faith and does not attempt to introduce any negative behavior into this project?

  • [x] Yes
  • [ ] No

- Add an installation script for CUDASTF, and define rules in common/make.defs.cuda
@jeffhammond merged commit e42232e into ParRes:main on Nov 7, 2024