Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Docs/PTX] Add device tensor map init example #1983

Merged

Conversation

ahendriksen
Copy link
Contributor

Description

closes #1982

Adds documentation of tensormap modification/initialization on device. Also improves on navigation and table of contents of the cuda::ptx docs.

Checklist

  • New or existing tests cover these changes.
  • The documentation is up to date with these changes.

@ahendriksen ahendriksen requested review from a team as code owners July 12, 2024 11:57
-----------------

The full code example is included below. The `cuda::ptx` instructions in this
tutorial have become available in CUDA Toolkit 12.5.
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Godbolt does not yet have CUDA 12.5, so there is no link here. I didn't want to link to a broken example.

@ahendriksen
Copy link
Contributor Author

Please don't merge yet. I still have to incorporate some internal feedback.

@ahendriksen ahendriksen added the blocked This PR cannot be merged due to various reasons label Jul 25, 2024
@ahendriksen ahendriksen force-pushed the docs-add-device-tensor-map-init-example branch from 68653eb to 90b6aa8 Compare October 7, 2024 13:01
@ahendriksen
Copy link
Contributor Author

In the mean time, the code example for on-device tensor map modification has made it into the CUDA programming guide. Instead of duplicating the documentation and code sample, I have linked to the relevant section in the programming guide. The improvements to table of contents and layout are still very much worth it in this PR.

I have copied over the fixed links from Bryan van de Ven's PR.

@ahendriksen ahendriksen removed the blocked This PR cannot be merged due to various reasons label Nov 19, 2024
@ahendriksen ahendriksen force-pushed the docs-add-device-tensor-map-init-example branch from 648c966 to 0dabcb9 Compare November 19, 2024 08:49
Copy link

copy-pr-bot bot commented Nov 19, 2024

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@ahendriksen
Copy link
Contributor Author

/ok to test

@ahendriksen ahendriksen enabled auto-merge (squash) November 19, 2024 09:24
@ahendriksen
Copy link
Contributor Author

/ok to test

@miscco
Copy link
Collaborator

miscco commented Nov 19, 2024

/ok to test

@ahendriksen ahendriksen merged commit 55ca56e into NVIDIA:main Nov 19, 2024
19 checks passed
@ahendriksen ahendriksen deleted the docs-add-device-tensor-map-init-example branch November 19, 2024 09:55
trxcllnt pushed a commit to trxcllnt/cccl that referenced this pull request Nov 23, 2024
* docs: Add device tensor map initialization example

* docs: Fix table of contents

There were quite a few empty pages. This commit makes sure that
navigation makes sense.

* docs: Spelling

Address and tensor map


Co-authored-by: Michael Schellenberger Costa <[email protected]>
Co-authored-by: Giannis Gonidelis <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: Done
Development

Successfully merging this pull request may close these issues.

[DOC]: PTX: Document how to initialize/modify a tensor map in device memory
6 participants