[EPIC] cuda::ptx
platform for exposing PTX instructions
#573
Labels
feature request
New feature or request.
cuda::ptx
platform for exposing PTX instructions
#573
Is this a duplicate?
Area
libcu++
Is your feature request related to a problem? Please describe.
The development of higher-level APIs lags behind the introduction of new PTX instructions in the CUDA Toolkit.
Describe the solution you'd like
Provide a means to add PTX instructions before higher-level exposure has been developed.
That is:
cuda/ptx
cuda::ptx
Tasks
cuda::ptx
#616Describe alternatives you've considered
The alternative is to wait for higher-level exposure, which may take a while.
Another alternative is to provide an intermediate exposure like the recent addition of
cuda::device::experimental::cp_async_bulk_global_to_shared
and friends. These wrapper functions are a hybrid between pure PTX wrapping functions using native types, and higher-level APIs that take higher-level types (likecuda::barrier<..>
). These are very useful for documenting features to outside developers, but require design and deviate from the PTX ISA.Additional context
No response
The text was updated successfully, but these errors were encountered: