Commit
* [NVIDIA] Add operator==/!= to DevicePointer
* [NVIDIA] Add CUDA::NodeParams, CUDA::TransferNode, CUDA::KernelNode
* [NVIDIA] Add kernel args getters for Insert/Slice
* [NVIDIA] Add KernelNodeTest and TransferNodeTest
* [NVIDIA] Fix review issues
* [NVIDIA] Add launchers to TI, refactor Execute()
* [NVIDIA] Add TiCudaGraphInfo
* [NVIDIA] Update TI to support CUDA graph as a body of iterations loop
* [NVIDIA] Add operator== for dim3, KernelNode and NodeParams
* [NVIDIA] Update Run() of *TopologyRunners to take non-const context reference
* [NVIDIA] Remove TiCudaGraphInfo, add set_current_graph(), add_new_graph_info(), get_current_graph_info(), select_current_graph()
* [NVIDIA] Change IsCudaGraphCompatible() interface to GetCudaGraphCompatibility() using enum
* [NVIDIA] Add ExecuteGraph() to IOperationExec/OperationBase
* [NVIDIA] Remove paramsGraph_/resultsGraph_ from CudaGraphInfo
* [CUDA] Merge, ph2
* [CUDA] Merge, ph3
* [CUDA] Merge, ph4
* [CUDA] Merge, ph5
* [CUDA] Merge, ph6

---------

Co-authored-by: Andrii Pavliuk <[email protected]>