Skip to content
This repository has been archived by the owner on Nov 25, 2024. It is now read-only.

Commit

Permalink
[Bugfix] Sync stream for scatter_op (#235)
Browse files Browse the repository at this point in the history
This is to address the issue found by: #229

Authors:
  - Chang Liu (https://github.com/chang-l)

Approvers:
  - https://github.com/linhu-nv
  - Brad Rees (https://github.com/BradReesWork)

URL: #235
  • Loading branch information
chang-l authored Nov 21, 2024
1 parent 0c0583d commit 6e77c99
Showing 1 changed file with 3 additions and 1 deletion.
4 changes: 3 additions & 1 deletion cpp/src/wholememory_ops/scatter_op_impl_mapped.cu
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
/*
* Copyright (c) 2019-2023, NVIDIA CORPORATION.
* Copyright (c) 2019-2024, NVIDIA CORPORATION.
*
* Licensed under the Apache License, Version 2.0 (the "License");
* you may not use this file except in compliance with the License.
Expand All @@ -18,6 +18,7 @@
#include <wholememory/env_func_ptrs.h>
#include <wholememory/wholememory.h>

#include "cuda_macros.hpp"
#include "wholememory_ops/functions/gather_scatter_func.h"

namespace wholememory_ops {
Expand All @@ -41,6 +42,7 @@ wholememory_error_code_t wholememory_scatter_mapped(
wholememory_desc,
stream,
scatter_sms);
WM_CUDA_CHECK(cudaStreamSynchronize(stream));
}

} // namespace wholememory_ops

0 comments on commit 6e77c99

Please sign in to comment.