This repository has been archived by the owner on Nov 2, 2023. It is now read-only.

sender_choleskey #28

Merged
5 commits merged into mhaseeb123:main on Oct 3, 2023

Conversation

@hcq9102 (Collaborator) commented Sep 26, 2023

  • using std::execution

Resolved issues:

  • [resolved] sync_wait() syntax issue (see the sketch below)

  • [resolved] last two diagonal results incorrect
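
For context, here is a minimal sketch of the stdexec sender pipeline and sync_wait() usage this PR relies on. This is not the PR's actual code; it only assumes the NVIDIA stdexec reference implementation headers and illustrates the general pattern:

// Minimal stdexec sender/receiver sketch: build a pipeline with just() and
// then(), then block on the result with sync_wait().
#include <stdexec/execution.hpp>

#include <cstdio>
#include <utility>

int main() {
  // just() starts the pipeline with a value; then() transforms it.
  auto work = stdexec::just(3.0)
            | stdexec::then([](double x) { return x * x; });

  // sync_wait() blocks until the sender completes and returns
  // std::optional<std::tuple<double>>.
  auto [result] = stdexec::sync_wait(std::move(work)).value();
  std::printf("result = %f\n", result);
  return 0;
}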

@mhaseeb123 merged commit b21fb35 into mhaseeb123:main on Oct 3, 2023
1 check passed
//
// This example provides a stdexec (senders/receivers) implementation of Cholesky decomposition.
#include <algorithm>
#include <exec/any_sender_of.hpp>

@mhaseeb123 (Owner) commented:

Would recommend cleaning up any unused headers like any_sender_of.hpp


sum_vec[piece] = std::transform_reduce(
    std::execution::par,
    counting_iterator(start),
@mhaseeb123 (Owner) commented Oct 3, 2023:

This is not correct since counting_iterator(start) and counting_iterator(start + N) are two separate objects and may not be iterable.

Update: this is actually valid since nvhpc/22.9+, as per https://forums.developer.nvidia.com/t/internal-compiler-error-bad-sptr-in-var-refsym/253631. The error was originating from cudart/11.7 (pulled in by cudatoolkit/11.7) in our default PM environment.
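
For reference, a minimal sketch of the counting-iterator pattern under discussion: summing f(i) over an index range [start, start + N) with std::transform_reduce and std::execution::par. This is not the PR's code; it assumes thrust::counting_iterator as the counting_iterator type, which is what nvhpc stdpar codes commonly use:

// Sketch: reduce over an index range delimited by two counting iterators.
#include <execution>
#include <numeric>
#include <vector>
#include <cstdio>

#include <thrust/iterator/counting_iterator.h>

int main() {
  std::vector<double> a(1000, 1.0), b(1000, 2.0);
  const double* pa = a.data();
  const double* pb = b.data();
  const int start = 0;
  const int N = static_cast<int>(a.size());

  // counting_iterator(start) and counting_iterator(start + N) are distinct
  // objects, but together they delimit a valid random-access index range
  // [start, start + N) that the parallel algorithm can walk.
  const double sum = std::transform_reduce(
      std::execution::par,
      thrust::counting_iterator<int>(start),
      thrust::counting_iterator<int>(start + N),
      0.0,
      std::plus<>{},
      [=](int i) { return pa[i] * pb[i]; });

  std::printf("sum = %f\n", sum);  // expected: 2000.0
  return 0;
}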

@mhaseeb123 (Owner) commented:

This is also being caught by the compiler; see below:

[screenshot of the compiler error]

@mhaseeb123 (Owner) commented:

Never mind, this error was coming from libcudart/11.7, which is already loaded in the modules and takes precedence even with the nvhpc/23.7 compiler. Doing ml unload cudatoolkit and rerunning cmake and make picks up the latest cudart/12.x from the nvhpc/23.x module, and everything works fine.

@hcq9102 (Collaborator, Author) commented Oct 4, 2023:

Great, thanks a lot!
Yes, I have used ml unload cudatoolkit when loading modules.

I build with the following options and have no build issues:
cmake .. -DCMAKE_CXX_COMPILER=$(which nvc++) -DCMAKE_C_COMPILER=$(which nvc) -DCMAKE_BUILD_TYPE=Release -DSTDPAR=gpu

modules:
ml use /global/cfs/cdirs/m1759/wwei/nvhpc_23_7/modulefiles ; ml unload cudatoolkit ; ml nvhpc/23.1 cmake/3.24
