Add specific communicator for neighborhood communication #1588
base: index-map-pgm
#cmakedefine GINKGO_FORCE_SPMV_BLOCKING_COMM
#cmakedefine01 GINKGO_HAVE_OPENMPI_PRE_4_1_X
Is this a public interface break?
Then again, I would consider it part of an experimental feature.
I can add the old macro again, but I'm not really sure how we usually treat macros with regard to interface stability. IMO we should state which macros are public and which are private, because this macro should definitely be private.
I would consider some of them public, like the version macros and probably mixed precision.
This one is only used in the experimental feature, so I also think it is free to change.
} // namespace gko


#endif
- } // namespace gko
- #endif
+ } // namespace gko
+ #endif
* Default constructor with empty communication pattern
* @param base the base communicator
- * Default constructor with empty communication pattern
- * @param base the base communicator
+ * Default constructor with empty communication pattern
+ *
+ * @param base the base communicator
* @tparam RecvType the type of the elements to receive
* @param exec the executor for the communication
- * @tparam RecvType the type of the elements to receive
- * @param exec the executor for the communication
+ * @tparam RecvType the type of the elements to receive
+ *
+ * @param exec the executor for the communication
* Default constructor with empty communication pattern
* @param base the base communicator
- * Default constructor with empty communication pattern
- * @param base the base communicator
+ * Default constructor with empty communication pattern
+ *
+ * @param base the base communicator
* @tparam GlobalIndexType the global index type of the map
* @param base the base communicator
- * @tparam GlobalIndexType the global index type of the map
- * @param base the base communicator
+ * @tparam GlobalIndexType the global index type of the map
+ *
+ * @param base the base communicator
* Equality is defined as having identical or congruent communicators and
* their communication pattern is equal. No communication is done, i.e.
* there is no reduction over the local equality check results.
- * Equality is defined as having identical or congruent communicators and
- * their communication pattern is equal. No communication is done, i.e.
- * there is no reduction over the local equality check results.
+ * Equality is defined as having identical or congruent communicators and
+ * their communication pattern is equal. There is no communication needed in this function, i.e.
+ * there is no reduction over the local equality check results.
LGTM in general.
* This implementation uses the neighborhood communication
* MPI_Ineighbor_alltoallv. See MPI documentation for more details.
wrong one
* @param recv_buffer the receive buffer
* @return a request handle
- * @param recv_buffer the receive buffer
- * @return a request handle
+ * @param recv_buffer the receive buffer
+ *
+ * @return a request handle
std::partial_sum(send_sizes_.begin(), send_sizes_.end(),
                 send_offsets_.begin() + 1);
std::partial_sum(recv_sizes_.begin(), recv_sizes_.end(),
                 recv_offsets_.begin() + 1);
Could you remind me of this behavior?
The constructor with the default value T() only existed until C++11, but the others rely on the allocator, which gives uninitialized storage?
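On the initialization question: since C++11, std::vector<T>(n) default-inserts its elements, and with the default std::allocator that means value-initialization, so an int vector starts out as all zeros. A minimal sketch, assuming the offsets live in a std::vector<int> with one more entry than the sizes (the helper name compute_offsets is made up for illustration and is not part of this PR):

```cpp
#include <cassert>
#include <numeric>
#include <vector>

// Hypothetical helper: turns per-neighbor sizes into prefix-sum offsets,
// mirroring the partial_sum calls in the diff above.
std::vector<int> compute_offsets(const std::vector<int>& sizes)
{
    // vector(n) value-initializes its elements when std::allocator is used,
    // so every entry, including offsets[0], starts as 0.
    std::vector<int> offsets(sizes.size() + 1);
    assert(offsets.front() == 0);
    // Writing the inclusive scan one position to the right leaves
    // offsets[0] == 0 and puts the total into offsets.back().
    std::partial_sum(sizes.begin(), sizes.end(), offsets.begin() + 1);
    return offsets;
}
```

So the leading zero does not need to be written explicitly, as long as the vector is created with the size constructor and the default allocator.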
* The send_buffer must have allocated at least get_send_size number of
* elements, and the recv_buffer must have allocated at least get_recv_size
* number of elements.
It only accepts plain pointers.
Should we use gko::array to ensure the size fulfills the requirement?
I don't think so. This would complicate using it by quite a lot. Our MPI wrappers are mostly designed to be very close to the MPI standard, so I would prefer the pointers here.
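If the interface stays pointer-based, the size check can still live on the caller's side. A rough sketch of that idea, assuming a stand-in struct for the fixed pattern; only the names get_send_size and get_recv_size come from the documentation quoted above, everything else (fixed_pattern, check_buffers) is hypothetical:

```cpp
#include <cassert>
#include <cstddef>
#include <numeric>
#include <stdexcept>
#include <vector>

// Hypothetical stand-in for a communicator with a fixed pattern.
struct fixed_pattern {
    std::vector<int> send_sizes;
    std::vector<int> recv_sizes;

    // Total number of elements sent to all neighbors.
    int get_send_size() const
    {
        return std::accumulate(send_sizes.begin(), send_sizes.end(), 0);
    }

    // Total number of elements received from all neighbors.
    int get_recv_size() const
    {
        return std::accumulate(recv_sizes.begin(), recv_sizes.end(), 0);
    }
};

// Caller-side check: validates the container sizes once; afterwards
// send.data() and recv.data() can be passed to a pointer-based call.
template <typename T>
void check_buffers(const fixed_pattern& pattern, const std::vector<T>& send,
                   const std::vector<T>& recv)
{
    if (send.size() < static_cast<std::size_t>(pattern.get_send_size()) ||
        recv.size() < static_cast<std::size_t>(pattern.get_recv_size())) {
        throw std::length_error("buffer too small for communication pattern");
    }
}
```

This keeps the wrapper itself close to the MPI signature while still letting higher-level code fail early on undersized buffers.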
Signed-off-by: Marcel Koch <[email protected]>
Co-authored-by: Pratik Nayak <[email protected]>
Signed-off-by: Marcel Koch <[email protected]>
- fix include guards
- update docs
- implement copy/move constructors/assignment with tests
- add equality test for collective communicators (needed for testing)
- always enable neighborhood comm, just throw if openmpi is too old
- define moved-from state as MPI_COMM_NULL + empty sizes/offsets
- remove unnecessary namespace
- make virtual function protected

Co-authored-by: Pratik Nayak <[email protected]>
Co-authored-by: Tobias Ribizel <[email protected]>

- documentation
- formatting

Co-authored-by: Yu-Hsiang M. Tsai <[email protected]>
Signed-off-by: Marcel Koch <[email protected]>

- merge tests

Co-authored-by: Yu-Hsiang M. Tsai <[email protected]>
Signed-off-by: Marcel Koch <[email protected]>

- refactor test
- fix docs

Co-authored-by: Yu-Hsiang M. Tsai <[email protected]>
Signed-off-by: Marcel Koch <[email protected]>
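
The moved-from state mentioned in the commit messages above (null communicator plus empty sizes and offsets) can be pictured roughly as follows. This is only a sketch under assumptions: an int with a sentinel value stands in for MPI_Comm and MPI_COMM_NULL so that it compiles without MPI, and the class and member names are hypothetical, not the ones used in the PR.

```cpp
#include <cassert>
#include <numeric>
#include <utility>
#include <vector>

// Stand-ins so the sketch compiles without MPI: an int plays the role of
// MPI_Comm and comm_null the role of MPI_COMM_NULL.
using comm_handle = int;
constexpr comm_handle comm_null = -1;

// Hypothetical communicator with a fixed send pattern.
class pattern_comm {
public:
    pattern_comm(comm_handle comm, std::vector<int> send_sizes)
        : comm_(comm),
          send_sizes_(std::move(send_sizes)),
          send_offsets_(send_sizes_.size() + 1)
    {
        std::partial_sum(send_sizes_.begin(), send_sizes_.end(),
                         send_offsets_.begin() + 1);
    }

    // Moving leaves the source with the null handle and empty vectors.
    pattern_comm(pattern_comm&& other) noexcept
        : comm_(std::exchange(other.comm_, comm_null)),
          send_sizes_(std::exchange(other.send_sizes_, {})),
          send_offsets_(std::exchange(other.send_offsets_, {}))
    {}

    pattern_comm& operator=(pattern_comm&& other) noexcept
    {
        if (this != &other) {
            comm_ = std::exchange(other.comm_, comm_null);
            send_sizes_ = std::exchange(other.send_sizes_, {});
            send_offsets_ = std::exchange(other.send_offsets_, {});
        }
        return *this;
    }

    comm_handle comm() const { return comm_; }
    const std::vector<int>& send_sizes() const { return send_sizes_; }
    const std::vector<int>& send_offsets() const { return send_offsets_; }

private:
    comm_handle comm_;
    std::vector<int> send_sizes_;
    std::vector<int> send_offsets_;
};
```

Using std::exchange makes the moved-from state explicit instead of relying on whatever the member-wise default move leaves behind.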
Semi-approval from my side, because I do not have experience with one-sided communication, especially its construction.
@@ -2,8 +2,8 @@
 //
 // SPDX-License-Identifier: BSD-3-Clause


-#ifndef GINKGO_PARTITION_HPP
-#define GINKGO_PARTITION_HPP
+#ifndef GKO_CORE_DISTRIBUTED_PARTITION_HPP
- #ifndef GKO_CORE_DISTRIBUTED_PARTITION_HPP
+ #ifndef GKO_CORE_DISTRIBUTED_DEVICE_PARTITION_HPP_
std::copy_n(recv_target_ids_arr->get_const_data(),
            recv_target_ids_arr->get_size(), recv_target_ids.begin());
for (size_type seg_id = 0;
     seg_id < imap.get_remote_global_idxs().get_segment_count(); ++seg_id) {
Dumb question: will imap.get_remote_global_idxs().get_segment_count() == imap.get_remote_target_ids().get_size()?
Yes. IMO we could have a better data structure for this, because the remote_target_ids act like keys to the segments.
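To make the "keys to the segments" remark concrete, here is one possible shape for such a structure. It is purely illustrative: keyed_segments and its members are invented names, not Ginkgo types, and the element types are assumptions. The point is that the key count and the segment count agree by construction.

```cpp
#include <cassert>
#include <cstddef>
#include <utility>
#include <vector>

// Hypothetical structure: one key (e.g. a target rank) per segment of a
// flat value buffer (e.g. remote global indices).
struct keyed_segments {
    std::vector<int> keys;
    // Segment i covers the half-open range [offsets[i], offsets[i + 1]).
    std::vector<std::size_t> offsets{0};
    std::vector<long> values;

    // Appends a segment together with its key, keeping both in sync.
    void add_segment(int key, const std::vector<long>& segment)
    {
        keys.push_back(key);
        values.insert(values.end(), segment.begin(), segment.end());
        offsets.push_back(values.size());
    }

    std::size_t get_segment_count() const { return keys.size(); }

    // Returns the index range of segment i inside values.
    std::pair<std::size_t, std::size_t> segment_range(std::size_t i) const
    {
        return {offsets[i], offsets[i + 1]};
    }
};
```

With the keys stored next to the offsets, a loop over segments can read the target id and the index range from the same object instead of from two separately sized arrays.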
This PR adds a communicator that only handles neighborhood all-to-all communication. It implements the new interface collective_communicator, which provides different implementations for a selected set of collective MPI routines. Currently, this only includes the non-blocking all-to-all.

The communication uses a fixed pattern, i.e. the send/recv sizes are fixed when the neighborhood communicator is constructed. I would have liked to decouple that, but this would require some knowledge of how the sizes are stored at the interface level. If someone has an idea for that, please let me know.
This is the first part of splitting up #1546.
The neighborhood all-to-all has a bug in OpenMPI < v4.1.0, which makes it necessary to disable the neighborhood communicator in this case. As replacement, there is also a dense all-to-all communicator.
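
The fallback described above could be pictured like this. It is a minimal sketch, assuming a common base class with two implementations and a compile-time flag in the style of GINKGO_HAVE_OPENMPI_PRE_4_1_X from the config header; all class and function names are placeholders, and the MPI calls are left out so that the selection logic stands on its own.

```cpp
#include <cassert>
#include <memory>
#include <string>

// With #cmakedefine01 the macro is always defined as 0 or 1. It is defined
// here only so that the sketch compiles without the generated config header.
#ifndef GINKGO_HAVE_OPENMPI_PRE_4_1_X
#define GINKGO_HAVE_OPENMPI_PRE_4_1_X 0
#endif

// Hypothetical interface for collective routines with a fixed pattern.
class collective_comm {
public:
    virtual ~collective_comm() = default;
    virtual std::string name() const = 0;
};

// Placeholder for the implementation based on neighborhood all-to-all.
class neighborhood_comm : public collective_comm {
public:
    std::string name() const override { return "neighborhood"; }
};

// Placeholder for the replacement based on a dense all-to-all.
class dense_comm : public collective_comm {
public:
    std::string name() const override { return "dense"; }
};

// Picks the dense variant when the affected OpenMPI versions are detected.
std::unique_ptr<collective_comm> make_comm(
    bool openmpi_pre_4_1 = GINKGO_HAVE_OPENMPI_PRE_4_1_X)
{
    if (openmpi_pre_4_1) {
        return std::make_unique<dense_comm>();
    }
    return std::make_unique<neighborhood_comm>();
}
```

Code that only depends on the base class does not need to know which variant was selected.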
Todo:
PR Stack: