Initial changes to support cufile stream I/O. #259
Conversation
/ok to test
Thanks @tell-rebanta. Overall, I think it looks good, but I am wondering if it makes sense to de-couple the CUDA stream from StreamHandle?
As it is now, you are forced to create a stream per file, but I think a common use case is to read/write to multiple files using the same stream. What about implementing read_async() and write_async() in the existing class FileHandle instead? read_async() and write_async() could then take a CUstream as argument.
Side note: to check and fix the code style, run:
pre-commit run --all-files
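For illustration, here is a minimal sketch of what that suggested FileHandle-based API could look like from the caller's side; the read_async() signature and the file paths are assumptions made for this sketch, not the API as reviewed in this PR.

```cpp
// Sketch only: read_async()/write_async() live on FileHandle and take the CUDA
// stream explicitly, so a single stream can serve I/O on multiple files.
#include <kvikio/file_handle.hpp>

#include <cuda.h>
#include <cstddef>

void read_two_files_one_stream(void* dev_buf_a, void* dev_buf_b, std::size_t size, CUstream stream)
{
  kvikio::FileHandle file_a("/tmp/file-a", "r");
  kvikio::FileHandle file_b("/tmp/file-b", "r");

  // Both reads are enqueued on the same caller-owned stream
  // (assumed signature: read_async(devPtr, size, file_offset, devPtr_offset, stream)).
  file_a.read_async(dev_buf_a, size, 0, 0, stream);
  file_b.read_async(dev_buf_b, size, 0, 0, stream);

  // Completion is observed by synchronizing the stream.
  cuStreamSynchronize(stream);
}
```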
cpp/examples/basic_io.cpp (outdated):
kvikio::StreamHandle s_handle_wr("/tmp/test-file", "w", kvikio::StreamHandle::m644, true);
check(cudaMemcpy(a_dev, a, SIZE, cudaMemcpyHostToDevice) == cudaSuccess);
kvikio::buffer_register(a_dev, SIZE);
Do we need to register the a_dev buffer in order to use it with stream I/O?
- Moved the async I/O APIs to file_handle and made stream_handle a derived class of file_handle.
- Async I/O can be invoked either through file_handle or stream_handle.
- Added basic I/O examples for both ways.
- Removed unnecessary logs in CMakeLists.txt.
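As a rough illustration of the restructuring described in the commit message above, a sketch is shown below; the class name, constructor arguments, and the FileHandle::read_async signature it forwards to are assumptions based on this discussion, not the actual implementation.

```cpp
// Sketch: the async APIs live on FileHandle; a StreamHandle-style class derives
// from it and binds a CUDA stream so per-call stream arguments can be omitted.
#include <kvikio/file_handle.hpp>

#include <cuda.h>
#include <cstddef>
#include <string>
#include <sys/types.h>

class StreamHandleSketch : public kvikio::FileHandle {
 public:
  StreamHandleSketch(const std::string& path, const std::string& flags, CUstream stream)
    : kvikio::FileHandle(path, flags), _stream{stream}
  {
  }

  // Forwards to the base-class async API with the bound stream
  // (the FileHandle::read_async signature used here is assumed).
  void read_async(void* devPtr, std::size_t size, off_t file_offset = 0, off_t devPtr_offset = 0)
  {
    kvikio::FileHandle::read_async(devPtr, size, file_offset, devPtr_offset, _stream);
  }

 private:
  CUstream _stream;
};
```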
Thanks @tell-rebanta, I like that the stream is now an explicit argument for read_async and write_async!
We need to guard the stream API using KVIKIO_CUFILE_STREAM_API_FOUND, and I suggest that we exclude StreamHandle from this PR.
Let's discuss how to design the stream registration in a follow-up PR (I think we need to de-couple it from the file handle).
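For reference, the guarding could look roughly like the sketch below; KVIKIO_CUFILE_STREAM_API_FOUND is the macro named above, while the surrounding class fragment and the declaration itself are only an assumed example.

```cpp
// Sketch: only expose the stream (async) I/O API when the build detected the
// cuFile stream functions, so environments without them still compile.
#include <cuda.h>
#include <cstddef>
#include <sys/types.h>

class FileHandleSketch {
 public:
#ifdef KVIKIO_CUFILE_STREAM_API_FOUND
  // Assumed example declaration: enqueues the read on `stream` instead of blocking.
  void read_async(void* devPtr_base,
                  std::size_t size,
                  off_t file_offset,
                  off_t devPtr_offset,
                  CUstream stream);
#endif
  // ... the synchronous API remains available unconditionally ...
};
```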
/ok to test
Will be added as part of a separate check-in.
/ok to test
* @param size Size in bytes to read.
* @param file_offset Offset in the file to read from.
* @param devPtr_offset Offset relative to the `devPtr_base` pointer to read into.
* This parameter should be used only with registered buffers.
Is there a way to distinguish registered buffers in the type system at the moment? That would be preferable over this void* C-like interface.
No, not ATM but it would be a cool addition: #266
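Purely as a hypothetical sketch of the idea behind #266: a thin ownership type could mark registered buffers in the type system. The wrapper class, the header path, and the buffer_deregister counterpart are assumptions; only buffer_register appears in the code quoted earlier.

```cpp
// Hypothetical RegisteredBuffer: encode "this device memory is registered with
// cuFile" in the type system instead of passing a raw void* around.
#include <kvikio/buffer.hpp>  // assumed header for buffer_register/buffer_deregister

#include <cstddef>

class RegisteredBuffer {
 public:
  RegisteredBuffer(void* devPtr, std::size_t size) : _ptr{devPtr}, _size{size}
  {
    kvikio::buffer_register(_ptr, _size);
  }
  ~RegisteredBuffer() { kvikio::buffer_deregister(_ptr); }  // assumed counterpart of buffer_register
  RegisteredBuffer(const RegisteredBuffer&)            = delete;
  RegisteredBuffer& operator=(const RegisteredBuffer&) = delete;

  void* data() const noexcept { return _ptr; }
  std::size_t size() const noexcept { return _size; }

 private:
  void* _ptr;
  std::size_t _size;
};

// A read_async overload taking a RegisteredBuffer (plus devPtr_offset) could then
// be distinguished from the plain void* overload at compile time.
```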
/ok to test
I think we can remove the CUDA stream API
Co-authored-by: Lawrence Mitchell <[email protected]>
/ok to test
Thanks @wence-, added your suggestions.
/ok to test
/merge
Hi there,

Thanks for this great repository! I want to use the cuFile async I/O in my research project and noticed this kvikio repo. The initial support was added in #259 and is tracked in #204, but the Python interface hasn't been done yet. So I exported write_async and read_async to the CuFile Python class and added a test case. This will be very helpful for my project, where I want to run the PyTorch training computation and simultaneously load tensors from the SSDs.

I created this PR because, hopefully, it could be helpful for your repository as well as keeping the Python interface current. Please let me know your thoughts. Thank you.

Best Regards,
Kun

Authors:
- Kun Wu (https://github.com/K-Wu)
- Mads R. B. Kristensen (https://github.com/madsbk)

Approvers:
- Mads R. B. Kristensen (https://github.com/madsbk)

URL: #376