[FEA] TZiF reader to support timezone-aware operations #11592
Labels: 1 - On Deck, cuIO, feature request, libcudf, Python
Related: #10047, #2477.
It is desirable to be able to do timezone-aware operations in cuDF.
One relatively simple approach is to load the IANA time zone database into cuDF as a table, and use existing algorithms to implement timezone-aware operations. This works quite well (see "Additional Context" below).
Unfortunately, the timezone database is typically not distributed or available in a format that libcudf can consume. It is usually distributed and present on user systems as a collection of TZiF files.
It would be great if we had a way to read TZiF files into cudf/libcudf. The first pass at this doesn't even need to be GPU-accelerated.
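As a rough illustration of what a first-pass, CPU-only reader could look like, here is a minimal Python sketch that turns the bytes of a TZiF file into a list of (transition time, UTC offset) rows. It is not an existing cudf or libcudf API: the names read_tzif and _read_block are hypothetical, the byte layout is assumed to follow RFC 8536, and leap-second records, abbreviations and the footer are skipped rather than interpreted.

```python
import struct

# TZif header per RFC 8536: magic, version byte, 15 reserved bytes,
# then six big-endian uint32 counts.
_HEADER = struct.Struct(">4sc15x6I")


def _read_block(data, offset, time_size):
    """Parse one header + data block (hypothetical helper).

    time_size is 4 for the version-1 block and 8 for the version-2+ block.
    Returns (version, rows, end_offset), where rows is a list of
    (transition_time_utc, utc_offset_seconds) tuples.
    """
    (magic, version, isutcnt, isstdcnt, leapcnt,
     timecnt, typecnt, charcnt) = _HEADER.unpack_from(data, offset)
    if magic != b"TZif":
        raise ValueError("not a TZif file")
    pos = offset + _HEADER.size

    # Transition times (seconds since the Unix epoch, UTC).
    code = "q" if time_size == 8 else "l"
    times = struct.unpack_from(">%d%s" % (timecnt, code), data, pos)
    pos += timecnt * time_size

    # One index per transition into the local time type records.
    type_idx = struct.unpack_from(">%dB" % timecnt, data, pos)
    pos += timecnt

    # Local time type records: (utoff, isdst, abbreviation index), 6 bytes each.
    ttinfo = [struct.unpack_from(">lBB", data, pos + 6 * i) for i in range(typecnt)]
    pos += 6 * typecnt

    # Skip abbreviations, leap-second records and the std/UT indicators.
    pos += charcnt + leapcnt * (time_size + 4) + isstdcnt + isutcnt

    rows = [(t, ttinfo[i][0]) for t, i in zip(times, type_idx)]
    return version, rows, pos


def read_tzif(data):
    """Return (transition_time_utc, utc_offset_seconds) rows from TZif bytes."""
    version, rows, pos = _read_block(data, 0, 4)
    if version != b"\x00":
        # Version 2+ files repeat the data with 64-bit transition times.
        _, rows, _ = _read_block(data, pos, 8)
    return rows


# Example usage (the path is an assumption; it varies by system):
# with open("/usr/share/zoneinfo/America/New_York", "rb") as f:
#     table = read_tzif(f.read())
```

The resulting rows are already sorted by transition time, so they could be loaded into a two-column table as they are.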
Additional context
To prototype this approach, I used a .csv version of the tzdb distributed by the third-party website timezonedb.com.
With the tzdb loaded into a table, timezone-aware operations map onto existing libcudf algorithms: a binary search (cudf::upper/lower_bound) and a binning operation (cudf::label_bins).
You can see my implementations of these operations here.
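To make the table-lookup idea concrete, here is a small CPU-only Python sketch of converting a UTC timestamp to local time with a binary search over sorted transition times. It uses the standard-library bisect module as a stand-in for an upper-bound search and is not the implementation linked above. The function name utc_to_local and the choice to clamp timestamps that precede the first transition to the first row are assumptions made for illustration.

```python
from bisect import bisect_right


def utc_to_local(ts_utc, transition_times, utc_offsets):
    """Shift a UTC timestamp (seconds) by the offset in effect at that instant.

    transition_times must be sorted ascending; utc_offsets[i] is the offset
    (seconds) that applies from transition_times[i] onward.
    """
    # Upper-bound search: index of the last transition at or before ts_utc.
    idx = bisect_right(transition_times, ts_utc) - 1
    # Assumption: timestamps before the first transition use the first row.
    idx = max(idx, 0)
    return ts_utc + utc_offsets[idx]


# Toy table with two transitions (values are illustrative, not real tzdb data).
times = [100, 200]
offsets = [-10, 20]
local = [utc_to_local(t, times, offsets) for t in (50, 100, 150, 200)]
```

On a column of timestamps the same lookup becomes one vectorized search followed by a gather of the offsets, which is why a table representation of the tzdb fits existing algorithms.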
Perf comparison with Pandas: