-
Notifications
You must be signed in to change notification settings - Fork 8
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Add support for chunked reading of Parquet and CSV files in Reader class
- Implemented `get_in_chunks` method to read large datasets in manageable chunks. - Enhanced handling of Parquet files with catalog types, including type mapping for pandas dtypes to pyarrow types. - Updated CSV reading to ensure date fields are correctly parsed. - Added error handling for unsupported file types and improved user feedback for method usage. This change improves memory efficiency and flexibility when processing large ETL files.
- Loading branch information
1 parent
d1b22e7
commit 1be6075
Showing
1 changed file
with
77 additions
and
1 deletion.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters