Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Introduce rawDataBucket #4759

Closed

Conversation

davidmirror-ops
Copy link
Contributor

@davidmirror-ops davidmirror-ops commented Jan 22, 2024

Tracking issue

#4291

Why are the changes needed?

Enables users to specify a separate (from metadata) Raw Data bucket in the form of a userDataBucket parameter in flyte-core.

What changes were proposed in this pull request?

How was this patch tested?

Setup process

Screenshots

Check all the applicable boxes

  • I updated the documentation accordingly.
  • All new and existing tests passed.
  • All commits are signed-off.

Related PRs

Docs link

@dosubot dosubot bot added size:L This PR changes 100-499 lines, ignoring generated files. documentation Improvements or additions to documentation enhancement New feature or request labels Jan 22, 2024
Copy link

codecov bot commented Jan 22, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Comparison is base (5300e1b) 58.17% compared to head (4c12d33) 58.08%.
Report is 79 commits behind head on master.

❗ Current head 4c12d33 differs from pull request most recent head debccf7. Consider uploading reports for the commit debccf7 to get more accurate results

Additional details and impacted files
@@            Coverage Diff             @@
##           master    #4759      +/-   ##
==========================================
- Coverage   58.17%   58.08%   -0.09%     
==========================================
  Files         626      367     -259     
  Lines       53833    32625   -21208     
==========================================
- Hits        31315    18949   -12366     
+ Misses      20010    12105    -7905     
+ Partials     2508     1571     -937     
Flag Coverage Δ
unittests 58.08% <ø> (-0.09%) ⬇️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

@@ -5,6 +5,7 @@ userSettings:
dbPassword: <DB_PASSWORD>
rdsHost: <RDS_HOST>
bucketName: <BUCKET_NAME>
userBucketName: <RAW_DATA_BUCKET_NAME>
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

would it be better to call it rawDataBucketName?

@@ -179,6 +179,8 @@ data:
signedUrls:
durationMinutes: 3
storage.yaml: |
propeller:
rawoutput-prefix: s3://<RAW_DATA_BUCKET_NAME>/data
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

why do we need /data here?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Not needed, it's the way binary handles it currently. I thought it'd be nice to align them

@@ -498,6 +500,8 @@ data:
profiler-port: 10254
storage-prefix: metadata/datacatalog
storage.yaml: |
propeller:
rawoutput-prefix:
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

IIRC, rawoutput-prefix can't be empty. does it work in the sandbox?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I tested on an EKS env and it works fine but not in sandbox yet.
I'll be testing on sandbox, I think I know why it's rendered as empty here (you're right, it cannot be empty)

@davidmirror-ops davidmirror-ops changed the title Introduce userDataBucket Introduce rawDataBucket Mar 1, 2024
@davidmirror-ops davidmirror-ops force-pushed the 4291-separate-buckets branch from debccf7 to fd42f65 Compare March 6, 2024 17:11
@dosubot dosubot bot added size:XS This PR changes 0-9 lines, ignoring generated files. and removed size:L This PR changes 100-499 lines, ignoring generated files. labels Mar 6, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
documentation Improvements or additions to documentation enhancement New feature or request size:XS This PR changes 0-9 lines, ignoring generated files.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants