[ETL-654] Clean up before integration test run #121

BryanFauble · 2024-07-02T19:27:15Z

Problem:

When an integration test is run multiple times on a namespaced branch (or on the staging namespace), data left over from earlier runs can affect the results.

Solution:

Clean out the input bucket and the JSON intermediate bucket before running the integration test, on both namespaced and main branches.
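For illustration only, here is a minimal sketch of what such a cleanup step could look like. It is not the contents of `clean_staging.py`; the function name `clean_prefix`, the `chunked` helper, and the bucket and prefix values in the usage note are hypothetical. It assumes a boto3 S3 client is passed in and relies only on the `list_objects_v2` paginator and `delete_objects`, which accepts at most 1000 keys per call.

```python
from typing import Any, Dict, Iterator, List


def chunked(items: List[Dict[str, str]], size: int) -> Iterator[List[Dict[str, str]]]:
    """Yield consecutive slices of `items`, each holding at most `size` entries."""
    for start in range(0, len(items), size):
        yield items[start:start + size]


def clean_prefix(s3_client: Any, bucket: str, prefix: str) -> int:
    """Delete every object under `prefix` in `bucket` and return how many were deleted.

    `s3_client` is assumed to be a boto3 S3 client (hypothetical wiring).
    """
    if not prefix:
        # Assumed safety guard: an empty prefix would match the whole bucket.
        raise ValueError("prefix must not be empty")

    # Collect all keys first; list_objects_v2 returns at most 1000 keys per page.
    paginator = s3_client.get_paginator("list_objects_v2")
    keys: List[Dict[str, str]] = []
    for page in paginator.paginate(Bucket=bucket, Prefix=prefix):
        for obj in page.get("Contents", []):
            keys.append({"Key": obj["Key"]})

    # delete_objects accepts at most 1000 keys per request, so delete in batches.
    for batch in chunked(keys, 1000):
        s3_client.delete_objects(Bucket=bucket, Delete={"Objects": batch})
    return len(keys)
```

A caller might run `clean_prefix(boto3.client("s3"), "example-input-bucket", "my-namespace/")` once per bucket before the test starts; both the bucket name and the prefix here are placeholders.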

Testing:

I will verify this against namespaced branches first, and then again once it is merged into the main branch.

rxu17

LGTM! Just a couple of comments

.github/workflows/README.md

src/scripts/manage_artifacts/clean_staging.py

sonarqubecloud · 2024-07-02T21:35:34Z

Quality Gate passed

Issues
7 New issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarCloud

rxu17 · 2024-07-02T21:57:25Z

tests/test_json_to_parquet.py

@@ -517,6 +541,28 @@ def glue_crawler_role(namespace):
    role_name = f"{namespace}-pytest-crawler-role"
    glue_service_policy_arn = "arn:aws:iam::aws:policy/service-role/AWSGlueServiceRole"
    s3_read_policy_arn = "arn:aws:iam::aws:policy/AmazonS3ReadOnlyAccess"
+
+    # Cleanup if the role/policy already exist


Oh, were we running into role/policy duplicates/conflicts here when trying to create role when it already exists?

NVM I see what is happening. If the tests abruptly fail, sometimes the role/crawler/database might have been created already and not deleted properly because test failed prior to that. So next time the tests are run, it can run into an error.

These changes look good to me

Exactly! This is to make sure we're always running the test from a known state.
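As a rough sketch of the "known state" idea discussed above (not the code added in this PR), a fixture could call something like the following before creating the role. The function name `delete_role_if_exists` is hypothetical, and the sketch assumes a boto3 IAM client. It only handles attached managed policies, which is what the fixture in the diff uses; a role with inline policies would need those removed as well before `delete_role` succeeds.

```python
from typing import Any


def delete_role_if_exists(iam_client: Any, role_name: str) -> bool:
    """Remove a leftover IAM role so that a later create_role call starts clean.

    Returns True if a role was found and deleted, False if there was nothing to do.
    `iam_client` is assumed to be a boto3 IAM client (hypothetical wiring).
    """
    try:
        attached = iam_client.list_attached_role_policies(RoleName=role_name)
    except iam_client.exceptions.NoSuchEntityException:
        # The role does not exist, e.g. the previous run cleaned up properly.
        return False

    # IAM refuses to delete a role that still has managed policies attached.
    for policy in attached["AttachedPolicies"]:
        iam_client.detach_role_policy(
            RoleName=role_name, PolicyArn=policy["PolicyArn"]
        )
    iam_client.delete_role(RoleName=role_name)
    return True
```

Calling this at the top of the fixture makes the setup idempotent: an aborted earlier run that left the role behind no longer causes the next run to fail on creation.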

BryanFauble added 3 commits July 2, 2024 12:19

Clean folders before integration test run

0fffbbd

Print out start of script run

05bdcfd

Require cleanup before integration test run

2d6782c

BryanFauble requested a review from a team as a code owner July 2, 2024 19:27

pre-commit

78d1755

BryanFauble temporarily deployed to develop July 2, 2024 19:28 — with GitHub Actions Inactive

BryanFauble temporarily deployed to develop July 2, 2024 19:30 — with GitHub Actions Inactive

BryanFauble temporarily deployed to develop July 2, 2024 19:45 — with GitHub Actions Inactive

BryanFauble had a problem deploying to develop July 2, 2024 19:49 — with GitHub Actions Failure

Include boto3 explicitly in Pipfile and generate new Pipfile.lock

d8d09b6

BryanFauble temporarily deployed to develop July 2, 2024 20:01 — with GitHub Actions Inactive

BryanFauble temporarily deployed to develop July 2, 2024 20:04 — with GitHub Actions Inactive

BryanFauble temporarily deployed to develop July 2, 2024 20:10 — with GitHub Actions Inactive

BryanFauble had a problem deploying to develop July 2, 2024 20:12 — with GitHub Actions Failure

rxu17 approved these changes Jul 2, 2024

.github/workflows/README.md (outdated, resolved)

src/scripts/manage_artifacts/clean_staging.py (outdated, resolved)

Run python command in pipenv run

37b6aab

BryanFauble temporarily deployed to develop July 2, 2024 20:21 — with GitHub Actions Inactive

Fix readme doc

1d26d0b

BryanFauble temporarily deployed to develop July 2, 2024 20:22 — with GitHub Actions Inactive

BryanFauble temporarily deployed to develop July 2, 2024 21:35 — with GitHub Actions Inactive

BryanFauble temporarily deployed to develop July 2, 2024 21:38 — with GitHub Actions Inactive

BryanFauble temporarily deployed to develop July 2, 2024 21:52 — with GitHub Actions Inactive

BryanFauble temporarily deployed to develop July 2, 2024 21:54 — with GitHub Actions Inactive

BryanFauble temporarily deployed to develop July 2, 2024 21:55 — with GitHub Actions Inactive

rxu17 reviewed Jul 2, 2024

BryanFauble temporarily deployed to develop July 2, 2024 22:02 — with GitHub Actions Inactive

BryanFauble temporarily deployed to develop July 2, 2024 22:03 — with GitHub Actions Inactive

BryanFauble temporarily deployed to develop July 2, 2024 22:12 — with GitHub Actions Inactive

BryanFauble temporarily deployed to develop July 2, 2024 22:13 — with GitHub Actions Inactive

BryanFauble merged commit 2c3439f into main Jul 2, 2024
17 checks passed

BryanFauble deleted the etl-654-clean-data branch July 2, 2024 22:22

BryanFauble mentioned this pull request Jul 15, 2024

[ETL-670] Adjust cleanup job to run on main directory of dev env #126

Merged
