At a glance
In order to migrate historical data
as a product owner
I want to know that the process held the data to the same (or higher) standard as our intake.
The code: a brief and likely inaccurate history
Months ago, I developed a workbook generator. It began life as a way to exercise our early JSON Schema validations. It helped find many logical errors and drove a great deal of improvement in our intake.
It grew and evolved, serving in a one-off capacity for generating workbooks. As the API came online, I extended it to run end-to-end. More precisely, I discovered I could generate workbooks in memory and feed them to the Django application. This rapidly became an end-to-end test: a set of workbooks could be generated, fed to the app, run through all of our validations (including cross-validation), and the data at the time of input could then be compared with what is available via the API.
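To make the shape of that loop concrete, here is a minimal sketch of the end-to-end check. The generate_workbook helper, the upload endpoint, and the response/API field names are hypothetical placeholders, not the actual code or URLs in this repo.

```python
# Minimal sketch of the end-to-end loop described above; not the real test code.
# `generate_workbook` and the endpoint/field names are hypothetical.
import io

from django.test import Client


def run_end_to_end(seed_data):
    client = Client()

    # 1. Generate a workbook entirely in memory (an openpyxl-style object is assumed).
    workbook, expected = generate_workbook(seed_data)  # hypothetical helper
    buffer = io.BytesIO()
    workbook.save(buffer)
    buffer.seek(0)

    # 2. Feed it to the Django application, which runs all of our validations
    #    (including cross-validation) on intake.
    upload = client.post("/upload/", {"file": buffer})  # illustrative URL
    assert upload.status_code == 200, upload.content

    # 3. Compare the data at time of input with what the API now serves.
    report_id = upload.json()["report_id"]  # illustrative response field
    served = client.get(f"/api/workbooks/{report_id}/").json()  # illustrative URL
    assert served == expected
```

The actual test repeats this loop over a set of generated workbooks, but the three steps are the same.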
It is this code that we are basing our historical data migration on.
Approach
The initial refactoring will clone the existing code to another part of the tree, and then walk it forward.
The code will always generate workbooks that validate end-to-end.
Unit tests for critical/core functions will be developed as needed along the way.
Cleanup/refactoring can and should happen as needed.
No massive/architectural rewrites are part of this work.
At all points in the refactoring, the code should continue to generate workbooks that validate.
Tasks
Clone the existing code from dissemination to census_historical_migration #2723
Generate all_should_pass and all_should_fail, plus workbooks #2727
#2672 is broken out as its own epic for refactoring/improving this code.
When these steps are complete, the team should have a tool that can generate workbooks using the "messy" historical data loaded into our environments via a Docker container. It does not yet work with new/"authoritative" Census data.