-
Notifications
You must be signed in to change notification settings - Fork 3
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Merge pull request #120 from nhsengland/ah_data_validation_project
AH Added /sde_data_validation project page
- Loading branch information
Showing
3 changed files
with
33 additions
and
0 deletions.
There are no files selected for viewing
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,30 @@ | ||
--- | ||
title: 'Reusable New Data Product Validation Functions' | ||
summary: 'More Efficient, More Consistent Data Through Shared Validation Functions' | ||
origin: 'NHS England Secure Data Environment Service Data Wranglers' | ||
tags: ['DATA WRANGLERS', 'NHSE_SDE', 'SDE', 'DATA VALIDATION', 'RAP', 'PYTHON'] | ||
--- | ||
![An image showing a stack of boxes on the left and a single box with robotic legs on the right. The stack of boxes has a label "old validation process" along with titles on boxes such as "code not shared", "inconsistent approach", "unreliable" and "manual process". Above the boxes it says "3 days". Next to the boxes an unhappy man is struggling to move them. To the right is a single box with robotic legs, with a happy looking man stood next to it. The box with robotic legs is labeled "new validation process" and has words nearby such as "reusable code", "consistent process" and "easy to re-run". Above the box is a label stating it takes about 30 minutes.](../images/sde_resuable_data_validation_functions.png) | ||
|
||
All data provisioned into the NHS England Secure Data Environment (SDE) must be validated first. The old data product validation process was manual, time consuming and lengthy to re-run. | ||
|
||
Our objectives were to: | ||
- Boost the efficiency and consistency of the data validation process for the Data Access Request Service (DARS) | ||
- Make it re-usable to save time and uphold best practice | ||
- Share the code so others can benefit. | ||
|
||
## Results | ||
|
||
- Validation time down from days to approximately 30 minutes | ||
- Validation code was reusable on other datasets and has already been reused | ||
- Consistent methodology compared to manual approach | ||
- Enabled multiple potential issues that could have hampered research efforts to be addressed earlier. | ||
|
||
Output|Link | ||
---|--- | ||
Open Source Code & Documentation| Coming soon! | ||
Case Study| N/A | ||
Technical report| N/A | ||
Algorithmic Impact Assessment| N/A | ||
|
||
# |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters