Skip to content

Commit

Permalink
add docs
Browse files Browse the repository at this point in the history
  • Loading branch information
JFriel committed Oct 28, 2024
1 parent 7d0cb4e commit 2e2c5a9
Showing 1 changed file with 41 additions and 0 deletions.
41 changes: 41 additions & 0 deletions Documentation/DataLoadEngine/RegexRedactions.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,41 @@
# Regex Redactions

RDMP has the ability to redact incoming and existing catalogue data.
It uses user-defined regex to do this.
This functionality may be useful if you know about problematic data that you wish to clean before it reaches RDMP

## Setup & Configuration
To begin, a redaction configuration is required.
This can be added in the "Configurations" section of RDMP.
Each configuration has:
* A Name
* A Description
* A Regex string to match
* A Replacement value

N.B. The replacement value cannot be longer than the redacted string i.e. "string" => "myRedactedString" will not work.

Once the configuration has been saved, it can be used to redact data.

## Usage in Data Loads
Within the Data Loads mutilators, there is a mutilator named "Regex Redaction Mutilator" that can be used in the RAW or STAGING areas of the load.
This mutilator will redact based on your selected configuration.
The configuration for this mutilator requires:
* A predefined regex redaction configuration (see above)
* Either a regex to select which columns to redact, or a selection of known columns

N.B. Primary Key columns will not be redacted and this functionality is intended for string fields.

Once the configuration has been set, it will redact the desired columns during a data load.

During a run, the mutilator will replace any regex matches with the desired string and store the now known redaction in RDMP.
To view the redactions made, see the section below.


## Usage within RDMP
You can view and restore existing redactions, along with adding new redactions within RDMP.
For a Catalogue, right-click and iopen the "Catalogue Items" submenu.
From here, select the "Regex Redactions" option.
This will open an interface displaying existing redactions.
From here you casn restore existing redactions either individually or in bulk.
Yuo can also apply regex redactions to existing data by selecting a redaction configuration and which columns to redact.

0 comments on commit 2e2c5a9

Please sign in to comment.