Update plausible unit concept ID list #570

Open
katy-sadowski opened this issue Oct 15, 2024 · 3 comments
Labels
check bug/enhancement DQ check SQL logic has a bug or needs refinement

Comments

@katy-sadowski
Collaborator

katy-sadowski commented Oct 15, 2024

Update all 3 concept-level threshold files to include some additional plausible unit concepts:

  • The number and percent of records for a given CONCEPT_ID 3009201 (THYROTROPIN [UNITS/VOLUME] IN SERUM OR PLASMA) with implausible units (i.e., UNIT_CONCEPT_ID NOT IN (44777578,8719,9040,9093)). (Threshold=5%).
  • The number and percent of records for a given CONCEPT_ID 3010813 (LEUKOCYTES [#/VOLUME] IN BLOOD) with implausible units (i.e., UNIT_CONCEPT_ID NOT IN (44777588,8848,8961,9444)). (Threshold=5%).
  • The number and percent of records for a given CONCEPT_ID 3007461 (PLATELETS [#/VOLUME] IN BLOOD) with implausible units (i.e., UNIT_CONCEPT_ID NOT IN (8848,8961,9444)). (Threshold=5%).
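As a rough illustration of what these check descriptions compute, here is a minimal Python sketch. It assumes the check counts measurement records whose unit is outside the plausible list and fails when the percentage exceeds the threshold. The function name, the record layout, and the sample data are hypothetical and are not taken from the DQD code base.

```python
def implausible_unit_rate(records, concept_id, plausible_units):
    """Return (num_violated, num_total, pct_violated) for one measurement concept.

    `records` is an iterable of (measurement_concept_id, unit_concept_id) pairs.
    """
    units = [unit for cid, unit in records if cid == concept_id]
    num_violated = sum(1 for unit in units if unit not in plausible_units)
    pct_violated = 100.0 * num_violated / len(units) if units else 0.0
    return num_violated, len(units), pct_violated


# Hypothetical data: 20 platelet records, 2 of them with unit concept 0,
# which is not in the plausible list quoted above.
records = [(3007461, 8848)] * 17 + [(3007461, 9444)] * 1 + [(3007461, 0)] * 2
plausible_units = {8848, 8961, 9444}   # list from the check description
threshold_pct = 5.0                    # Threshold=5%

violated, total, pct = implausible_unit_rate(records, 3007461, plausible_units)
# Assumption: the check fails when the violation rate exceeds the threshold.
check_failed = pct > threshold_pct
print(violated, total, pct, check_failed)
```

Under this reading, adding a legitimate unit concept to the plausible list directly lowers the violation count for sites that use that unit, which is why the lists need to be kept complete.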
katy-sadowski added the check bug/enhancement (DQ check SQL logic has a bug or needs refinement) label on Oct 15, 2024
@katy-sadowski
Collaborator Author

The number and percent of records for a given CONCEPT_ID 3010813 (LEUKOCYTES [#/VOLUME] IN BLOOD) with implausible units (i.e., UNIT_CONCEPT_ID NOT IN (44777588,8848,8961,9444)). (Threshold=5%).

This list is missing unit concept 8647: https://athena.ohdsi.org/search-terms/terms/8647

@katy-sadowski
Collaborator Author

The number and percent of records for a given CONCEPT_ID 3007461 (PLATELETS [#/VOLUME] IN BLOOD) with implausible units (i.e., UNIT_CONCEPT_ID NOT IN (8848,8961,9444)). (Threshold=5%).

This list is missing unit concept 32706: https://athena.ohdsi.org/search-terms/terms/32706

@adelmestri

adelmestri commented Nov 27, 2024

Some more issues to be solved on UNIT_CONCEPT_ID:
Table = ‘Measurement’, category = ‘Plausibility’, Status = ‘Fail’

  1. The number and percent of records for a given CONCEPT_ID 4148615 (NEUTROPHIL COUNT) with implausible units (i.e., UNIT_CONCEPT_ID NOT IN (8784,8848,8961)). (Threshold=5%).
    UNIT_CONCEPT_ID = 9444 = ‘1 billion per liter’ should be added to the list (8784,8848,8961), because it is equivalent to 8848:
    8848 = thousand per microliter = 10^3 per microliter
    1 billion = 10^9
    1 liter = 10^6 microliter
    1 billion per liter = 10^9 / 10^6 per microliter = 10^3 per microliter = thousand per microliter

  2. The number and percent of records for a given CONCEPT_ID 4254663 (LYMPHOCYTE COUNT) with implausible units (i.e., UNIT_CONCEPT_ID NOT IN (8848)). (Threshold=5%).
    UNIT_CONCEPT_ID = 9444 = ‘1 billion per liter’ should be added to the list for the same reason as in point 1.

  3. The number and percent of records for a given CONCEPT_ID 4298431 (WHITE BLOOD CELL COUNT) with implausible units (i.e., UNIT_CONCEPT_ID NOT IN (8848,8961)). (Threshold=5%).
    UNIT_CONCEPT_ID = 9444 = ‘1 billion per liter’ should be added to the list for the same reason as in point 1.

  4. The number and percent of records for a given CONCEPT_ID 4030871 (RED BLOOD CELL COUNT) with implausible units (i.e., UNIT_CONCEPT_ID NOT IN (8734,8815,8931,9444)). (Threshold=5%).
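    UNIT_CONCEPT_ID = 9445 = ‘billion per milliliter’ should be added, because it is equivalent to 8734:
    8734 = ‘trillion per liter’
    Billion = trillion/10^3
    Milliliter = liter/10^3
    Billion per milliliter = (trillion/10^3) per (liter/10^3) = trillion per liter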

  5. The number and percent of records for a given CONCEPT_ID 4197602 (SERUM TSH MEASUREMENT) with implausible units (i.e., UNIT_CONCEPT_ID NOT IN (8719,9040,9093)). (Threshold=5%).
    Currently accepted:
    8719 = ‘milliunit per milliliter’
    9040 = ‘milli-international unit per liter’
    9093 = ‘micro-international unit per milliliter’
    The following should be added to the list of acceptable UNIT_CONCEPT_IDs:
    44777578 = ‘milliunit per liter’
    8750 = ‘microunit per liter’
    8923 = ‘international unit per liter’
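The equivalence arguments in points 1 to 4 can be checked mechanically by expressing each unit as a count per liter. The sketch below does this with exact fractions. The unit names follow the comment above; the dictionary and helper function are illustrative only and are not part of DQD or the OMOP vocabulary tooling.

```python
from fractions import Fraction

# Each unit expressed as "count per liter": multiplier divided by volume in liters.
PER_LITER = {
    "thousand per microliter": Fraction(10**3) / Fraction(1, 10**6),  # 8848
    "billion per liter": Fraction(10**9) / 1,                         # 9444
    "billion per milliliter": Fraction(10**9) / Fraction(1, 10**3),   # 9445
    "trillion per liter": Fraction(10**12) / 1,                       # 8734
}


def equivalent(unit_a, unit_b):
    """Two units are interchangeable if they scale to the same count per liter."""
    return PER_LITER[unit_a] == PER_LITER[unit_b]


# Points 1-3: 9444 matches 8848, so it belongs in the white-cell lists.
print(equivalent("billion per liter", "thousand per microliter"))
# Point 4: 9445 matches 8734, so it belongs in the red-cell list.
print(equivalent("billion per milliliter", "trillion per liter"))
```

Exact fractions are used instead of floats so that the comparison does not depend on rounding.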

Projects
Status: Merged into develop branch