HCMI sample file is missing important metadata and does not cover the entire repository #186

sgosline · 2024-05-21T16:40:22Z

We need to update the way we capture HCMI metadata so that all samples are fully tracked. Currently there is no tie back to the case_ids, which are the primary identifiers, nor is there any tracking of cancer diagnosis, tumor vs. normal, or primary vs. metastatic. This can be rectified by updating the data capture algorithm to use a well-thought-out schema. With this updated approach we can capture all of the data in the current HCMI sample schema:

We should map the following CoderData fields to the HCMI fields. Like the broad_sanger data, there will be multiple rows for a single sampleID.
cancer_type: this should be the 'Clinical tumor diagnosis'
common_name: this should be the 'sample_submitter_id'
other_id: there should be MULTIPLE of these per sample, and values can be duplicated across samples:
  case_submitter_id (other_id_source should be 'case_submitter_id')
  if available: diagnosis_id (other_id_source should be 'diagnosis_id') (there can still be multiple aliquots per dx)
  if available: treatment_id (other_id_source should be 'treatment_id')
  sample_uuid (other_id_source should be 'sample_id') <---- this should be 1:1 with improve_sample_id
  case_uuid (other_id_source should be 'case_id')
other_name: add 'tissue_type' here
other_name: add 'tumor_descriptor' here
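
To make the mapping above concrete, here is a minimal Python sketch, not the actual CoderData build code. It assumes each HCMI sample has already been flattened into a dict; the input keys (`clinical_tumor_diagnosis`, `sample_uuid`, `case_uuid`, and so on), the function names, and the choice to join `tissue_type` and `tumor_descriptor` into one `other_name` string are all hypothetical. The identifier values in the example are invented placeholders.

```python
from typing import Dict, List, Optional

# Pairs of (key in a hypothetical flat HCMI record, other_id_source label from the issue).
OTHER_ID_FIELDS = [
    ("case_submitter_id", "case_submitter_id"),
    ("diagnosis_id", "diagnosis_id"),  # only if available
    ("treatment_id", "treatment_id"),  # only if available
    ("sample_uuid", "sample_id"),      # expected to be 1:1 with improve_sample_id
    ("case_uuid", "case_id"),
]


def hcmi_record_to_rows(
    record: Dict[str, Optional[str]], improve_sample_id: int
) -> List[Dict[str, object]]:
    """Expand one HCMI sample record into one row per available identifier."""
    # Assumption: both descriptors share the single other_name field, joined with "; ".
    descriptors = [record.get("tissue_type"), record.get("tumor_descriptor")]
    other_name = "; ".join(d for d in descriptors if d)

    rows: List[Dict[str, object]] = []
    for key, source in OTHER_ID_FIELDS:
        value = record.get(key)
        if not value:
            continue  # skip identifiers that are missing for this sample
        rows.append({
            "improve_sample_id": improve_sample_id,
            "cancer_type": record.get("clinical_tumor_diagnosis"),
            "common_name": record.get("sample_submitter_id"),
            "other_id": value,
            "other_id_source": source,
            "other_name": other_name,
        })
    return rows


def sample_ids_are_one_to_one(rows: List[Dict[str, object]]) -> bool:
    """Check that each sample UUID pairs with exactly one improve_sample_id and vice versa."""
    by_improve: Dict[object, set] = {}
    by_uuid: Dict[object, set] = {}
    for row in rows:
        if row["other_id_source"] != "sample_id":
            continue
        by_improve.setdefault(row["improve_sample_id"], set()).add(row["other_id"])
        by_uuid.setdefault(row["other_id"], set()).add(row["improve_sample_id"])
    return all(len(s) == 1 for s in by_improve.values()) and all(
        len(s) == 1 for s in by_uuid.values()
    )


# Invented placeholder record; no treatment_id is available for this sample.
example = {
    "clinical_tumor_diagnosis": "Colorectal cancer",
    "sample_submitter_id": "EXAMPLE-SAMPLE-01",
    "case_submitter_id": "EXAMPLE-CASE-01",
    "diagnosis_id": "example-diagnosis-uuid",
    "treatment_id": None,
    "sample_uuid": "example-sample-uuid",
    "case_uuid": "example-case-uuid",
    "tissue_type": "Tumor",
    "tumor_descriptor": "Primary",
}

rows = hcmi_record_to_rows(example, improve_sample_id=1)
print([r["other_id_source"] for r in rows])
```

Emitting one row per identifier keeps the table in the same long format as the broad_sanger data, and the 1:1 check can be run over the full table before it is written out.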

If fixed, this should address the need for #185.

sgosline self-assigned this May 21, 2024

sgosline added the bug Something isn't working label May 21, 2024

sgosline added this to CoderData May 21, 2024

sgosline moved this to Ready in CoderData May 21, 2024

sgosline moved this from Ready to In progress in CoderData May 21, 2024

sgosline mentioned this issue May 22, 2024

updated HCMI to include more metadata #187

Merged

sgosline moved this from In progress to Done in CoderData May 22, 2024

sgosline closed this as completed May 31, 2024
