-
Notifications
You must be signed in to change notification settings - Fork 23
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
bugs in airr-schema.yaml / airr-schema-openapi3.yaml #813
Comments
Yes, this is a good point that we haven't addressed in #773 yet, and is a criteria we applied to the germline gene set schema. Adding a new class is the solution, though I'm not sure the best name. Maybe |
Hi @LonnekeScheffer , we need to differentiate these issues between v1.5 and v2.0. If v2.0 ends up being broken for awhile, that's ok, all the data is currently in v1.5 so that's the priority. Maybe some of these issues are only present in v2.0 so we can ignore them for now. |
Hi @schristley, I have now updated the list to separate the airr v1.5 issues (which also occur in v2.0) from the v2.0-only issues |
For Likewise for |
@bussec @bcorrie This seems to be incorrect usage by AIRR standards. Gene names are not ontology-based exactly, i.e. each individual gene is not an ontology term. This is more like a controlled vocabulary from an external source. I don't think we have a similar situation elsewhere in the AIRR schema, so I think we need to define this differently. Likewise for |
@LonnekeScheffer this is actually correct, it is just uncommon. What this means is that instead of the keys/properties of the dictionary being pre-defined, they are dynamic. The For LinkML, this gets translated into a |
Some issues encountered when auto-generating LinkML from airr-schema.yaml / airr-schema-openapi3.yaml
These are all relatively easy/small fixes.
Issues for both AIRR version 1.5 and 2.0
release_version
has inconsistent type. InAlleleDescription
, type=integer; inGermlineSet
, type=number. Should be integer in both cases.species
is missing source node in some places. This ontology is correctly defined inSubject
, but inAlleleDescription
andGermlineSet
, the expected fieldx-airr/ontology/top_node/id
is missing.locus
(contains 'null' inRearrangement
, but not inAlleleDescription
,GermlineSet
,Genotype
);mhc_class
(contains 'null' inReactivity
, not inMHCGenotype
)x-airr/ontology/top_node/id
is missing.antigen
in class. 'ReceptorReactivity' (v1.5) / 'Reactivity' (v2.0) the expected fieldx-airr/ontology/top_node/id
is missing.nodes
of classTree
should have$ref: '#/Node'
directly undernodes
, now it is placed under a field calledadditionalProperties
which is not used anywhere elseRepertoireGroup
contains propertyrepertoires
. This is not simply an array ofRepertoire
objects, but rather, an array of items which are unnamed (!) objects, where each object contains arepertoire_id
,repertoire_description
,time_point
. There are no other comparable occurrences in the schema files where new objects are defined in such a way. Since this object is listed under 'items', I cannot infer a name for this object when auto-generating LinkML. Therefore, I would suggest creating a new named class, e.g.,RepertoireReference
, which contains these three fields, and is used as a 'type' for this array.Issues for AIRR version 2.0 only
unit
occurs as an ontology across different classes. This ontology does not have the same meaning. E.g.:TimePoint
(unit=UO:0000003),TimeInterval
(unit=UO:0000033),PhysicalQuantity
(unit=UO:0000024),TimeQuantity
(unit=UO:0000033). The field 'unit' should be renamed to something unique (time_unit, time_interval_unit, physical_quantity_unit)orcid_id
in classContributor
the expected fieldx-airr/ontology/top_node/id
is null.affiliation
in classContributor
the expected fieldx-airr/ontology/top_node/id
is null.reactivity_ref
is defined with two different types. Inside classReactivity
, the type is array. Inside classRearrangement
, the type is string, but the description explicitly states that this string should be a comma separated list. I would propose changing the type to array. (and besides,Rearrangement
also containsreactivity_id
which is a comma separated list as string, the type for that should then probably change as well.)The text was updated successfully, but these errors were encountered: