
Tracker for Coverage over CellML Model Repository #23

Open · anandijain opened this issue Mar 21, 2021 · 12 comments

@anandijain (Contributor, author)

This issue will track our progress in testing CellMLToolkit.jl on the CellML Model Repository.

On a branch, I've added some functions that query the Model Repository for all of the "exposures" and then download them with curl. I've also added functions that build a DataFrame showing which models work and which don't.

This work is incomplete, and since the model repository is quite large, it takes a while to download.
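The querying step can be sketched outside Julia as well. Below is a minimal Python sketch of the same idea, collecting exposure links from a repository listing page; the page structure and the `/exposure/...` link pattern are assumptions for illustration, not the actual script from the branch.

```python
import re

def extract_exposure_urls(listing_html, base="https://models.physiomeproject.org"):
    """Collect unique /exposure/... links from a repository listing page.

    The HTML structure assumed here is hypothetical; the real scripts
    live on a Julia branch of CellMLToolkit.jl and use curl.
    """
    hrefs = re.findall(r'href="(/exposure/[^"]+)"', listing_html)
    seen, urls = set(), []
    for h in hrefs:
        if h not in seen:
            seen.add(h)
            urls.append(base + h)
    return urls

# Hypothetical listing fragment with a duplicate link.
sample = ('<a href="/exposure/abc123">Model A</a> '
          '<a href="/exposure/abc123">dup</a> '
          '<a href="/exposure/def456">Model B</a>')
print(extract_exposure_urls(sample))
```

Deduplicating while preserving order keeps one download per exposure even when a listing links the same exposure several times.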

We are planning to do something similar for SBML.jl and their test-suite so it'd be nice to have some consistency in testing.

I don't have the entire library, but from my sample of ~1000 models, I found that we can call solve on about 10% of these models and get back a Solution.

@shahriariravanian you've mentioned some of the issues that could be contributing to this 10% number. It would be good to mention them, so that as they get fixed we can see how this percentage changes.

@anandijain (Contributor, author)

Version 2.1.0 is giving ~178/940.

@anandijain (Contributor, author)

Version 2.2.0 is giving ~477/940.

@anandijain (Contributor, author)

It is a known issue that some files in the CellML Model Repository have bad XML or do not fit the CellML specification we target. (Aside: @shahriariravanian, which version of CellML are we guaranteeing support for?)

After removing Goldbeeter_2006 from my data folder, we now get 530/940. The problem is caused in EzXML: when it hits a parsing error, it pushes to a global error stack that prevents further usage. Why they do this, I have no idea...

@anandijain (Contributor, author)

861 CellML models
718 successfully converted to ODESystem
635 successfully converted to ODEProblem
595 successfully solved

We get 940 from the curls, but cloning the git repos returns 861, so that's where the discrepancy comes from.
595/861 is quite good IMO, as a lot of the models are truly defective.
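The counts above form a pipeline funnel (models → ODESystem → ODEProblem → solved). A minimal Python sketch that turns such counts into pass-through rates, using the numbers from this comment:

```python
def funnel_report(stages):
    """Format pass-through rates for a sequence of (label, count) pipeline stages,
    relative to the count of the first stage."""
    total = stages[0][1]
    return [f"{label}: {count}/{total} ({100 * count / total:.1f}%)"
            for label, count in stages]

stages = [
    ("CellML models", 861),
    ("converted to ODESystem", 718),
    ("converted to ODEProblem", 635),
    ("solved", 595),
]
for line in funnel_report(stages):
    print(line)
```

The biggest single drop here is ODESystem generation (861 → 718), which matches the defective-XML issues discussed above.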

@ChrisRackauckas (Member)

What are the issues you see?

@anandijain (Contributor, author)

This data is from @shahriariravanian. Could you shed some light on Chris's question?

@shahriariravanian (Collaborator)

The remaining issues are:

  1. Some CellML XML files are defective (missing some initial values). Currently, CellMLToolkit throws an error for these; the plan is to instead return a list of uninitialized variables for the user to provide values for.
  2. Some models have more than one independent variable (in fact, some use the partial_diff tag). This is uncommon in CellML models but is supported in the specs.
  3. The main remaining active issue is to implement imports completely. Currently, we have an incomplete implementation. Full import is rather complicated, as CellML XML files can recursively import and rename components and connections (links between variables from different components) from other files. Because of the connections, we may need to import some components implicitly.
  4. The ODEProblems that were not solved are not a big problem, as we used a fixed solver (TRBDF2) with some default parameters.
  5. Large models (XML size > 500K) can take a long time to generate an ODESystem. I'm going to profile to see where the main problem is, but we may need to change the strategy for how structural_simplify is used for very large models.
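Point 1's planned behavior, collecting the uninitialized variables instead of throwing on the first one, might look like this minimal Python sketch; the variable names are hypothetical, not from any particular model.

```python
def missing_initial_values(initial_values):
    """Return the names of state variables whose initial value is absent (None),
    rather than raising on the first missing one -- the behavior planned for
    defective models, so the user can supply the values themselves."""
    return [name for name, value in initial_values.items() if value is None]

# Hypothetical state map where V has no initial value in the XML.
states = {"V": None, "m": 0.05, "h": 0.6, "n": 0.325}
print(missing_initial_values(states))  # → ['V']
```

Returning the full list in one pass lets the user fix every gap at once instead of re-running after each error.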

@anandijain (Contributor, author)

Great, could you name a model with that issue? I'd like to look into it. Similarly for a model with missing vars and components.

Also, if you end up doing some profiling, I think it'd be good to add benchmarking to our testing of the model repo. I'm happy to add this too with BenchmarkTools.

This may help pin down inefficiencies, i.e. "is it dependent on parameter count, state count, etc.?"

@shahriariravanian (Collaborator)

These are the results of the latest run:

# outcome
867 CellML models
6 too large (>500K, excluded)
744 successfully converted to ODESystem
650 successfully converted to ODEProblem
608 successfully solved

@shahriariravanian (Collaborator)

Here is the result file as a CSV. The codes in the res column are:

0 -> fail to generate ODESystem
1 -> fail to generate ODEProblem
2 -> fail to solve ODEProblem
3 -> success!
9 -> too large a file, ignored

cellml_results.txt
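Tallying the outcomes from such a file is straightforward; a minimal Python sketch, assuming the CSV has a column named `res` (per the comment above) plus a model column. The sample rows below are hypothetical, not taken from the attached file.

```python
import csv
import io
from collections import Counter

# Code meanings as listed in the comment above.
LABELS = {
    "0": "fail to generate ODESystem",
    "1": "fail to generate ODEProblem",
    "2": "fail to solve ODEProblem",
    "3": "success",
    "9": "too large a file, ignored",
}

def tally_results(csv_text):
    """Count models per outcome code in a CSV with a 'res' column."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return Counter(row["res"] for row in reader)

# Hypothetical sample rows for illustration.
sample = "model,res\na.cellml,3\nb.cellml,0\nc.cellml,3\nd.cellml,9\n"
counts = tally_results(sample)
for code in sorted(counts):
    print(f"{code} ({LABELS[code]}): {counts[code]}")
```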

@ChrisRackauckas (Member)

Try setting the runner to a lower tolerance; that should help the domain-error cases. If not, rewrite sqrt(x) as sqrt(abs(x)) so trial steps that overshoot don't error out but are instead rejected.
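The sqrt(abs(x)) guard is illustrated below in Python (in practice the rewrite would apply to the generated Julia right-hand side, not Python): a bare sqrt raises on a slightly negative trial value, while the guarded version returns a finite number, so an adaptive solver's error control can reject the step instead of the whole solve aborting.

```python
from math import sqrt

def safe_sqrt(x):
    """Guarded square root: sqrt(abs(x)) instead of raising on a slightly
    negative argument, so an adaptive stepper can reject the offending
    trial step rather than abort with a domain error."""
    return sqrt(abs(x))

print(safe_sqrt(4.0))
print(safe_sqrt(-1e-12))  # tiny negative overshoot from a trial step
```

The trade-off is that a genuinely wrong (large negative) argument is silently mapped to a positive one, so this is a pragmatic guard for small overshoots, not a fix for a mis-specified model.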

@shahriariravanian (Collaborator)

These are the latest tracking results using version 2.4.1 (to be pushed soon):

# outcome
867 CellML models
6 too large (>500K, excluded)
775 successfully converted to ODESystem
688 successfully converted to ODEProblem
643 successfully solved

cellml_results_8.txt
