Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Precomputed db #17

Merged
merged 29 commits into from
Jan 27, 2025
Merged

Precomputed db #17

merged 29 commits into from
Jan 27, 2025

Conversation

ArnaudBelcour
Copy link
Collaborator

Add

  • New command esmecata_create_db to create database from different output folders of esmecata (from_runs).
  • Full release of esmecata precomputed associated with the first version of esmecata precomputed database.
  • Option threshold (-t) to precomputed.
  • Add --gseapyCutOff option to gseapy_enrichr.
  • A check after database creation to detect taxon with few predicted proteins compared to higher affiliated taxon.
  • Check the good format of the gzip file.
  • Header KEGG_reaction in annotation_reference from annotation_uniprot to avoid issues with esmecata_create_db.

Fix

  • Issue with protein IDs from UniParc during annotation (incorrect split on '|').
  • Fix issue in get_taxon_obs_name function.
  • Issues in test.

Modify

  • Add database version in log.
  • Rename test_workflow.py into test_workflow_uniprot.py, to better reflect what is done.
  • Update workflow figure.
  • Update readme.
  • Update article_data folder and the associated readme.

@ArnaudBelcour ArnaudBelcour merged commit acf38e8 into main Jan 27, 2025
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant