Enable local mode and improve podp mode #207

Closed
wants to merge 26 commits into from

CunliangGeng
Member

redesign the template of config file

The new design does not let users specify paths to their data files; instead, data files must always be placed in the default paths defined by nplinker.

To keep the template simple, some settings have been removed from the new config. They can be added back later if really needed.

update default values of config

add validations for all settings

remove config test demo2

update unit test for new configs

update utils.py

  • add function transform_to_full_path
  • add code to create the download/extract dir if it does not exist
  • update docstrings to specify that the download/extract dir does not have to exist
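
A minimal sketch of what the two utils changes above could look like, for readers without the diff at hand. Only the name `transform_to_full_path` comes from this PR; its body here, and the helper `ensure_dir`, are assumptions for illustration and not the actual nplinker code.

```python
import os
from pathlib import Path
from typing import Union


def transform_to_full_path(path: Union[str, os.PathLike]) -> Path:
    """Expand a leading "~" and return an absolute, normalized path.

    Assumed behaviour: the path does not need to exist yet.
    """
    return Path(path).expanduser().resolve()


def ensure_dir(path: Union[str, os.PathLike]) -> Path:
    """Hypothetical helper: create the directory (and parents) if missing."""
    full_path = transform_to_full_path(path)
    # exist_ok=True makes the call a no-op when the directory is already there.
    full_path.mkdir(parents=True, exist_ok=True)
    return full_path
```

With a helper like this, a download or extract function can accept a directory that does not exist yet and create it on first use.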

Update config.py

Transform root_dir to full path.

Update mibig_downloader.py

  • add support for PathLike paths

add default data paths to globals

Update globals.py

move PFAM_PATH to where it is used

Update mibig_downloader.py

Update runbigscape.py

  • remove function podp_run_bigscape
  • update function run_bigscape

update utils on checking md5

  • remove function check_integrity
  • update md5 checking in function download_url
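
For context, an MD5 check after a download is typically a streamed hash compared against an expected digest. The sketch below illustrates that idea only; `md5_of_file` and `verify_md5` are hypothetical names, and this is not the code inside nplinker's `download_url`.

```python
import hashlib


def md5_of_file(path, chunk_size=1024 * 1024):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        # iter() with a sentinel stops when read() returns empty bytes (EOF).
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_md5(path, expected_md5):
    """Raise ValueError if the file's MD5 does not match the expected digest."""
    actual = md5_of_file(path)
    if actual != expected_md5.lower():
        raise ValueError(
            f"MD5 mismatch for {path}: expected {expected_md5}, got {actual}"
        )
```

Reading in chunks keeps memory use flat for large archives, which is why the hash is updated incrementally rather than computed from the whole file at once.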

Create arranger.py

  • Add class DatasetArranger
  • Add dataset validation functions validate_gnps, validate_antismash and validate_bigscape

Delete downloader.py and its tests

remove running of bigscape in dataset loader

remove func _start_downloads from DatasetLoader

remove init paths from DatasetLoader

Merge branch 'refactor_project_folder_init' of https://github.com/NPLinker/nplinker into refactor_project_folder_init

Member Author

Current dependencies on/for this PR:

This stack of pull requests is managed by Graphite.

@CunliangGeng changed the title from "redesign the template of config file" to "Enable local mode and improve podp mode" on Feb 27, 2024
The GNPS file mappings file can be either a TSV or a CSV file.
- rename `gnps_file_mapping_tsv_file` to `gnps_file_mappings_file`
- rename `tsv_file` to `gnps_file_mappings_file`
- update relevant docstrings
- remove extension for `GNPS_FILE_MAPPINGS_FILENAME`
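
One way to support both formats once the extension is dropped from the filename constant is to look for either suffix and pick the delimiter from it. This is a sketch under assumptions: the value `"file_mappings"` and the functions `find_file_mappings` and `read_file_mappings` are hypothetical, and only the constant's name comes from this PR.

```python
import csv
from pathlib import Path

# Assumed value for illustration; the PR only states that the extension was removed.
GNPS_FILE_MAPPINGS_FILENAME = "file_mappings"


def find_file_mappings(gnps_dir):
    """Return the mappings file in gnps_dir, trying .tsv first, then .csv."""
    for ext in (".tsv", ".csv"):
        candidate = Path(gnps_dir) / (GNPS_FILE_MAPPINGS_FILENAME + ext)
        if candidate.is_file():
            return candidate
    raise FileNotFoundError(
        f"No {GNPS_FILE_MAPPINGS_FILENAME}.tsv or .csv found in {gnps_dir}"
    )


def read_file_mappings(path):
    """Read all rows, choosing the delimiter from the file extension."""
    path = Path(path)
    delimiter = "\t" if path.suffix == ".tsv" else ","
    with open(path, newline="", encoding="utf-8") as fh:
        return list(csv.reader(fh, delimiter=delimiter))
```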
@CunliangGeng deleted the refactor_project_folder_init branch on February 27, 2024 14:21