Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Apply pycytominer-transform to real world CP-derived SQLite #16

Open
gwaybio opened this issue Nov 9, 2022 · 5 comments
Open

Apply pycytominer-transform to real world CP-derived SQLite #16

gwaybio opened this issue Nov 9, 2022 · 5 comments
Assignees

Comments

@gwaybio
Copy link
Member

gwaybio commented Nov 9, 2022

@d33bs - consider applying pycytominer-transform to the existing SQLite data here to test your approach with real world data.

We would be particularly interested in comparing the results from pycytominer.cyto_utils.cells.SingleCells to the pycytominer-transform derived parquet file.

More details about the SingleCells processing is here: https://github.com/WayScience/NF1_SchwannCell_data/blob/5536d86330aba0f66c74e2bd44c6e82ed1c985f9/4_processing_features/extract_single_cell_features.ipynb

@gwaybio
Copy link
Member Author

gwaybio commented Nov 9, 2022

The CellProfiler pipeline that @jenna-tomkinson used to create the SQLite file is here: https://github.com/WayScience/NF1_SchwannCell_data/blob/main/CellProfiler_pipelines/Pipelines/NF1_analysis.cpproj (I believe, Jenna, please correct me if I'm wrong!)

@jenna-tomkinson
Copy link
Member

@gwaybio @d33bs Yes it is! Please run the NF1_illum.ccproj first to get the correct images.

@d33bs
Copy link
Member

d33bs commented Nov 9, 2022

Thank you @gwaybio and @jenna-tomkinson! I will look into using this and performing a notebook-based comparison.

@d33bs
Copy link
Member

d33bs commented Jan 31, 2023

Hi @jenna-tomkinson and @gwaybio - it looks like the data may have changed since the creation of this issue. I can see there are many .sqlite data sources to choose from under CellProfiler_pipelines/Analysis_Output. To make sure I address this issue accurately to the intent, which one(s) would be best to use?

@gwaybio
Copy link
Member Author

gwaybio commented Jan 31, 2023

pycytominer-transform should be able to handle them all, but you can focus on NF1_data_allcp_plate1.sqlite and NF1_data_allcp_plate2.sqlite. Plate 1 is the simplest case, Plate 2 is larger and will be a better memory/speed benchmark.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants