Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Harmonize branch #4

Open
wants to merge 46 commits into
base: main
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
46 commits
Select commit Hold shift + click to select a range
140e165
Create README.md
MikeLippincott Mar 25, 2024
58be871
setup repo
MikeLippincott Mar 25, 2024
6f75e2a
update README
MikeLippincott Mar 25, 2024
35d2e39
license
MikeLippincott Mar 25, 2024
523b451
Update .pre-commit-config.yaml
MikeLippincott Mar 25, 2024
8f94ff0
address comments
MikeLippincott Mar 25, 2024
8c070dd
comments
MikeLippincott Mar 26, 2024
d2c5efa
Update LICENSE.md
MikeLippincott Mar 26, 2024
235c267
Update README.md
MikeLippincott Mar 26, 2024
b400ead
Update data/README.md
MikeLippincott Mar 26, 2024
fdf7476
Update README.md
MikeLippincott Mar 26, 2024
2a47902
Update README.md
MikeLippincott Mar 26, 2024
d4cf051
comments
MikeLippincott Mar 26, 2024
bcf8ccf
addressing comments
MikeLippincott Mar 26, 2024
2b97329
Merge pull request #1 from WayScience/preprocessing
MikeLippincott Mar 26, 2024
4cd9e82
IC
MikeLippincott Mar 26, 2024
f0ffa29
IC
MikeLippincott Mar 27, 2024
38cdcfe
run pipelines
MikeLippincott Apr 1, 2024
2302d2c
run IC and feature extreaction
Apr 2, 2024
fec6228
extract features
Apr 4, 2024
075eaf3
cytomining
MikeLippincott Apr 5, 2024
e6f0970
extract features
MikeLippincott Apr 8, 2024
3ecf95b
reerun to have image names
MikeLippincott Apr 12, 2024
d1b6115
Update 2.cellprofiler_ic_processing/pipelines/illum_2ch.cppipe
MikeLippincott Apr 23, 2024
72eb74f
Update 2.cellprofiler_ic_processing/pipelines/illum_4ch.cppipe
MikeLippincott Apr 23, 2024
b997d02
Update 2.cellprofiler_ic_processing/scripts/perform_ic.py
MikeLippincott Apr 23, 2024
891ae22
Update 2.cellprofiler_ic_processing/scripts/perform_ic.py
MikeLippincott Apr 23, 2024
4212ab8
Update 3.cellprofiler_analysis/pipelines/analysis_2ch.cppipe
MikeLippincott Apr 23, 2024
acd7861
address Jenna's comments
MikeLippincott Apr 23, 2024
39f4ac8
update for next PR
MikeLippincott Apr 23, 2024
c65b471
Merge pull request #2 from MikeLippincott/begin_profiling
MikeLippincott Apr 23, 2024
df4ee20
add updated pipelines and env
jenna-tomkinson Apr 26, 2024
8749b76
Merge pull request #3 from jenna-tomkinson/update_cp_analysis
jenna-tomkinson Apr 29, 2024
536a4eb
update cp util script to include plugin dir
jenna-tomkinson May 3, 2024
0eb8d01
add check to make sure plugin dir exists
jenna-tomkinson May 3, 2024
bc34674
Merge pull request #4 from jenna-tomkinson/update_cp_util_script
jenna-tomkinson May 3, 2024
1bee545
rerun with preproccess
MikeLippincott May 6, 2024
ad640ea
Merge pull request #5 from MikeLippincott/rerun_analysis
MikeLippincott May 6, 2024
a4fb726
merge cytoplasmic channels
MikeLippincott May 15, 2024
c384e46
Update 3.cellprofiler_analysis/pipelines/analysis_4ch.cppipe
MikeLippincott May 16, 2024
93ca5f3
Update 3.cellprofiler_analysis/pipelines/analysis_4ch.cppipe
MikeLippincott May 16, 2024
5b8d8b0
Update 3.cellprofiler_analysis/pipelines/analysis_4ch.cppipe
MikeLippincott May 16, 2024
370383f
Merge pull request #6 from MikeLippincott/combine_channels
MikeLippincott May 16, 2024
52c9fb7
integrate data
MikeLippincott May 28, 2024
ea7cf50
Merge branch 'data_harmonization' into harmonize_branch
MikeLippincott May 28, 2024
b14cdc7
harmonize data
MikeLippincott Jun 28, 2024
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
44 changes: 44 additions & 0 deletions .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -23,3 +23,47 @@ data/

# shiny files:
2.scDINO_processing/temporal_shiny_app/rsconnect/
# raw data but keep the metadata
data/2023*

# logs
**.log

# IC logs
2.cellprofiler_ic_processing/notebooks/logs/
2.cellprofiler_ic_processing/scripts/logs/

# pycache
**/__pycache__/

# illumination corrected images
2.cellprofiler_ic_processing/illum_directory

# cellprofiler logs
3.cellprofiler_analysis/scripts/logs
3.cellprofiler_analysis/scripts/*.log
3.cellprofiler_analysis/notebooks/logs
3.cellprofiler_analysis/notebooks/*.logs

# analysis results
3.cellprofiler_analysis/analysis_output

# cytomining results
4.process_features/data/

5.scDINO_analysis/1.scDINO_run/outputdir/*
5.scDINO_analysis/1.scDINO_run/logs/*
5.scDINO_analysis/1.scDINO_run/.snakemake/*

5.scDINO_analysis/2.scDINO_processing/scripts/4.deploy_shiny_app.r
5.scDINO_analysis/2.scDINO_processing/notebooks/4.deploy_shiny_app.ipynb

6.data_harmonization/data/*


data/models/dino_deitsmall16_pretrain_full_checkpoint.pth
data/processed_images/crops/*


# cytomining logs
**runinfo
13 changes: 13 additions & 0 deletions 0.download_data/README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,13 @@
# Download Data

## About the data
The data are timelapse microscopy images of HeLa cells undergoing apoptosis.
The cells are treated with varying concentrations of staurosporine, a drug that induces apoptosis.
For more information about the data, see the [data README](../data/README.md).
We recieved this data through a collaboration with [Saguaro Technologies](https://www.saguarobio.com/), and [Yokogawa](https://www.yokogawa.com/us/solutions/products-and-services/life-science/).

## Downloading the data
This file is a place holder for the instructions downloading the data.
Typically data will stores in a remote location and will be downloaded using a script.
In the past we tend to use sites like `Zenodo`, `Figshare`, or `IDR` to store the data.
More information will be added in the future.
111 changes: 111 additions & 0 deletions 1.pre_process_data/notebooks/0.fix_pathing.ipynb
Original file line number Diff line number Diff line change
@@ -0,0 +1,111 @@
{
"cells": [
{
"cell_type": "markdown",
"metadata": {},
"source": [
"This notebook fixes issues with data paths by replacing spaces with underscores"
]
},
{
"cell_type": "code",
"execution_count": 1,
"metadata": {},
"outputs": [],
"source": [
"import pathlib"
]
},
{
"cell_type": "code",
"execution_count": 2,
"metadata": {},
"outputs": [],
"source": [
"# set data path\n",
"data_path = pathlib.Path(\"../../data\").resolve(strict=True)"
]
},
{
"cell_type": "code",
"execution_count": 3,
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"[PosixPath('/home/lippincm/Documents/4TB/data/live_cell_timelapse_apoptosis/data/20231017ChromaLive_6hr_4ch_MaxIP'),\n",
" PosixPath('/home/lippincm/Documents/4TB/data/live_cell_timelapse_apoptosis/data/20230920ChromaLiveTL_24hr4ch_MaxIP'),\n",
" PosixPath('/home/lippincm/Documents/4TB/data/live_cell_timelapse_apoptosis/data/20231017ChromaLive_endpoint w AnnexinV_2ch_MaxIP')]"
]
},
"execution_count": 3,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"# get all directories in data path\n",
"directories = [x for x in data_path.iterdir() if x.is_dir()]\n",
"directories"
]
},
{
"cell_type": "code",
"execution_count": 4,
"metadata": {},
"outputs": [],
"source": [
"# rename the directories to replace spaces with underscores\n",
"for directory in directories:\n",
" new_name = directory.name.replace(\" \", \"_\")\n",
" directory.rename(directory.parent / new_name)"
]
},
{
"cell_type": "code",
"execution_count": 5,
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"[PosixPath('/home/lippincm/Documents/4TB/data/live_cell_timelapse_apoptosis/data/20231017ChromaLive_6hr_4ch_MaxIP'),\n",
" PosixPath('/home/lippincm/Documents/4TB/data/live_cell_timelapse_apoptosis/data/20231017ChromaLive_endpoint_w_AnnexinV_2ch_MaxIP'),\n",
" PosixPath('/home/lippincm/Documents/4TB/data/live_cell_timelapse_apoptosis/data/20230920ChromaLiveTL_24hr4ch_MaxIP')]"
]
},
"execution_count": 5,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"# get all directories in data path\n",
"directories = [x for x in data_path.iterdir() if x.is_dir()]\n",
"directories"
]
}
],
"metadata": {
"kernelspec": {
"display_name": "timelapse_env",
"language": "python",
"name": "python3"
},
"language_info": {
"codemirror_mode": {
"name": "ipython",
"version": 3
},
"file_extension": ".py",
"mimetype": "text/x-python",
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.11.8"
}
},
"nbformat": 4,
"nbformat_minor": 2
}
Loading