🧠Deep Learning-based✔️ Ovarian Cancer🙅🏻♀️ Subtypes Identification using Multi-Omics Data✅
- Garima Singh (1806143)
- Mrinal (1806149)
24 August 2020
- Long-Yi Guo
- Ai-Hua Wu,
- Yong-xia Wang,
- Li-ping Zhang,
- Hua Chai
- Xue-Fang Liang
- Second School of Clinical Medicine, Guangzhou University of Chinese Medicine, Guangzhou, 510020, China
- Center for Reproductive Medicine, Guangdong Hospital of Traditional Chinese Medicine, Guangzhou, 510120, China
PMID: 32863885
PMCID: PMC7447574
DOI: 10.1186/s13040-020-00222-x
The model used by us is a logistic regression classification model which uses K-means clustering techniques. We have chosen to align with the paper specifications and techniques as closely as possible but changed certain values to improve accuracy score. The multi-omics data of patients are inputted into the Denoising Autoencoder for generating z. With the help of generated z, the patients are clustered using k-means. The optimal number of clusters was determined using silhouette score. In the model, we tested the k from [2, 8] and we finally used k=2 as it had the highest silhouette score. After obtaining the labels clustered by k-means, we built a light-weighted mRNA model for reducing the number of genes needed to identify cancer subtypes by using a logistic regression algorithm.
This is the python code for denoising autoencoders (DAE)🚀 and the k-means using the reconstructed features. The silhouette scores and Davies Bouldin scores (DBI) were used to evaluate the clustering performances.
- https://jovian.ai/garimasingh128/deep-learning-based-ovarian-cancer-subtypes-identification-using-multi-omics-data
- https://colab.research.google.com/drive/1_ruztGS5i1h9ugO28h_foaK_0Sczsq_H?usp=sharing
- https://colab.research.google.com/drive/1dj2WfGTwTJYjATTJXxSUNyGZuRD2FW5O?usp=sharing
- We designed a novel deep learning-based framework for ovarian cancer subtype identification, and a logistic regression method was used to build the light-weighted classification model.
- Compared to identifying subtypes using single omics data, the multi-omics data analysis can utilize more information. Hence, we proposed a model which in turn would help to robustly identify ovarian cancer subtypes.
- Ovarian cancer ranks 5th in cancer death among women. It has a high mortality rate. Also the risk of getting ovarian cancer is quite high. So, identifying molecular subtypes of ovarian cancer is important.
🚀 It is important to know more about the ovarian cancer heterogeneity between different patients for choosing different treatment programs and predicting clinical outcomes. In this study we proposed a novel deep learning framework for integrating multi-omics data with denoising autoencoders for identifying the ovarian cancer subtypes. Two subtypes from the molecular level were identified in ovarian cancer, and the results show our proposed method is competitive and reliable. The method comparison results indicated our method out-performed than the traditional and deep learning-based methods. More importantly, the classification model was proved by three independent test datasets collected from GEO. All the p-values less than 0.05 show that the differences between the classified cancer subgroups are significant.
By combining the results in DEG and WGCNA analysis, we selected 34 target genes related to ovarian cancer. And using these 34 identified genes, 19 KEGG pathways were enriched including PI3K-Akt signaling pathway and human papillomavirus infection pathway. The literature review shows 19 (56%) biomarkers and 8(42.1%) KEGG pathways identified based on the classification subtypes have been proved to be associated with ovarian cancer.
- https://biodatamining.biomedcentral.com/articles/10.1186/s13040-020-00222-x
- https://linkinghub.elsevier.com/retrieve/pii/S0090825810002623
- https://clincancerres.aacrjournals.org/content/14/16/5198
All the data analyzed during the current study are available in the TCGA and GEO datasets.
The project is created using Python in Jupyter lab.
- Clone the repo
git clone github.com/your_usernameOvarian-Cancer-Subtypes-Identification.git
- Open the folder in your favorite code editor and start adding modifications.
-
Put all the code you want to in necessary folders.
-
Push all the code to your own branch. Once you are sure it is working, merge it with the
dev
branch. Let's maintain only the stable and released versions on themaster
branch. -
Write a kick-ass, readable, and clean code.
Read these articles to get a quick grab on Git and Github:
- https://try.github.io/
- https://docs.github.com/en/github/getting-started-with-github/git-and-github-learning-resources
- https://guides.github.com/activities/hello-world/
Feel free to create issues to suggest and add functionalities and features.
- Google Chrome
- Git
- Code Editor (Jupyter preferrably)
- Python
Please read CONTRIBUTING.md for information on how to contribute to Ovarian-Cancer-Subtypes-Identification.
We want to facilitate a healthy and constructive community behavior by adopting and enforcing our code of conduct.
Please adhere towards our code-of-conduct.md.