permalink |
---|
/ |
This is a short guide about how to use the PaccMann UI and what is happening behind the curtain.
PaccMann
acronyms Prediction of AntiCancer Compound sensitivity with Multimodal Attention-based Neural Networks.
The purpose of the project is to develop an user-friendly and interpretable interface to perform in-silico IC50 drug screenings for cancer cell lines. The screening is performed from the combination of a chemical compound and the transcriptomics profile of the cell line of interest.
The purpose of the PaccMann UI is to provide a PaccMann pipeline from input to result as an easy to use service for the community.
Access the service by clicking Start
on the main page.
NOTE: this landing page is used the initialize the anonymous session that allows to retrieve the predictions you run in PaccMann for one week. It is also used to report links to the Terms of Service and the Privacy settings.
The home page will present to the user a form to input data and submit predictions.
The accordion below the navbar contains general information on the web service usage.
PaccMann supports two type of inputs: molecules and gene expression data.
Chemicals are represented as SMILES (Simplified Molecular Input Line Entry System
) sequences, i.e. an inline text notation for molecules. This 1D abstraction is obtained by a molecular graph traverse resulting in a list of atoms, bonds and meta-symbols.
For example the TKI Masitinib (commercial name Masivet) can be represented as:
CC1=C(C=C(C=C1)NC(=O)C2=CC=C(C=C2)CN3CCN(CC3)C)NC4=NC(=CS4)C5=CN=CC=C5
Services such as PubChem or ZINC can help to retrieve the SMILES sequence of known chemicals.
The user can have more information on the SMILES handling we perform by clicking the corresponding info button.
The user can fill the SMILES field with a chemical or select molecules from a dropdown menu (molecules are sorted alphabetically).
If the user is unfamiliar with SMILES, or prefers to draw the molecules directly, the molecule editor offers a user-friendly way to explore and test molecules interactively.
The molecule editor can be opened by clicking Molecule Editor
and offers various functionalities.
The user can even edit drugs after selection, just pick a molecule in the dropdown.
The accordion below the navbar briefly describes how to use the molecule editor.
The left panel contains the tools for editing the molecule, e.g. the erase tool
(to erase specific atoms or bonds), the bond tool
(to add a bond from a set of typical bonds), the ring structure tool
(to add a ring from a set of typical rings) and the atom and formula tool
which allows to insert new atoms.
Upon finishing the drawing, the user has to confirm the molecule by clicking the grey check button at the bottom of the left panel.
The tools at the top panel allow the user to import own data from a variety of formats including .smi
, .mol
, .cml
, .kcj
, .rxn
, .json
and others. Upon finishing the molecule design the user can alternatively simply download and export the molecule into one of the aforementioned formats.
By clicking the button Done
the user can go back to the submission page.
The user can provide custom data for the prediction. An example can be downloaded from here. In case the user does not provide data the prediction will be performed on all the cell lines from the GDSC (v2) and CCLE studies. Keep in mind this might take up to 5 minutes. No worries, the service run predictions in an asynchronous fashion and use cookies to leave the user session open, so that there is no need to leave the browser tab open.
Information on the genes supported by the model, preprocessing routines and an exmple file, can be accessed via the info tooltip.
The omic data should be provided as
.csv
with genes in the columns and one row
per cell line. The order of the columns does not need to adhere to the order we
provide in our example and additional genes are discarded. Missing genes are
mean-imputed (with 0s).
The set of 2128 genes used for the predictions can be also downloaded here.
Additionally PaccMann provides the possibility to associate a confidence value to its estimates.
Once the parameters are set the form is ready for submission.
By pressing the Predict
button the user will submit a PaccMann job and, upon completion, a Results
button will appear allowing the user to analyse and visualize the data.
The predictions are rendered in an interactive and downloadable containing IC50 estimate, the confidence is requested and metadata if provided in the uploaded CSV.
The table can be sorted in ascending or descending order by every column. The columns can also be filtered by typing a string of interest, e.g. in order to search for the result of a specific sample or show the predictions for all samples of a tissue type.
The width and the vertical ordering of the columns may also be changed according to the user's preferences.
This interactive (filtered) table can be downloaded by clicking the blue Download CSV
button at the bottom of the results section.
The bottom part of the page shows the attention results. On the left a heatmap for the molecule provided with a colorbar shows parts of the molecule attended when performing the predictions. On the right attended genes are shown.
By default the average over the considered samples is reported.
The gene visualization allows to use a slider to show any given number of top attended genes.
Different samples can be selected via a dropdown rendering different attention profiles for specific samples.
Drug attention plots can be downloaded in vector graphic format in a zip file.
Gene attention plots can be downloaded using the tools embedded in the visualization.
The preferred browser for PaccMann website is Firefox. Here is a complete table for the browsers and OS with verified functionality.
OS | Chrome | Edge | Safari | Firefox |
---|---|---|---|---|
Linux | ☑ | N.A. | N.A. | ☑ |
Windows | ☑ | N.A. | ☑ | |
MacOS | ☑ | N.A. | ☑ | ☑ |