Cluster_MST

This Panel app performs a clustering of small molecules based on their structures and visualizes the results as points and connections in a Minimum Spanning Tree (MST), where the points can be colored according to an activity column present in the input file, e.g. from HTS results.

In addition to Panel, the main dependencies of this app are:

RDKit - open source cheminformatics toolkit
Pandas - Python data analysis library
Networkx - implementation of the MST algorithm
graphviz

Usage

The main input for the app is a tab-separated file (<*.tsv>) with at least three columns.

<Compound identifier>: A column with unique identifiers of the compounds in the file
<Activity>: A column with the activitiy for the coloring of the points. If higher values do NOT mean higher activity, check the "Reverse" checkbox.
"Smiles": A column with the name "Smiles" that contains the structures encoded as SMILES.

The app takes the top N active compounds ("Top N active", default: 50) and adds the most similar compounds for each of these ("Number of similar compounds", default: 10), downto a minimum similarity cutoff ("Similarity cutoff", default: 0.6) using the chosen fingerprint method, then generates the MST.

Generally, only linear-scaled values should be used for Activity, e.g. percentages. IC50 values should be converted to pIC50. The points in the plot can be selected using the lasso tool, and the selected compounds are shown in the table below the plot.

Screenshot of the app with an MST generated from a random selection of ~1500 Aurora Kinase A inhibitors, downloaded from ZINC

When you hover over a point in the plot, the structure is displayed as tooltip, together with the Identifier and the Activitiy value:

Selecting points with the lasso tool shows the selected compounds in a downloadable table:

Installation & Running the App

Installation

Download this repository
Change into the dowloaded directory
Create and activate a Python virtual environment, e.g. with conda
Pip-install the dependencies and the package
Download and install the dependency jupy_tools into the same Python virtual env

git clone https://github.com/apahl/cluster_mst
cd cluster_mst

conda create -n clmst python=3.11
conda activate clmst

pip install .

# Install jupy_tools into the same virt. env as described in that repo.

Or install the dependencies also with conda and then pip-install just the package:

git clone https://github.com/apahl/cluster_mst
cd cluster_mst

conda create -n clmst python=3.11 panel holoviews
conda activate clmst

pip install .

# Install jupy_tools into the same virt. env as described in that repo.
# Use `conda install` instead of `conda create`.

Running

To start the Panel app, activate the Python virtual env, if not already done and navigate to the app folder:

# Only perform the following step when the virtual env is not already activated:
conda activate clmst

cd <install-dir>/cluster_mst/apps
panel serve app_cluster_mst.py

Navigate to http://localhost:5006/app_cluster_mst in your browser.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
cluster_mst/app		cluster_mst/app
res		res
.gitignore		.gitignore
LICENSE.txt		LICENSE.txt
README.md		README.md
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Cluster_MST

Usage

Installation & Running the App

Installation

Running

About

Releases

Packages

Languages

License

apahl/cluster_mst

Folders and files

Latest commit

History

Repository files navigation

Cluster_MST

Usage

Installation & Running the App

Installation

Running

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages