Skip to content
/ PLF Public

Peptide location fingerprinting (PLF) is a technique capable of identifying modified proteins and potential causal mechanisms in complex biological samples. Protein Locational Fingerprinter determines statisticaly significant peptide yields from protein functional domains.

License

Notifications You must be signed in to change notification settings

maxozo/PLF

Repository files navigation

Peptide location fingerprinting (PLF) is a technique capable of identifying modified proteins and potential causal mechanisms in complex biological samples. In standard proteomics, proteins are trypsinised which generate peptides whose sequence identities and relative abundances are measured by LC-MS/MS. During this process most proteins are only partially digested, due to differing solubilities, stabilities and enzyme susceptibilities related to their higher order structures. By mapping and quantifying LC-MS/MS-detected peptides within specific regions, PLF enables the detection of statistical differences in the regional digestibility along the protein structure due to ageing and disease mechanisms.

Quick start

Instalation

python version 3.9 is required

Create a new virtual python enviroment

     python -m venv mplf_venv

Activate the enviroment (Linux and Mac):

       source mplf_venv/bin/activate

Activate the enviroment (Windows)

       .\mplf_venv\Scripts\activate.bat

Clone the PLF repo:

      git clone https://github.com/maxozo/PLF.git

Install requirements:

       cd PLF
       pip install -r requirements.txt          

Test data

To just try to run the analysis on test data please run (thats also available for download on our MPLF website ):

     python PLF.py --test --outname My_Test_Run

Note: My_Test_Run can be a path/to/My_Test_Run/Filename

Your own data

  1. Prepeare a file that lists Protein name (optional if source protein not determined), Peptide sequence (remove any special characters from these), Sample of protein belionging and spectra (can be multiple columns as per: spectra_1,spectra_2, etc. -- these will be added up): as per this file.
Protein Sample Peptide spectra spectra_2 spectra_3 spectra_4
FBLN1_HUMAN 20180601_SherrattM_MO_15.raw (Full_Skin_15) CLAFECPENYR 0 1 0 0
FBLN1_HUMAN 20180601_SherrattM_MO_15.raw (Full_Skin_15) CVDVDECAPPAEPCGK 0 1 0 0
  1. Prepeare a tsv file that lists the experimental design - as per this file. If paired make sure that the rows list matching pairs, otherwise any order is ok.
forearm buttock
sample1 sample2
sample3 sample4
sample6 sample7
  1. Run the MPLF pipeline:

      python ../../PLF.py --experimental_design Experiment_feed.tsv --peptides Sample_Data_For_Analysis.csv --spiecies HUMAN --domain_types DOMAINS,REGIONS,TOPO_DOM,TRANSMEM,REPEAT,50AA,75AA,100AA --paired True --outname MPLF_RUN --p_threshold 0.05
    

Params:

      Required:

      --experimental_design  This allows to provide the experimental defign file file

      --peptides   This allows to provide your peptides file

      --spiecies   The spiecies of the peptides

      --paired    Is the samples specified in experimental_design paired or unpaired

      --outname  The name of the output files

      Optional:

      --cpus     (default=max available) How many cpus to use for analysis.

      --p_threshold    (dafault=0.05) Only return proteins that has at least one domain with a significance threshold lover or equal to specified
  1. Results will produce two files {outname specified}.tsv and {outname specified}.mplf file. TSV file will list all the domains, their p values, quantified data, normalised data etc. MPLF file can be uploaded to Manchester Proteome Location Fingerprinter (MPLF) to perform visualisations of the data.

Methods

For details please read our publication

References (please cite)

About

Peptide location fingerprinting (PLF) is a technique capable of identifying modified proteins and potential causal mechanisms in complex biological samples. Protein Locational Fingerprinter determines statisticaly significant peptide yields from protein functional domains.

Topics

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages