Analysis of large proteomics datasets: pitfalls, challenges and solutions (#21)
Since it is becoming feasible to collect shotgun proteomics data on the scale of the whole human genome, it is crucial that computational workflows for the identification and quantification of peptides and proteins can handle mass spectral datasets of enormous size. The challenges are two-fold: (1) the computational efficiency requirements are daunting, including the need for a high degree of parallelization and efficient I/O; (2) procedures for controlling the false discovery rate (FDR) for peptides and proteins must be applied to ensure the validity of the results. We show how these and other crucial problems are solved in the MaxQuant software and present examples of its application to several large-scale proteomics datasets.
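To make the FDR-control point concrete, the following is a minimal sketch of the standard target-decoy approach to FDR estimation for peptide-spectrum matches (PSMs). This is an illustrative assumption, not MaxQuant's actual implementation; the function name, data layout, and 1% threshold are hypothetical.

```python
def fdr_filter(psms, threshold=0.01):
    """Keep target PSMs whose estimated FDR is at or below `threshold`.

    `psms` is a list of (score, is_decoy) tuples, where higher scores are
    better and `is_decoy` marks matches to reversed (decoy) sequences.
    Walking down the score-sorted list, the FDR at each cutoff is
    estimated as the number of decoy hits divided by target hits so far.
    """
    accepted = []
    decoys = targets = 0
    for score, is_decoy in sorted(psms, key=lambda p: p[0], reverse=True):
        if is_decoy:
            decoys += 1
        else:
            targets += 1
        # Estimated FDR among all PSMs scoring at or above this one.
        if decoys / max(targets, 1) > threshold:
            break
        if not is_decoy:
            accepted.append((score, is_decoy))
    return accepted

# Hypothetical toy data: three strong target hits, then a decoy hit
# that pushes the estimated FDR above 1%, truncating the list there.
hits = [(95, False), (90, False), (88, False), (60, True), (55, False)]
print(len(fdr_filter(hits)))  # prints 3
```

The same idea extends hierarchically to the protein level, where accepted peptides are aggregated and a separate decoy-based FDR is imposed on protein groups.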