Log on / register
BioMed Central home | Journals A-Z | Feedback | Support | My details
Open AccessResearch

An informatic pipeline for the data capture and submission of quantitative proteomic data using iTRAQTM

Jennifer A Siepen1* email, Neil Swainston2* email, Andrew R Jones1,3 email, Sarah R Hart4 email, Henning Hermjakob5 email, Philip Jones5 email and Simon J Hubbard1 email

Faculty of Life Sciences, University of Manchester, M13 9PT, UK

Manchester Interdisciplinary Biocentre, University of Manchester, UK

School of Computer Science, Faculty of Engineering and Physical Sciences, University of Manchester, UK

MBCMS, School of Chemistry, Manchester Interdisciplinary Biocentre, University of Manchester, UK

EMBL Outstation EBI, Wellcome Trust Genome Campus, Hinxton, Cambs, UK

author email corresponding author email* Contributed equally

Proteome Science 2007, 5:4doi:10.1186/1477-5956-5-4

Published: 1 February 2007

Abstract

Background

Proteomics continues to play a critical role in post-genomic science as continued advances in mass spectrometry and analytical chemistry support the separation and identification of increasing numbers of peptides and proteins from their characteristic mass spectra. In order to facilitate the sharing of this data, various standard formats have been, and continue to be, developed. Still not fully mature however, these are not yet able to cope with the increasing number of quantitative proteomic technologies that are being developed.

Results

We propose an extension to the PRIDE and mzData XML schema to accommodate the concept of multiple samples per experiment, and in addition, capture the intensities of the iTRAQTM reporter ions in the entry. A simple Java-client has been developed to capture and convert the raw data from common spectral file formats, which also uses a third-party open source tool for the generation of iTRAQTM reported intensities from Mascot output, into a valid PRIDE XML entry.

Conclusion

We describe an extension to the PRIDE and mzData schemas to enable the capture of quantitative data. Currently this is limited to iTRAQTM data but is readily extensible for other quantitative proteomic technologies. Furthermore, a software tool has been developed which enables conversion from various mass spectrum file formats and corresponding Mascot peptide identifications to PRIDE formatted XML. The tool represents a simple approach to preparing quantitative and qualitative data for submission to repositories such as PRIDE, which is necessary to facilitate data deposition and sharing in public domain database. The software is freely available from http://www.mcisb.org/software/PrideWizard webcite.


© 1999-2010 BioMed Central Ltd unless otherwise stated. Part of Springer Science+Business Media.