Transcriptomic and proteomic data in developing tomato fruit

Isma Belouah; Camille Bénard; Alisandra Denton; Mélisande Blein-Nicolas; Thierry Balliau; Emeline Teyssier; Philippe Gallusci; Olivier Bouchez; Björn Usadel; Michel Zivy; Yves Gibon; Sophie Colombié

doi:10.1016/j.dib.2019.105015

Transcriptomic and proteomic data in developing tomato fruit

Data Brief. 2019 Dec 17:28:105015. doi: 10.1016/j.dib.2019.105015. eCollection 2020 Feb.

Affiliations

¹ UMR 1332 BFP, INRA, Univ Bordeaux, F33883, Villenave d'Ornon, France.
² Institute for Botany and Molecular Genetics, BioEconomy Science Center, Worringer Weg 3, RWTH Aachen University, Aachen, 52074, Germany.
³ PAPPSO, GQE - Le Moulon, INRA, Univ. Paris-Sud, CNRS, AgroParisTech, Université Paris-Saclay, 91190 Gif-sur-Yvette, France.
⁴ UMR EGFV, Université de Bordeaux, Institut national de la recherche agronomique, Institut des Sciences de la Vigne et du Vin, 210 Chemin de Leysotte, CS 50008, 33882 Villenave-d'Ornon, France.
⁵ INRA, US 1426, GeT-PlaGe, Genotoul, Castanet-Tolosan, France.

Abstract

Transcriptomic and proteomic analyses were performed on three replicates of tomato fruit pericarp samples collected at nine developmental stages, each replicate resulting from the pooling of at least 15 fruits. For transcriptome analysis, Illumina-sequenced libraries were mapped on the tomato genome with the aim to obtain absolute quantification of mRNA abundance. To achieve this, spikes were added at the beginning of the RNA extraction procedure. From 34,725 possible transcripts identified in the tomato, 22,877 were quantified in at least one of the nine developmental stages. For the proteome analysis, label-free liquid chromatography coupled to tandem mass spectrometry (LC-MS/MS) was used. Peptide ions, and subsequently the proteins from which they were derived, were quantified by integrating the signal intensities obtained from extracted ion currents (XIC) with the MassChroQ software. Absolute concentrations of individual proteins were estimated for 2375 proteins by using a mixed effects model from log₁₀-transformed intensities and normalized to the total protein content. Transcriptomics data are available via GEO repository with accession number GSE128739. The raw MS output files and identification data were deposited on-line using the PROTICdb database (http://moulon.inra.fr/protic/tomato_fruit_development) and MS proteomics data have also been deposited to the ProteomeXchange with the dataset identifier PXD012877. The main added value of these quantitative datasets is their use in a mathematical model to estimate protein turnover in developing tomato fruit.

Keywords: Absolute quantification; Pericarp; Protein turnover; Proteomics; Time-series; Tomato fruit development; Transcriptomics.