In addition to our multi-omics approach to assessing ocean genomes and genes, we used a series of innovative, automated imaging tools to quantify the concentration, taxonomic composition, and morphological characteristics of plankton and non-living suspended particles across organismal size fractions. These included the Underwater Vision Profiler (UVP), ZooScan, FlowCAM, Imaging FlowCytobot (IFCB), flow cytometry, and a new high-content screening 3D-microscopy workflow (e-HCFM; Colin et al., eLife, in review). Together, these instruments cover a comprehensive organismal size range, from pico-plankton to large gelatinous zooplankton and marine snow, across different taxonomic and trophic groups (Romagnan et al., Sci. Data, in prep), and generated 11 different datasets (Table 2). More classical imaging techniques were also used to generate smaller datasets at higher resolution, including confocal (3D-CLSM) and electron (SEM and TEM) microscopy (Table 2). The full strategy was applied from the surface to 1000 m depth and has so far produced >6 million images of individual plankton from >9,200 size-fractionated plankton communities (>30 terabytes; Deliverables F14). In the absence of an international framework for sharing and annotating environmental images (the equivalent of GenBank for DNA sequences), we developed a web-based application, EcoTaxa (http://ecotaxa.sb-roscoff.fr/), which for the first time allows online archiving, exploration, and collaborative annotation of plankton images by experts worldwide. EcoTaxa also provides tools for computer-assisted image recognition (including deep learning algorithms) to accelerate time-consuming taxonomic assignment by experts. The taxonomy implemented in EcoTaxa corresponds to the universal eukaryotic framework developed online with the world community of expert taxonomists in the UniEuk effort (http://unieuk.org; Berney et al., J. Euk. Microb., 2017), allowing future cross-comparison between imaging and DNA sequencing data.

To date, Tara Oceans imaging data have been used to detect and validate organism interactions in combination with ‘omics’ analyses (e-HCFM in Lima-Mendez et al., Science, 2015; Mordret et al., ISME, 2016), to associate deep carbon sequestration with surface plankton networks (UVP and ZooScan in Roullier et al., Biogeosciences, 2014; Guidi et al., Nature, 2016), and to unveil the overlooked but highly significant biomass of rhizarian protists, which exceeds that of zooplankton in (sub)tropical oceans (UVP in Biard et al., Nature, 2016). The Tara Oceans (TO) imaging dataset represents a veritable treasure trove for future analyses in ecology (e.g. whole-plankton abundance/size slopes correlated with environmental parameters), comparative morphogenetics (e.g. correlations between shapes, genomes, and taxonomy), and evolution (e.g. analyses of cell/organism complexification through combined 3D microscopy and single-cell ‘omics’; see our 10Y-vision).

Table 2. Key features, size, and completion of imaging datasets generated from >9,200 plankton samples collected by the Tara Oceans (TO) and Tara Oceans Polar Circle (TOPC) expeditions.

| Imaging dataset (instrument & plankton samples) | Plankton size range | Voyage | # of plankton samples analysed | Sample imaging & processing | # of objects | Classification (predicted P, curated C) | EcoTaxa release |
|---|---|---|---|---|---|---|---|
| UVP | 100 > 1 mm | TO + TOPC | 776 | 100% | 769,497 | P + C (100%) | available |
| ZooScan 680 Regent | 100 > 0.68 mm | TO | 189 | 100% | 126,389 | P + C (36%) | available |
| ZooScan WP2 200 | 10 > 0.2 mm | TO + TOPC | 203 | 100% | 394,956 | P + C (95%) | available |
| ZooScan Multinet | 10 > 0.2 mm | TO | 285 | 100% | 397,723 | P + C (92%) | available |
| ZooScan Bongo 300 | 10 > 0.3 mm | TO | 92 | 100% | 154,624 | P + C (30%) | available |
| ZooScan 680 Regent | 10 > 0.68 mm | TOPC | 23 | 100% | 14,433 | P + C (16%) | available |
| ZooScan Bongo 300 | 10 > 300 µm | TOPC | 19 | 100% | 42,365 | P + C (8%) | available |
| FlowCAM 180 Bongo | 180 > 20 µm | TOPC | 317 | 100% | 704,053 | P + C (15%) | available |
| IFCB | 160 > 5 µm | TOPC | 6,982 | 100% | 2,307,437 | P + C (30%) | available |
| e-HCFM H5 5-20 | 20 > 5 µm | TO | 76 | 100% | 336,655 | P + C (5.5%) | available |
| e-HCFM H20 20-180 | 180 > 20 µm | TO + TOPC | 128 | 100% | >10^6 | | 2017 |
| e-HCFM H0.2 >0.2 | 0.2 µm | TOPC | 14 | 100% | | | 2017 |
| HiRes 3D-CLSM | 2000 > 1 µm | TO | 65 | 50% | 3,551 | C (100%) | 2017 |
| SEM | 200 > 0.1 µm | TO | 31 | STEFIexp | 427 | C (100%) | 2017 |
| TEM virus | < 0.1 µm | TO + TOPC | 43 | 100% | 4,300 | C (100%) | 2017 |
| TOTAL | | | 9,243 | | 6 million | | |
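
As a cross-check on the totals row of Table 2, the per-dataset counts can be tallied in a few lines of Python. This is only a minimal sketch: the names `TABLE_2` and `tally` are introduced here for illustration, the object count for e-HCFM H20 is assumed to read ">10^6" (a lower bound), and the blank object count for e-HCFM H0.2 is treated as unknown rather than zero.

```python
# Rows transcribed from Table 2: (dataset, voyage, samples, objects, is_lower_bound).
# objects=None means the table leaves the cell blank (assumed unknown, not zero).
# The e-HCFM H20 object count is ASSUMED to read ">10^6" and is kept as a lower bound.
TABLE_2 = [
    ("UVP", "TO + TOPC", 776, 769_497, False),
    ("ZooScan 680 Regent", "TO", 189, 126_389, False),
    ("ZooScan WP2 200", "TO + TOPC", 203, 394_956, False),
    ("ZooScan Multinet", "TO", 285, 397_723, False),
    ("ZooScan Bongo 300", "TO", 92, 154_624, False),
    ("ZooScan 680 Regent", "TOPC", 23, 14_433, False),
    ("ZooScan Bongo 300", "TOPC", 19, 42_365, False),
    ("FlowCAM 180 Bongo", "TOPC", 317, 704_053, False),
    ("IFCB", "TOPC", 6_982, 2_307_437, False),
    ("e-HCFM H5 5-20", "TO", 76, 336_655, False),
    ("e-HCFM H20 20-180", "TO + TOPC", 128, 1_000_000, True),
    ("e-HCFM H0.2 >0.2", "TOPC", 14, None, False),
    ("HiRes 3D-CLSM", "TO", 65, 3_551, False),
    ("SEM", "TO", 31, 427, False),
    ("TEM virus", "TO + TOPC", 43, 4_300, False),
]


def tally(rows):
    """Return (total samples, exactly counted objects, lower bound on all objects)."""
    total_samples = sum(samples for _, _, samples, _, _ in rows)
    # Objects with an exact count in the table.
    counted = sum(
        objects
        for _, _, _, objects, is_lower_bound in rows
        if objects is not None and not is_lower_bound
    )
    # Add the entries reported only as lower bounds; blank cells contribute nothing.
    lower_bound = counted + sum(
        objects
        for _, _, _, objects, is_lower_bound in rows
        if objects is not None and is_lower_bound
    )
    return total_samples, counted, lower_bound


if __name__ == "__main__":
    samples, counted, lower_bound = tally(TABLE_2)
    print(f"samples analysed: {samples}")
    print(f"objects with exact counts: {counted}")
    print(f"objects, lower bound: >{lower_bound}")
```

Under these assumptions, the sample counts sum to the 9,243 given in the totals row, and the object counts come to at least about 6.3 million, in line with the rounded figure of 6 million.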