Segmentation data model prototype
Segmentation is the decomposition of 3D volumes into regions that can be associated with defined objects. Following several consultations with the EM community (Patwardhan et al., 2012; Patwardhan et al., 2014; Patwardhan et al., 2017), the EMDB developed a prototype to explore supporting the deposition of volume segmentations with structured biological annotation, defined here as the association of data with identifiers (e.g., accession codes from UniProt) and ontologies taken from well-established bioinformatics resources. To our knowledge, none of the segmentation formats widely used in electron microscopy and related fields currently supports structured biological annotation. Third-party use of segmentations is further impeded by the large number of segmentation file formats in use and their lack of interoperability.
EMDB therefore proposed an open segmentation file format, EMDB-SFF, to capture basic segmentation data from application-specific segmentation file formats and to provide the means for structured biological annotation. In this way, file formats like EMDB-SFF could not only enable the deposition of segmentations but also act as an interchange format between different applications and facilitate analysis of 3D reconstructions. Furthermore, EMDB-SFF prototypes the description of multiple transforms for a segment, allowing a segment to describe the placement of a sub-tomogram average in a tomographic reconstruction.
Model
EMDB-SFF files have the following features:
- Segmentation metadata:
  - name
  - version (of schema)
  - details (free-form text)
  - global external references, e.g. specimen scientific identifier
  - bounding box
  - primary descriptor contained, i.e. one of ‘three_d_volume’, ‘mesh_list’, or ‘shape_primitive_list’ (see schema documentation)
  - list of software used to create the segmentation (name, version, processing details)
  - list of transforms referenced by segments, e.g. the transform that places a sub-tomogram average in the tomogram
- Hierarchical ordering of segments through the use of segment IDs and parent IDs;
- Four geometrical representations of segments (volumes, contours, meshes, shapes);
- Storage of sub-tomogram averages and of how they map into the parent tomogram through the use of transforms;
- List of associated external references per segment;
- List of associated complexes and macromolecules in a related EMDB entry.
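The hierarchy and transform features listed above can be illustrated with a minimal Python sketch. It is not the EMDB-SFF schema or the sfftk API: the class names `Segment` and `Transform`, the field names, the convention that a parent ID of 0 means "no parent", and the 3x4 row-major matrix layout are all assumptions made for illustration.

```python
from collections import defaultdict
from dataclasses import dataclass, field


@dataclass
class Transform:
    """Hypothetical transform: a 3x4 affine matrix stored as three rows."""
    id: int
    matrix: list  # each row is [r0, r1, r2, translation]

    def apply(self, point):
        # Multiply the 3x3 part by the point, then add the translation column.
        x, y, z = point
        return tuple(r[0] * x + r[1] * y + r[2] * z + r[3] for r in self.matrix)


@dataclass
class Segment:
    """Hypothetical segment carrying only the fields needed for this sketch."""
    id: int
    parent_id: int = 0  # assumption: 0 marks a top-level segment
    name: str = ""
    transform_ids: list = field(default_factory=list)


def children_by_parent(segments):
    """Group segment IDs under their parent ID to recover the hierarchy."""
    tree = defaultdict(list)
    for seg in segments:
        tree[seg.parent_id].append(seg.id)
    return dict(tree)


segments = [
    Segment(id=1, name="tomogram region"),
    Segment(id=2, parent_id=1, name="sub-tomogram average, copy 1", transform_ids=[1]),
    Segment(id=3, parent_id=1, name="sub-tomogram average, copy 2", transform_ids=[2]),
]
transforms = {
    1: Transform(1, [[1, 0, 0, 10], [0, 1, 0, 20], [0, 0, 1, 30]]),  # pure translation
    2: Transform(2, [[0, -1, 0, 5], [1, 0, 0, 5], [0, 0, 1, 0]]),    # 90 degree turn about z, then shift
}

tree = children_by_parent(segments)
# Place a point of the average into the tomogram using the second copy's transform.
placed = transforms[segments[2].transform_ids[0]].apply((1.0, 0.0, 0.0))
print(tree)
print(placed)
```

Keeping transforms in a shared list and referring to them by ID from each segment means that one averaged shape can be placed many times without duplicating its geometry.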
Each segment in a segmentation can consist of two types of descriptors:
- textual descriptors;
- geometric descriptors.
Textual descriptors consist of either free-form text or standardised terms. Standardised terms should be drawn from a published ontology or list of identifiers.
Geometric descriptors can take one or more of the following representations:
- ‘three_d_volume’ for 3D volumes;
- ‘mesh_list’ for lists of meshes each of which consists of a set of vertices and polygons;
- ‘shape_primitive_list’ for lists of shape primitives (ellipsoid, cuboid, cone, cylinder).
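The split between textual and geometric descriptors can be sketched as follows. This is an illustrative Python model only, not the schema itself: the class names, the `resource`/`accession` fields, the placeholder accession string, and the shape-primitive payload keys are hypothetical. Only the three representation names are taken from the text above.

```python
from dataclasses import dataclass, field

# Representation names as given in the list above.
GEOMETRIC_REPRESENTATIONS = ("three_d_volume", "mesh_list", "shape_primitive_list")


@dataclass
class ExternalReference:
    """Hypothetical standardised term: a resource plus an identifier in it."""
    resource: str
    accession: str


@dataclass
class SegmentDescriptors:
    """Hypothetical container for one segment's descriptors."""
    description: str = ""                                     # free-form text
    external_references: list = field(default_factory=list)   # standardised terms
    geometry: dict = field(default_factory=dict)              # representation name -> payload

    def add_geometry(self, kind, payload):
        # Reject representation names that the model does not define.
        if kind not in GEOMETRIC_REPRESENTATIONS:
            raise ValueError(f"unknown geometric representation: {kind!r}")
        self.geometry[kind] = payload


descriptors = SegmentDescriptors(description="example segment")
descriptors.external_references.append(
    ExternalReference(resource="UniProt", accession="<accession>")  # placeholder, not a real code
)
# The payload keys below are invented for the sketch.
descriptors.add_geometry("shape_primitive_list", [{"shape": "ellipsoid", "x": 10.0, "y": 5.0, "z": 5.0}])
print(sorted(descriptors.geometry))
```

A segment may carry more than one geometric representation, so the sketch keys the geometry by representation name rather than storing a single value.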
Documentation
Download
The current schema (version 0.8.0.dev1) is available here.
Documentation
Complete documentation of the schema is available here.
Auxiliary Tools
sfftk-rw
sfftk-rw is a Python toolkit for reading and writing EMDB-SFF files only. It is part of a family of tools designed to work with EMDB-SFF files.
sfftk-rw has the following utilities:
- convert - interconvert between the XML, HDF5 and JSON file formats of the EMDB-SFF data model;
- view - view a file summary.
The full documentation is available at readthedocs.
Download
The latest version (0.7.1) runs only on Python 3 and may be installed using pip install sfftk-rw. Alternatively, the source code is available from GitHub.
sfftk
sfftk provides a shell command and a Python API to process EMDB-SFF files.
The following utilities are available using sfftk:
- convert - Conversion of application-specific segmentation file formats to EMDB-SFF. Currently, sfftk supports the following formats:
  - AmiraMesh (.am)
  - Amira HyperSurface (.surf)
  - Segger (.seg)
  - EMDB Map masks (.map)
  - Stereolithography (.stl)
  - IMOD (.mod)
- notes - Annotation of EMDB-SFF files.
- view - Brief summaries of segmentation files.
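As a small illustration of the supported input formats listed above, the following Python sketch maps a file extension to its format name. It is a hypothetical helper, not part of sfftk: the names `SUPPORTED_FORMATS` and `guess_format` are invented here, and sfftk's own format detection may work differently.

```python
from pathlib import Path

# Extensions and format names copied from the list of supported formats above.
SUPPORTED_FORMATS = {
    ".am": "AmiraMesh",
    ".surf": "Amira HyperSurface",
    ".seg": "Segger",
    ".map": "EMDB Map masks",
    ".stl": "Stereolithography",
    ".mod": "IMOD",
}


def guess_format(filename):
    """Return the format name for a segmentation file, or None if unsupported."""
    return SUPPORTED_FORMATS.get(Path(filename).suffix.lower())


print(guess_format("cell_membrane.SEG"))
print(guess_format("notes.txt"))
```

Lower-casing the suffix makes the lookup tolerant of upper-case extensions, while unknown extensions yield None rather than an error.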
Read the full documentation here.
Download
The latest development version (version 0.5.5.dev1) of sfftk may be downloaded/installed from PyPI or the source may be obtained from GitHub.