Happy Holidays and New year from the EMDB! Please be advised that the EMDB team will be operating at reduced capacity between December 15th - January 12th. With this in mind we would ask that you be prepared for longer reply times through both our helpdesk system and the OneDep deposition system. well done on a productive 2025, wishing you all the best and we will see you in 2026!
Segmentation data model prototype
Segmentation is the decomposition of 3D volumes into regions that can be associated with defined objects. Following several consultations with the EM community (Patwardhan et al., 2012; Patwardhan et al., 2014; Patwardhan et al., 2017), the EMDB developed a prototype to explore supporting the deposition of volume segmentations with structured biological annotation which is here defined as the association of data with identifiers (e.g., accession codes from UniProt) and ontologies taken from well established bioinformatics resources. To our knowledge, none of the segmentation formats widely used in electron microscopy and related fields currently support structured biological annotation. Third party use of segmentations is further impeded by the prevalence of segmentation file formats and their lack of interoperability. EMDB therefore proposed an open segmentation file format called EMDB-SFF to capture basic segmentation data from application-specific segmentation file formats and provide the means for structured biological annotation. In this way, file formats like EMDB-SFF could not only enable depositions of segmentations but also act as a file interchange format between different applications and facilitate analysis of 3D reconstructions. Furthermore EMDB-SFF prototypes the description of multiple transforms for a segment, thus allowing a segment to be used to describe the placement of a sub-tomogram average onto a tomographic reconstruction.
Model
EMDB-SFF files have the follow features:
- Segmentation metadata:
- name
- version (of schema)
- details (free-form text)
- global external references, e.g. specimen scientific identifier
- bounding box
- primary descriptor contained i.e. one of ‘three_d_volume’, ‘mesh_list’, or ‘shape_primitive_list’ (see schema documentation)
- list of software used to create the segmentation (name, version, processing details)
- list of transforms referenced by segments e.g. transform to place the sub-tomogram average in the tomogram
- Hierarchical ordering of segments through the use of segment IDs and parent IDs;
- Four geometrical representations of segments (volumes, contours, meshes, shapes);
- Can store subtomogram averages and how they map into the parent tomogram through the use of transforms;
- List of associated external references per segment;
- List of associated complexes and macromolecules in a related EMDB entry
Each segment in a segmentation can consist of two types of descriptors:
- textual descriptors;
- geometric descriptors.
Textual descriptors consist of either free-form text or standardised terms. Standard terms should be provided from a [published] ontology or list of identifiers.
Geometric descriptors can take one or more of the following representations:
- ‘three_d_volume’ for 3D volumes;
- ‘mesh_list’ for lists of meshes each of which consists of a set of vertices and polygons;
- lists of shape primitives (ellipsoid, cuboid, cone, cylinder).
Documentation
Download
The current schema (version 0.8.0.dev1) is available here.
Documentation
Complete documentation of the schema is available here.
Auxiliary Tools
sfftk-rw
sfftk-rw is a Python toolkit for reading and writing EMDB-SFF files only. It is part of a family of tools designed to work with EMDB-SFF files.
sfftk-rw has the following utilities:
- convert - interconvert between XML, HDF5 and JSON file formats of the EMDB-SFF data model;
- view - view a file summary
The full documentation is available at readthedocs.
Download
The latest version runs only on Python 3 (version 0.7.1) and may be installed using pip install sfftk-rw. Alternatively, feel free to obtain the source code from Github.
sfftk
sfftk provides a shell command and a Python API to process EMDB-SFF files.
The following utilities are available using sfftk:
- convert - Conversion of application-specific segmentation file formats to EMDB-SFF. Currently, sfftk supports the following formats:
- AmiraMesh (.am)
- Amira HyperSurface (.surf)
- Segger (.seg)
- EMDB Map masks (.map)
- Stereolithography (.stl)
- IMOD (.mod)
- notes - Annotation of EMDB-SFF files.
- view - Brief summaries of segmentation files.
Read the full documentation here.
Download
The latest development version (version 0.5.5.dev1) of sfftk may be downloaded/installed from PyPI or the source may be obtained from GitHub.
Quick links
Recent Entries
(Show all)a5b3 GABAAR bound to Etomidate, GABA, and Mb25 in a desensitized state in saposin nanodiscs
a5b3 GABAA Receptor bound to GABA and Mb25 in desensitized state in detergent micelles
a5b3 GABAA Receptor in 1 a5 to 4 b3 stoichiometry in desensitized state
a5b3 GABAAR bound to GABA and Mb25 in a desensitized state in saposin nanodiscs after long GABA treatment
Hexamer Msp1 from S.cerevisiae (with a catalytic dead mutation) in complex with an unknown peptide substrate
Core filament of the spirochete periplasmic flagella of Leptospira biflexa
core filament of the spirochete periplasmic flagella of Leptospira biflexa wild type
core filament of the spirochete periplasmic flagella of Leptospira biflexa deleted fcpB strain
Octamer Msp1 from S.cerevisiae(with a catalytic dead mutation) in complex with an unknown peptide substrate
core filament of the spirochete periplasmic flagella of Leptospira biflexa from the flaA2-complemented stain
Straight and symmetrical filament of the spirochete periplasmic flagella of Leptospira biflexa
core filament of the spirochete periplasmic flagella of Leptospira biflexa from the deleted fcpB_CL13 strain
Sheathed filament of the spirochete periplasmic flagella of Leptospira biflexa from the deleted fcpB_CL13 strain
Straight and symmetrical filament of the spirochete periplasmic flagella of Leptospira biflexa deleted fcpB strain
Heptamer Msp1 from S.cerevisiae(with a catalytic dead mutation) in complex with an unknown peptide substrate
sheathed filament of the spirochete periplasmic flagella of Leptospira biflexa wild type
sheathed filament of the spirochete periplasmic flagella of Leptospira biflexa from the flaA2-complemented stain
Negative stain EM map of EBV glycoprotein gH/gL in complex with glycoprotein gp42 and FAB ATX-42-1.1 partially open conformation
Negative stain EM map of EBV glycoprotein gH/gL in complex with glycoprotein gp42 and FAB ATX-42-2 closed conformation
Negative stain EM map of EBV glycoprotein gH/gL in complex with glycoprotein gp42 and HLA-DR1 Beta chain
Negative Stain EM map of EBV glycoprotein gp350 in complex with ATX-350-2 FAB
Negative Stain EM map of EBV glycoprotein gp350 in complex with 72A1 FAB
Negative Stain EM map of EBV glycoprotein gp350 in complex with ATX-350-1 FAB
Negative Stain EM map of EBV glycoprotein gp350 in complex with ATX-350-1 FAB and 72A1 FAB
Negative stain EM map of EBV glycoprotein gH/gL in complex with glycoprotein gp42 and FAB ATX-42-2 Open conformation
Globular domain of monkeypox virus OPG153 (A28) in complex with antibodies 08E11 and 12I12
Globular domain of monkeypox virus OPG153 (A28) bound to antibody 02M12
Negative stain EM map of EBV glycoprotein gH/gL in complex with glycoprotein gp42 and FAB ATX-42-1.1 open conformation
Cryo-EM structure of MERS-CoV nsp10-nsp14 (E191A) in complex with T20P14-G RNA
Cryo-EM structure of MERS-CoV nsp10-nsp14 (E191A) in complex with T20P14-A RNA
Cryo-EM structure of MERS-CoV nsp10-nsp14 (E191A) in complex with T20P14-U RNA
Cryo-EM structure of MERS-CoV nsp10-nsp14 (E191A) in complex with T20P15 RNA, dimeric form
Cryo-EM structure of MERS-CoV nsp10-nsp14 (E191A) in complex with T20P15 RNA, monomeric form
In situ HHT and CHX treated human eEF1A-A/T-P-Z state 80S ribosome
Structure of E. Coli DNA protection during starvation protein (DPS) from single particle cryoEM
In situ human unrotated hibernating without CCDC124 state 80S ribosome
In situ human unrotated hibernating with CCDC124 state 80S ribosome
In situ human Hibernating class1 (rotate3) without E tRNA state 80S ribosome
In situ human Hibernating rotate3 with Z site tRNA state 80S ribosome
In situ human Hibernating rotate 3 with E-site tRNA state 80S ribosome
C. thermocellum UvrA-UvrB in complex with DNA with a fluorescein modification and AMPPNP: consensus map
C. thermocellum UvrA-UvrB in complex with DNA with a fluorescein modification and AMPPNP: focused map
Structure of the Chaetomium thermophilum Pmt4 homodimer (C2 symmetry)
Structure of the Chaetomium thermophilum Pmt4 homodimer (C1 symmetry)
Structure of human 1918 influenza A polymerase heterotrimer in complex with 1918 NEP.
Yeast 80S with nascent chain in complex with Ssb1-ADP in the S1 state
Structure of human 1918 influenza A polymerase heterotrimer in complex with WSN NEP.
structure of bundle-shaped PBS with both long rod and (ApcA2B3ApcD) trimer
Structure of the energy converting methyltransferase (Mtr) of Methanosarcina mazei in complex with a novel protein binder
Structure of the energy converting methyltransferase (Mtr) of Methanosarcina mazei in complex with a novel protein binder
Structure of the energy converting methyltransferase (Mtr) of Methanosarcina mazei in complex with a novel protein binder
Structure of the energy converting methyltransferase (Mtr) of Methanosarcina mazei in complex with a novel protein binder
Subtomogram averaging of HTLV-1 Gag capsid from immature particles
Microtubules in presence of Ndc80 and Ska, non-Ndc80 microtubule-bound molecules
Focused-Refinement Cryo-EM Map of the RAZR-ZFD Domain within the In Vitro Reconstituted RAZR.Gp77 Complex
Cryo-EM Structure of Gp77 within the In Vitro Reconstituted RAZR:GP77 Complex
Cryo-EM Map of the Copurified Ring-Activated Zinc-Finger RNase (RAZR, H154A) in Complex with Phage Protein Gp77
Cryo-EM map of the in vitro reconstituted RAZR:GP77 complex with AlphaFold-predicted models fitted into the density.
Cryo-EM structure of the dCas12f-gRNA-dsDNA complex (full R-Loop)
Cryo-EM structure of the dCas12f-gRNA-DNA complex (partial R-Loop)
Pol II-DSIF-SPT6-PAF1c-TFIIS-IWS1-ELOF1-LEDGF-nucleosome CDC73 local map J
Cryo-EM structure of active human green cone opsin in complex with chimeric G protein (miniGist)
Pol II-DSIF-SPT6-PAF1c-TFIIS-IWS1-ELOF1-LEDGF-nucleosome RPB5 local map I
Pol II-DSIF-SPT6-PAF1c-TFIIS-IWS1-ELOF1-LEDGF-nucleosome non-template DNA local map H
Pol II-DSIF-SPT6-PAF1c-TFIIS-IWS1-ELOF1-LEDGF-nucleosome CTR9+WDR61 local map K
Pol II-DSIF-SPT6-PAF1c-TFIIS-IWS1-ELOF1-LEDGF-nucleosome CTR9 TPR local map L
Pol II-DSIF-SPT6-PAF1c-TFIIS-IWS1-ELOF1-LEDGF-nucleosome PAF1+LEO1 local map M
Pol II-DSIF-SPT6-PAF1c-TFIIS-IWS1-ELOF1-LEDGF-nucleosome LEO1 C-terminus map N
Pol II-DSIF-SPT6-PAF1c-TFIIS-IWS1-ELOF1-LEDGF-nucleosome SPT6 local map O
Subtomogram average of influenza hemagglutinin (A/Puerto Rico/8/1934)
Mycobacterium smegmatis 70S ribosome with small molecule drug MK-7762
Putative ATP-bound class from combined 10, 20, 30 mM ATP datasets
Constricted-pore class from combined 0, 10, 20, 30 mM ATP datasets
Cryo-EM structure of SARS-CoV-2 KP.2 spike RBD in complex with ACE2
Cardiac lambda-6 light chain amyloid AL-224L single protofilament
Yeast 80S with nascent chain in complex with Ssb1-ADP in the S2 state
Human pannexin 1 channel from combined 0, 10, 20, 30 mM ATP datasets
Cryo-EM reconstruction of PI3KC3-C2 in complex with Rubicon Middle Region of C terminus, focused map 2/2
Cryo-EM reconstruction of PI3KC3-C2 in complex with Rubicon Middle Region of C terminus, composite map
Cryo-EM reconstruction of PI3KC3-C2 in complex with Rubicon Middle Region of C terminus, focused map 1/2
Cryo-EM structure of human PI3KC3-C2 in complex with Rubicon Middle Region of C terminus
Co-transcriptional histone H3K36 methylation complex containing RNA polymerase II elongation complex, Set2, and the upstream nucleosome. (temp115, type A)
Co-transcriptional histone H3K36 methylation complex containing RNA polymerase II elongation complex, Set2, and the upstream nucleosome. (temp115, type B)
Co-transcriptional histone H3K36 methylation complex containing RNA polymerase II elongation complex, Set2, and the upstream nucleosome. (temp130, type B)
Co-transcriptional histone H3K36 methylation complex containing RNA polymerase II elongation complex, Set2, and the upstream nucleosome. (temp130, type A)
The complex structure of 0086-0043 and NET determined with Cryo-EM.
The complex structure of Y510-9709 and NET determined with Cryo-EM
Co-transcriptional histone H3K36 methylation complex containing RNA polymerase II elongation complex, Set2, and the upstream nucleosome. (temp115, FACT-hexamer)
consensus map of Neddylated CUL5-ARIH2-L3A2-1 bound to ASB9-EloB/C-CKB
Structure of the Complement classical and lectin pathway C3 convertase
Consensus Map of Neddylated CUL5 C-terminal region-RBX2-ARIH2~L3A2-1~Ub
Structure of the Complement classical and lectin pathway proconvertase, C4b2
Structure of the Complement classical and lectin pathway C3 convertase in complex with substrate C3
Focus refined map of Neddylated CUL5 C-terminal region-RBX2-ARIH2~L3A2-1~Ub
Cerebellar GluA2/4 NTD heterophilic tetramer interface (focused refinement)
GluA4 N-terminal domain bound to nanobody NB74 (focused refinement)
Structure of Neddylated CUL5 C-terminal region-RBX2-ARIH2~L3A2-1~Ub
Structure of the Complement classical and lectin pathway proconvertase, C4b2
Focus refined map of Neddylated CUL5-ARIH2-L3A2-1 bound to ASB9-EloB/C-CKB, focus refined on ARIH2-L3A2-1
Cryo-EM map of focus refined ASB9-Elob/C-CKB bound to Nedd8-CUL5-RBX2-ARIH2-L3A2-1
P116 from Mycoplasma pneumoniae in complex with mild growth suppressor monoclonal antibody
cryoEM structure of the B-chain of the human OGA-L Catalytic Dimer
cryoEM structure of the A-chain of the human OGA-L Catalytic Dimer
cryoEM structure of the human OGA-L Catalytic Dimer, extra A-chain density
Low-resolution electron density map of C. elegans PEZO-1 Isoform L
Structure-Guided Design of Picomolar-level Macrocyclic TRPC5 Channel Inhibitors with Antidepressant Activity
Structure of the wild-type ABCC2 in Arabidopsis thaliana in the apo state
Cryo-EM structure of amyloid peptide-silk block protein fibril, Type 2
Structure of ABCC2(E1404Q) in Arabidopsis thaliana in the apo state
Structure of ABCC2(E1404Q) in Arabidopsis thaliana in the DNP-GS bound state
Structure of ABCC2(E1404Q) monomer in Arabidopsis thaliana in the DNP-GS bound state
Structure of ABCC2(E1404Q) in Arabidopsis thaliana in the ATP bound state
Structure of the wild-type ABCC2 dimer in Arabidopsis thaliana in the apo state
Cryo-EM structure of renal amyloid fibril from an immunoglobulin light chain amyloidosis patient in polymorph B
Cryo-EM structure of renal amyloid fibril from an immunoglobulin light chain amyloidosis patient in polymorph A
Consensus map of KATP channel in complex with centipede toxin SpTx1
C. thermocellum UvrA in complex with unmodified DNA and AMPPNP (ATP-bound conformation 1)
C. thermocellum UvrA in complex with unmodified DNA and AMPPNP (ATP-bound conformation 2)
C. thermocellum UvrA in complex with unmodified DNA and AMPPNP (basal conformation)
C. thermocellum UvrA in complex with DNA with a fluorescein modification (basal conformation)
C. thermocellum UvrA in complex with DNA with a fluorescein modification and AMPPNP (ATP-bound conformation 1)
C. thermocellum UvrA in complex with DNA with an abasic site (basal conformation)
C. thermocellum UvrA in complex with DNA with an abasic site and AMPPNP (ATP-bound conformation 2)
C. thermocellum UvrA in complex with DNA with an abasic site and AMPPNP (ATP-bound conformation 1)
C. thermocellum UvrA in complex with DNA with a fluorescein modification and ADP (basal conformation)
C. thermocellum UvrA (K647A) in complex with DNA with a fluorescein modification (basal conformation)
C. thermocellum UvrA (K39A) in complex with DNA with a fluorescein modification and AMPPNP (basal conformation)
C. thermocellum UvrA-UvrB in complex with DNA with a fluorescein modification and AMPPNP (composite map)
