Segmentation data model prototype
Segmentation is the decomposition of 3D volumes into regions that can be associated with defined objects. Following several consultations with the EM community (Patwardhan et al., 2012; Patwardhan et al., 2014; Patwardhan et al., 2017), the EMDB developed a prototype to explore supporting the deposition of volume segmentations with structured biological annotation which is here defined as the association of data with identifiers (e.g., accession codes from UniProt) and ontologies taken from well established bioinformatics resources. To our knowledge, none of the segmentation formats widely used in electron microscopy and related fields currently support structured biological annotation. Third party use of segmentations is further impeded by the prevalence of segmentation file formats and their lack of interoperability. EMDB therefore proposed an open segmentation file format called EMDB-SFF to capture basic segmentation data from application-specific segmentation file formats and provide the means for structured biological annotation. In this way, file formats like EMDB-SFF could not only enable depositions of segmentations but also act as a file interchange format between different applications and facilitate analysis of 3D reconstructions. Furthermore EMDB-SFF prototypes the description of multiple transforms for a segment, thus allowing a segment to be used to describe the placement of a sub-tomogram average onto a tomographic reconstruction.
Model
EMDB-SFF files have the follow features:
- Segmentation metadata:
- name
- version (of schema)
- details (free-form text)
- global external references, e.g. specimen scientific identifier
- bounding box
- primary descriptor contained i.e. one of ‘three_d_volume’, ‘mesh_list’, or ‘shape_primitive_list’ (see schema documentation)
- list of software used to create the segmentation (name, version, processing details)
- list of transforms referenced by segments e.g. transform to place the sub-tomogram average in the tomogram
- Hierarchical ordering of segments through the use of segment IDs and parent IDs;
- Four geometrical representations of segments (volumes, contours, meshes, shapes);
- Can store subtomogram averages and how they map into the parent tomogram through the use of transforms;
- List of associated external references per segment;
- List of associated complexes and macromolecules in a related EMDB entry
Each segment in a segmentation can consist of two types of descriptors:
- textual descriptors;
- geometric descriptors.
Textual descriptors consist of either free-form text or standardised terms. Standard terms should be provided from a [published] ontology or list of identifiers.
Geometric descriptors can take one or more of the following representations:
- ‘three_d_volume’ for 3D volumes;
- ‘mesh_list’ for lists of meshes each of which consists of a set of vertices and polygons;
- lists of shape primitives (ellipsoid, cuboid, cone, cylinder).
Documentation
Download
The current schema (version 0.8.0.dev1) is available here.
Documentation
Complete documentation of the schema is available here.
Auxiliary Tools
sfftk-rw
sfftk-rw is a Python toolkit for reading and writing EMDB-SFF files only. It is part of a family of tools designed to work with EMDB-SFF files.
sfftk-rw has the following utilities:
- convert - interconvert between XML, HDF5 and JSON file formats of the EMDB-SFF data model;
- view - view a file summary
The full documentation is available at readthedocs.
Download
The latest version runs only on Python 3 (version 0.7.1) and may be installed using pip install sfftk-rw. Alternatively, feel free to obtain the source code from Github.
sfftk
sfftk provides a shell command and a Python API to process EMDB-SFF files.
The following utilities are available using sfftk:
- convert - Conversion of application-specific segmentation file formats to EMDB-SFF. Currently, sfftk supports the following formats:
- AmiraMesh (.am)
- Amira HyperSurface (.surf)
- Segger (.seg)
- EMDB Map masks (.map)
- Stereolithography (.stl)
- IMOD (.mod)
- notes - Annotation of EMDB-SFF files.
- view - Brief summaries of segmentation files.
Read the full documentation here.
Download
The latest development version (version 0.5.5.dev1) of sfftk may be downloaded/installed from PyPI or the source may be obtained from GitHub.
Quick links
Recent Entries
(Show all)Capping protein bound to the barbed end of F-actin, actin local refinement
Capping protein bound to the barbed end of cofilactin, consensus refinement
Capping protein bound to the barbed end of cofilactin, co-bound with cofilin on the B-1 actin, consensus map
Capping protein bound to the barbed end of cofilactin, co-bound with cofilin on the B-1 actin, actin local refinement
Capping protein bound to the barbed end of cofilactin, co-bound with cofilin on the B-1 actin, capping protein local refinement
Capping protein bound to the barbed end of cofilactin, actin local refinement
Capping protein bound to the barbed end of cofilactin, barbed end local refinement
Capping protein bound to the barbed end of F-actin, Consensus refinement
Cryo-EM structure of a tmRNA-rescue complex imaged in a cell lysate
The SRA domain of human UHRF1 bound to a mononucleosome with hemimethylated DNA at superhelical location -6.2.
UHRF1 bound to a mononucleosome in its pre-active state, with the RING domain bound to the SRA domain.
Consensus map of human UHRF1 bound to mononucleosome with hemimethylated DNA at superhelical location -6.2.
The focused refinement map of the TTD-PHD domain of UHRF1 bound to a mononucleosome with hemimethylated DNA at superhelical location -6.2.
The RING domain of human UHRF1 bound to a mononucleosome in its pre-active state.
The activated state of human UHRF1 bound to a mononucleosome, with the finger loop ordered and linker 4 disordered.
Human UHRF1 bound to a mononucleosome with hemimethylated DNA at superhelical location -5.6.
Cryo-EM structure of 70S ribosome in the classical state (P/P, E/E) imaged in a cell lysate
Cryo-EM structure of 70S ribosome in the classical state (A/A, P/P, E,E) imaged in a cell lysate
Cryo-EM structure of 70S ribosome bound to RaiA imaged in a cell lysate
Human UHRF1 bound to a mononucleosome with hemimethylation at superhelical location -6.2 and a Histone H3K9me3 methylation mark.
Cryo-EM structure of 70S ribosome bound to EF-Tu imaged in a cell lysate
Human UHRF1 bound to a mononucleosome with a hemimethylation site in the linker DNA.
Cryo-EM structure of large ribosomal subunit (class B1) imaged in a cell lysate
Cryo-EM structure of large ribosomal subunit (class B2) imaged in a cell lysate
Cryo-EM structure of 30S ribosome in a closed conformation with tRNA bound imaged in a cell lysate
Cryo-EM structure of large ribosomal subunit (class A2) imaged in a cell lysate
Cryo-EM structure of 70S ribosome in the hybrid state (A/P*, P/E) imaged in a cell lysate
Cryo-EM structure of large ribosomal subunit (class C1) imaged in a cell lysate
Cryo-EM structure of large ribosomal subunit (class A1) imaged in a cell lysate
Cryo-EM structure of large ribosomal subunit (class 3) imaged in a cell lysate
Cryo-EM structure of 30S ribosome in a H44 inactive-dislodged state imaged in a cell lysate
Cryo-EM structure of large ribosomal subunit (class 5) imaged in a cell lysate
Cryo-EM structure of 30S ribosome in a H44 active state imaged in a cell lysate
Cryo-EM structure of 50S ribosome (class I) imaged in a cell lysate
Cryo-EM structure of 30S ribosome in a H44 inactive-unresolved state imaged in a cell lysate
Cryo-EM structure of large ribosomal subunit (class 2) imaged in a cell lysate
Cryo-EM structure of 30S ribosome with tRNA bound in an open conformation imaged in a cell lysate
Cryo-EM structure of large ribosomal subunit (class 4) imaged in a cell lysate
Cryo-EM structure of large ribosomal subunit (class 6) imaged in a cell lysate
Cryo-EM structure of the human P2X2 receptor in the apo closed state
Cryo-EM structure of the human P2X3 receptor in the camlipixant-bound inhibited state
Cryo-EM structure of the human P2X2/3 heteromer receptor bound to AF-219
Cryo-EM structure of the human P2X2/3 heteromer receptor - class 2 (2:1)
Cryo-EM structure of the human P2X2 receptor in the ATP-bound desensitised state
Cryo-EM structure of the human P2X2/3 heteromer receptor - class 1 (1:2)
The structure of Nav1.7 with veratridine standing near the IFM motif (site I)
Cryo-EM structure of the human P2X2 receptor in the ATP-bound open state
Prefusion stabilized spike glycoprotein of human coronavirus OC43.
Apoferritin from equine spleen, recovered from the synchrotron with 1 MGy X-ray dose
Apoferritin from equine spleen, recovered from the synchrotron with 0 MGy X-ray dose
Apoferritin from equine spleen, recovered from the synchrotron with 100 MGy X-ray dose
Structure of a membrane-bound inositol phosphorylceramide synthase and ceramide complex
Pointed end of Cofilin-2 Bound F-actin, Focused Refinement on Terminal Pointed End Actins
One CAP-1 Bound to the Pointed End of F-actin, Actin Local Refinement
Capping protein bound to the barbed end of F-actin, Barbed end local refinement
One CAP-1 Bound to the Pointed End of F-actin, HFD Local Refinement
One CAP-1 Bound to the Pointed End of Cofilin F-actin, PE local map
Two CAP-1 bound to the pointed end of cofilin F-actin, actin local refinement
Structure of a membrane-bound inositol phosphorylceramide synthase and Aureobasidin A complex
Two CAP-1 Bound to the Pointed End of F-actin, Actin Local Refinement
Pointed end of Cofilin-2 Bound F-actin, Focused Refinement on Actin Filament
One CAP-1 Bound to the Pointed End of Cofilin F-actin, Actin local refinement
Two CAP-1 bound to the pointed end of cofilin F-actin, pointed end local refinement
Two CAP-1 Bound to the Pointed End of F-actin, CAP1 HFD Local Refinement
Structural Insights into HIV-1 Vif-Mediated Ubiquitination and Degradation of APOBEC3H
13 protofilament P. falciparum paclitaxel stabilised GDP microtubule
Structural Insights into HIV-1 Vif-Mediated Ubiquitination and Degradation of APOBEC3H
Polyclonal immune complex of Fab from serum of animal 1 at week 35 binding H1 HA
Polyclonal immune complex of Fab from serum of animal 1 at week 23 binding H1 HA
Polyclonal immune complex of Fab from serum of animal 6 at week 33 binding H1 HA
Polyclonal immune complex of Fab from serum of animal 4 at week 35 binding H1 HA
Polyclonal immune complex of Fab from serum of animal 5 at week 35 binding H1 HA
Computationally optimized broadly reactive influenza B hemagglutinin BC2 bound by antibodies #46 and #3978
Computationally optimized broadly reactive influenza B hemagglutinin BC2 bound by antibody #3978
Polyclonal immune complex of Fab from serum of animal 4 at week 23 binding H1 HA
Cryo-EM structure of Serendipita indica sulfate transporter SiSulT in Apo state
Polyclonal immune complex of Fab from serum of animal 6 at week 23 binding H1 HA
cryo-EM structure of Serendipita indica sulfate transporter SiSulT in sulphate bound state
Polyclonal immune complex of Fab from serum of animal 6 at week 35 binding H1 HA
Computationally optimized broadly reactive influenza B hemagglutinin BC2 bound by antibodies #46 and #3978
Polyclonal immune complex of Fab from serum of animal 2 at week 35 binding H1 HA
Polyclonal immune complex of Fab from serum of animal 4 at week 33 binding H1 HA
Cryo-EM structure of Serendipita indica sulfate transporter SiSulT in oxlate bound state
Computationally optimized broadly reactive influenza B hemagglutinin BC2 bound by antibody #46
S305I Frontotemporal Lobar Degeneration (FTLD) type I tau filament
S305I Frontotemporal Lobar Degeneration (FTLD) type II tau filament
SARS-CoV-2 spike S2 subunit in complex with polyclonal Fabs (Apex-B epitope)
SARS-CoV-2 spike S2 subunit in complex with polyclonal Fabs (Apex-A epitope)
Tetrameric complexes of the Borna disease virus 1 nucleoprotein arranged in a parallel configuration
S-shaped octameric complex of the Borna disease virus 1 nucleoprotein
Heptameric complex of the Borna disease virus 1 nucleoprotein bound to RNA (mutant Arg341Ala)
Tetrameric complex of the Borna disease virus 1 nucleoprotein (mutant Arg341Ala)
Triangular complex of Borna disease virus 1 nucleoprotein with 12 subunits
Hexameric complex of the Borna disease virus 1 nucleoprotein bound to RNA (mutant Arg341Ala)
Tetrameric complexes of the Borna disease virus 1 nucleoprotein (mutant Arg341Ala) in an antiparallel configuration
S-shaped nonameric complex of Borna disease virus 1 nucleoprotein
Hexameric oval ring-like complex of Borna disease virus 1 nucleoprotein
Tetrameric complex (3D class 1) of the Borna disease virus 1 nucleoprotein
Tetrameric complex (3D class 2) of the Borna disease virus 1 nucleoprotein
Octameric ring-like complex of the Borna disease virus 1 nucleoprotein bound to RNA
Hexameric flat ring-like complex of the Borna disease virus 1 nucleoprotein
Asymmetric tetrameric complex of the Borna disease virus 1 nucleoprotein (mutant Arg341Ala)
Triangular complex of the Borna disease virus 1 nucleoprotein with 13 subunits
Heptameric ring-like complex of the Borna disease virus 1 nucleoprotein bound to RNA
Cryo-EM structure of SARS-CoV-2 PT Spike Protein complex with a potent broad-spectrum macrocyclic peptide inhibitor 6L3-3P11K
Hexameric oval ring-like complex of Borna disease virus 1 nucleoprotein with central extra density
Hexameric ring-like complex of the Borna disease virus 1 nucleoprotein bound to RNA
Hexameric flat ring-like complex of the Borna disease virus 1 nucleoprotein (mutant Lys164Ala)
Heptameric ring-like complex of the Borna disease virus 1 nucleoprotein (mutant Lys164Ala)
Tetrameric complexes of the Borna disease virus 1 nucleoprotein are arranged in an antiparallel configuration
Pentameric complex of Borna disease virus 1 nucleoprotein (mutant Lys164Ala)
Tetrameric complex of the Borna disease virus 1 nucleoprotein (mutant Lys164Ala)
Hexameric ring-like complex of the Borna disease virus 1 nucleoprotein (mutant Lys164Ala)
Heptameric flat ring-like complex of the Borna disease virus 1 nucleoprotein (mutant Lys164Ala)
Triangular hexameric complex of the Borna disease virus 1 nucleoprotein (mutant Lys164Ala)
Tetrameric complexes of the Borna disease virus 1 nucleoprotein (mutant Lys164Ala) in an antiparallel configuration
Structure of the Bacterial Ribosome with human tRNA Lys(mcm5s2U34) and mRNA(AAG)
S-shaped octameric complex of the Borna disease virus 1 nucleoprotein (mutant Lys164Ala)
Hexameric oval ring-like complex of the Borna disease virus 1 nucleoprotein (mutant Lys164Ala)
Structure of the Bacterial Ribosome with human tRNA Lys(mcm5h2U34) and mRNA(AAG)
Structure of the Bacterial Ribosome with human tRNA Lys(mcm5h2U34) and mRNA(AAA)
Structure of the Bacterial Ribosome with human tRNA Lys(mcm5s2U34) and mRNA(AAA)
A LINC complex lattice in the nuclear envelope of ejaculated human sperm
Plant chloroplast dicarboxylate transporter AtDiT2.1 bound with malate
Plant chloroplast dicarboxylate transporter AtDiT1 bound with 2-OG
Plant chloroplast dicarboxylate transporter AtDiT2.1 bound with Glu
Plant chloroplast dicarboxylate transporter AtDiT1 bound with OAA
Influenza A/H7N9 polymerase in complex with a 70-mer RNA template, in stalled elongation.
ADP-bound wild type Wzm-Wzt from Mycobacterium abscessus in nanodiscs
ATP-bound wild type Wzm-Wzt from Mycobacterium abscessus in nanodiscs
Nucleotide-free wild type Wzm-Wzt from Mycobacterium abscessus in LMNG
Influenza A/H7N9 polymerase in complex with a 70-mer template in stalled elongation with backtracking and stem.
ATP-bound E178Q Wzm-Wzt from Mycobacterium abscessus in nanodiscs
Nucleotide-free wild type Wzm-Wzt from Mycobacterium abscessus in nanodiscs
ArsB from L. ferriphilum bound to antimonite in inward-facing state (antiparallel dimer)
ArsB from L. ferriphilum in inward-facing state (antiparallel dimer)
ArsB from L. ferriphilum bound to arsenite in inward-facing state (parallel dimer)
Cryo-EM structure of the type I pilus from Escherichia Coli and the surrounding water network
Cryo-EM map of SARS-CoV-2 BA.2.75 Spike Protein complex with macrocyclic peptide 6L3 (All RBDs up)
Cryo-EM structure of the maize CER6-GL2 complex (inactive C222A mutant) in the presence of 28:0 CoA
Cryo-EM map of SARS-CoV-2 BA.2.75 Spike Protein complex with a macrocyclic peptide 6L3-3P11K (Two RBDs up)
Cryo-EM map of SARS-CoV-2 BA.2.75 Spike Protein (Three RBDs down)
Cryo-EM structure of SARS-CoV-2 BA.2.75 Spike Protein complex with a potent broad-spectrum macrocyclic peptide inhibitor 6L3-3P11K
The cryo-EM structure of in situ amplified (ISA) alpha-synuclein fibrils from DLB homogenate
Structure of the flagellar filament in short-length at 3.02 angstroms resolution
The cryo-EM structure of in situ amplified (ISA) alpha-synuclein fibrils from PD homogenate
Cryo-EM structure of DICER-D991G-H992G/26S-GU complex in dicing state
Cryo-EM structure of DICER with pre-mir-517a-GU in pre-dicing state
Hexamer Msp1 from S.cerevisiae (with a catalytic dead mutation) in complex with an unknown peptide substrate
Hexamer Msp1 from S.cerevisiae (with a catalytic dead mutation) in complex with an unknown peptide substrate
Hexamer Msp1 from S.cerevisiae (with a catalytic dead mutation) in complex with an unknown peptide substrate
Assembly intermediate of human mitochondrial ribosome small subunit bound to METTL15, RBFA, and mtIF2 (State M2.1)
Consensus map prior to focused analysis of the human LRP2 ectodomain
Assembly intermediate of human mitochondrial ribosome small subunit bound to METTL15 and RBFA (Inward conformation) (State M2)
Focused map on the arm 2 of the human LRP2 in complex with LRPAP1
Focused map on the lower body of the human LRP2 in complex with LRPAP1
Focused map of the upper body of the human LRP2 in complex with LRPAP1
Focused map on the Arm 1 of the human LRP2 in complex with LRPAP1
Further focused map on the Arm 1 of the human LRP2 in complex with LRPAP1
Focused map on the arm 2 and RAPd3 copy of the human LRP2 in complex with LRPAP1
CryoEM structure of the human LRP2 receptor ectodomain in complex with LRPAP1
Focused map on additional extra density at the level of the Arm 2 of the human LRP2 in complex with LRPAP1
CryoEM structure of apo Toxin B (TcdB) from Clostridioides difficile in the closed CROP state
CryoEM structure of Toxin B (TcdB) from clostridioides difficile complexed with methyl cholate
CryoEM structure of Toxin B (TcdB) from clostridioides difficile complexed with taurochenodeoxycholic acid (TCDCA)
Cryo-ET of FIB-milled synapse region between CD19+ NALM6 leukemia cells and T-cells expressing 8H8 CD19 CAR construct (sample3)
N1 neuraminidase of influenza A/Vietnam/1203/2004 H5N1 in complex with four FNI9 Fab molecules
CryoEM structure of apo Toxin B (TcdB) from Clostridioides difficile in the open CROP state
30S focus refined map of E.coli 70S ribosome complexed with P-site fMet-tRNAfMet and A-site R-beta(2)hydroxyBocK-tRNAPyl
50S focus refined map of E.coli 70S ribosome complexed with P-site fMet-tRNAfMet and A-site R-beta(2)hydroxyBocK-tRNAPyl
Transferrin Binding Protein A in complex with transferrin (iron bound in N lobe only)
Transferrin Binding Protein A in complex with transferrin binding protein B and transferrin (iron bound to N lobe only)
beta-barrel assembly machine from Escherichia coli in an late state of LptD assembly
