 |
EBI PSI meeting October 2002
 |
Proteomics Standards Initiative |
|
Mass spectroscopy agenda
This is a preliminary, open agenda. Please send me any changes/additions.
- What is the potential use of a public archive of mass spectrometry data?
- Confirmation of hypothetical proteins in sequence databases
- Making raw data available for third-party analysis and verification of
experimental results.
Example: Links from identified 2D gel spot to original spectrum.
- Re-analysing existing spectra with new approaches, or on the basis of new data,
for example updated versions of sequence databases.
- Improvement of future identifications by spectra comparison for
defined samples?
- Provision of reference sets from defined samples?
Aim: Describe intended uses and define their requirements, possibly split into
consecutive development phases.
- What data should be stored?
- raw, instrument-dependent data
- instrument-independent data after noise reduction ("peak lists")
- interpreted data: Identifications (in which form)
- Scope of auxiliary data, for example source system: Should the auxiliary data allow to
recreate the experiment, or just
compare data from different experiments.
- Elements of auxiliary data:
- source system: organism, cellstage, ..
- machine type
- experimental setup
- data context: individual identifications of proteins versus
"proteome of cell X"
- Which existing systems/controlled vocabularies/ontologies can be used,
where do we need additional systems?
- Representation of proposed data format: XML/UML?
- Beyond data format: Data collection
- Database setup:
- central database,
- federated database with data exchange,
- federated database with common query interface?
- data exchange:
- flat/XML files
- Define availability levels:
- free
- free to end user (no redistribution)
- only "detail" access via tools
- private
- Data ownership, update policy.
- Quality control:
- accept only published data?
- Synchronised effort to create reference sets?
|
Contact
 |