PICR - Protein Identifier Cross-Reference Service
Project description
Each major protein database uses its own conventions when assigning protein identifiers.
Resolving the various, potentially unstable, identifiers that refer to identical proteins is a major
challenge. The problem commonly arises when unifying datasets that have been annotated with
proteins from multiple data sources, or when querying a data provider with one flavour of protein
identifier while the source database uses another.
The Protein Identifier Cross-Reference (PICR) service is a web application
that provides interactive and programmatic (SOAP and REST) access to a mapping algorithm. The algorithm
uses the UniProt Archive (UniParc) as a data warehouse to offer protein cross-references, based on 100%
sequence identity, to proteins from more than 84 distinct source databases loaded into UniParc. Mappings
can be limited by source database, taxonomic ID and activity status in the source database. In the
interactive interface, users can paste protein identifiers or sequences in FASTA format, or upload
files containing them, to obtain mappings. Search results can be viewed in simple or detailed HTML
tables, or downloaded as comma-separated values (CSV) or Microsoft Excel (XLS) files suitable for use
in a local database or a spreadsheet. For integrating PICR functionality into other applications, a
SOAP interface and a lightweight REST interface are available.
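The mapping idea described above can be illustrated with a small, self-contained Python sketch. It is not PICR's implementation and does not call the PICR SOAP or REST interfaces: the `Entry` record, the `build_index` and `cross_references` functions, the database names and all accessions and sequences are hypothetical, chosen only to show how cross-references based on 100% sequence identity can be filtered by source database, taxonomic ID and activity status.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Entry:
    """One protein record as a source database might report it (hypothetical shape)."""
    database: str
    accession: str
    sequence: str
    taxon_id: int
    active: bool


def build_index(entries):
    """Group entries by exact sequence, so identical sequences share one bucket."""
    index = defaultdict(list)
    for entry in entries:
        index[entry.sequence.upper()].append(entry)
    return index


def cross_references(index, sequence, databases=None, taxon_id=None, only_active=False):
    """Return entries whose sequence is identical to `sequence`, after optional filters."""
    hits = index.get(sequence.upper(), [])
    return [
        e for e in hits
        if (databases is None or e.database in databases)
        and (taxon_id is None or e.taxon_id == taxon_id)
        and (not only_active or e.active)
    ]


# Toy data: database names, accessions and sequences are made up for illustration.
ENTRIES = [
    Entry("DB_A", "A0001", "MKTAYIAK", 9606, True),
    Entry("DB_B", "B-77", "MKTAYIAK", 9606, True),
    Entry("DB_C", "C.12", "MKTAYIAK", 10090, False),
    Entry("DB_B", "B-78", "MKTAYIAR", 9606, True),
]
INDEX = build_index(ENTRIES)

if __name__ == "__main__":
    # All identifiers sharing the query sequence, restricted to active human entries.
    for hit in cross_references(INDEX, "MKTAYIAK", taxon_id=9606, only_active=True):
        print(hit.database, hit.accession)
```

Indexing by the full sequence keeps each lookup to a single dictionary access; a production system would more plausibly key on a checksum of the sequence, but that detail is an assumption and is left out of the sketch.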
Documentation:
- User guides and programmer documentation
- Publications
Downloads:
Mailing lists, bug reports
Acknowledgements:
PICR is supported through BBSRC iSPIDER.