Beta Testing
The tool is being developed and currently in beta testing. Please read the provided documentation before using the tool. To provide feedback please use our Support Form with the subject Sequence Feature Annotation Tool.
Sequence Feature Annotation API Help Page
The IPD-KIR Database has developed a REST API to allow users to annotate sequences to support the submission of KIR data to the IPD-KIR Database.
API Query
Direct calls can be made to the sequence feature annotation API using the HTTP Post method. The API can be reached from:
| Entry Point | https://www.ebi.ac.uk/cgi-bin/ipd/py/sequence_feature_annotator/run.cgi/annotate_features |
|---|
The API takes the following parameters:
| Parameters | Description |
|---|---|
| sequence | Sequence to annotate. Can be genomic, coding or protein | locus | Must be a valid KIR locus | level | The type of sequence supplied, either protein (PROT), coding (CDS) or genomic (GEN) |
A worked example using the python requests package is provided using the genomic sequence of A*01:01:01:01:
import requests
url = 'https://www.ebi.ac.uk/cgi-bin/ipd/py/sequence_feature_annotator/run.cgi/annotate_features'
sequence = 'GTTCGGGAGGTTGGATCTCAGACGTGTTTTGAGTTGGTCATAGTGAAGGACACTAGGTGTCAAATTCTAGCGAGAACAATTTCCAGGAAGCCGTGTTCCGCTCTTGAGCGAGCACCCACTGGGCCTCATGCAAGGTAGAAAAAGCCTGCGTACGTCACCCTCCCATGATGTGGTCAACATGTAAACTGCATGGGCAGGGCGCCAAATAACATCCTGTGCGCTGCTGAGCTGAGCTCGGTCGCGGCTGCCTGTCTGCTCCGGCAGCACCATGTCGCTCTTGGTCGTCAGCATGGCGTGTGTTGGTGAGTCCTGGAAAGCAATAGAGGGAGGGAGTGAGGGGATGGAGATCTGGGCCCAGAGGTGGAGATATAGGCCTGGAGGTGGAGTTATGGGCCTGGAGTGGAGATCTGGGCCTGGAGTGGATATATGGGCCTAGAGATGGAGTGATGGGCCTAGAAGTGGAGATCTGGGCCCAGAGGTCGAGATATAGGCCTGGAGGTGGAGTGATGGGACTGTAGTGGAGATCTGGGCCTGGAGTGGAGATAGGAACCTGGAGGGGAGATAGGAACCTGGAGGGGAGATATGGGCCTGGAGGTGGAGATATGGGCCTGGAGTGGAGTCATGGGCCTGGAGGTGGAGTTACGGGCCTGCAGTAGAGATATGGGCCTGAAGTGGAGACATGGGCCTGGAGTGGAGATATGGGCCAGGAGTGGAGATATGGGCCTAGAGGTCGATATCTGGGCCTGGAGTGGAGATATGGGCCAGGAGTGGAGATATGGGCCTAGAGGTCGATATCTGGGCCTGGAGAGGAGATATGTGCCTAGGATGGAGATACGGGCCTGGGTGTGGAGATATGGGACTGGAGAGGATATATGGGCCTGGAGTGGAGATATGGGACTGGAGAGGAGATATGGACCTGGAGTGGAGATAAGGGCCTGGATTGGAGATATGGGCCCAGGGTGGAGATCTGAGCCTGGATTGGAGATATGGGCCTGGATTGGCGATATGGGCTTAGGGTGGAAATATCGGCCTGGAGTGGAGATATGGGCCTGGAGTGGAGATATGGGCTTGAGGTGGGGATATGGACCTGGAGGCTGGGTCTCTGCACAACCGACAGCCCTGTTCTTGGGTGCAGGTAGGCACTGAGGGTGAGTTTACCTTCAGCCCAGGAAGGGCCTGGCTACCAAGACTCACAGCCCAGTGGGGGCAGCAAGGGTGCCCTGGTTTGCCTGCAGATGGGTCATCCATCATGATCTTTCTTTCCAGGGTTCTTCTTGCTGCAGGGGGCCTGGCCACATGAGGGTGAGTCCTTCTCCAAACCTTCGGGTGTCATCTCCCCACATAAGAGGATTTTCCTGAAATGGGAGGGAAGTCCTGTCAGGGAGTCTCTCATAAACTAGGAAGAAGGGACCCTGGGGTGCTGGGCCCACATTTCTGACCTTGCCTCCCTGGCCTTTCATTCCCTTGGCAGAGTCAAGTTCTGTGGGGACCAGGGTTAGACTACGGTGCTCAAAGCTGGGGTGTGTGGTGGGGAAGTGGTAGGAACAGCAGATCCTCTGAGGACAAAGGTGTTACTCACACACTTCAGCGTTTCCATGACGGTAGGGGCTGCAGTGTGGCTGCTGTCATTCTACCAGAAGAGGTGGGAAAACCACAGCCATGGCCCTGACATTCCAATCCTCTGATGGGGACTCAGTTGTTTATTTTCGTTCAGGCATCGGCTGATATTCCATTCTCAAAGGACATGCCCTCCACCCCATGTCTACCCTGTGTTGTTTTATGTGAGTAATCTTACAGTATTAAAATCTAGTAGGAGTCTCTTACTCAGCACTTGCTCAAAGTTCTCAGCTGACACTTTTGTTGTAGGGAGACACCTTGTGTTTGCGGGATGGGTCCTTCCTTTAGCCCTGGGCACCAAGGTGTGATAGCAGCCATAGAAACTTGGAAAGCGAGGAGAATCTTCAGAGCACAGGGAGGGAGGGGCGGCTCCACATCCTCCTCTCTAAGGCGGTGCCTCCTTCTCCCCACGGTGGTCAGGACAAGCCCTTGCTGTCTGCCTGGCCAAGCCCTGTGGTGCCTCCAGGACATGTGATTCTTCAGTGTCATTCTTATCTTGGGTTTAACAACTTCAGTCTGTAAAAGGAAGATGGGGTGCCTGTCCCTGAGCTCTACAACATAATATTCTGGAACAGCCTTTTCATGGGCCCTGTGACCCCAGCACACGCAGGGACCTATACATGTCGGGGTTCACAACCACACTACCCCAGTGGGTGGTCGGCACCCAGCAACCCCCTGGAGATCACGGTCACAGGTCAGAGGGCTCCTGTCTGGGATTCTCCTTGTCCCACCTCCTGAATCCCAGAGCTCCTGGTGGGCGTGTCCTTGCGGGTCCCATCATGCAAGTCCTGACTGTATTTGGGGTAAAGGGGGATTGAATACAGGGAAATGGGTGCTGTGGTGGGAAGAATAATTGTCCCCAGTGATGACTACATTCTAATCCCTGGAGTCTGTGACTATTTATGATATAGGGGAAGGGACTGAAGGAGAAGATGGAGGTCAGGTTGTTGATGAGTTGACCTTGAGATGGGGAGACAGCCTGGACTGTCCTGATGGGCTCAGTGTAGTCACAGGGGTCCACATGAAAGGAGGAGGAAGAGGGGAGTGGGGATTACAGCAGCATAATGGGAGTCTCCATCAGCTTTGAAGGTGGAGGAAGTCCAGGAGCCATGAATGCAGGTGGCCTATAGAGGCTGGAAAAGTCAAGGAACTGATTCTCCTGAGTCTCCAGAGGGAACGAAGCCCTGCAGGTGCCTTGATTTTACCCACGACAAACAGGGTCCGATTTCTGTCTCCAGAATTGGAAGGGGTTAGTGTGCTCTCTCCTGGTGCCATGCTTCTGATAATTTTCTACAGCAGCAACAGGAAACCAACACTGGAACCCAGGTCAAGGACAAGTTAAGAAACAACACAAGGATAGCCAGGCATGGTGGCAGGTGCATGTAATCCTAGCGACTTGGGAGGCTGAGGGCAGGAGAATCACTTGAACCCAGGAGACAGAGGTTGCAGTGAGCCTAGACCACACCACTTCACTCCAGCCTGGGCAAAGGAGTGAGACTCTGTTGCCAAAATTAATTAATTAATTAAAGAAACCAAACAAGGAGAAGGTTGGCTACACTGAGATCAGCAAGGCTCAGATGATGATGCCACCACCAGGCTCCATCCACATAGGGAGCGGTTGATACTCCTCCAACCAGCACCAGGAGCCAGCCTATGGAAGCTGGCACTGGCATGGCAAGAGTGGCTCCCAGTCCCTACCAGGAACAGGGTGTGTGGCCACTGGTGCCTGCCTTACTGATCAGTTCATACCTCCTGCCAAGGATTCCAATTCGTCCAAAAGAGATTGAACCAGGCTGCTAAGAGCCTGGATGTGCAGCCTATCCTGGTTCCTCTTCCACCCCCACATAGACAGCAGGAAAGACATTAGTTCGAAATAGATACAACAGCCCAAGAGATGAGGCTGAGCCCAGCGGCAAGGGAATCAGAGGCTACTAGAGACAGAGGGACAGAGAAGAGTGAGGGAGACAGATGGAAGGACCTGCACCAGGAGTTATGGGCACAGAAAAGAACATGAAGACACAGAGAGGAAGGAGAGAGATAAGACACCAGGAAGGGGAAGCCTGACTCAATCCAGGTGCCATGGATGGGATGATAAAGAGAGACACCTTCTAAACTCACAACCTCTCTTCCTAGGAGTCCACAGAAAACCTTCCCTCCTGGCCCACCCAGGTCCCCTGGTGAAATCAGAAGAGACAGTCATCCTGCAATGTTGGTCAGATGTCATGTTTGAACACTTCCTTCTGCACAGAGAGGGGATGTTTAACGACACTTTGCGCCTCATTGGAGAACACCATGATGGGGTCTCCAAGGCCAACTTCTCCATCAGTCGCATGACGCAAGACCTGGCAGGGACCTACAGATGCTACGGTTCTGTTACTCACTCCCCCTATCAGGTGTCAGCTCCCAGTGACCCTCTGGACATCGTGATCATAGGTGAGAGTGTCCAGACTTTCTTCTCATTGTCATTGGGATGCAGAGTGAATGATCCAGGAATTGGAGACCCAGGTGGCTGTAAGGAAGATGAGCTTGGTATTCTTATGGAGAGAGACTGACTTGGTGAGGTCTGTGCCAACAGAGACAGAGAAACAGGAGACACAAGTAGAGACCAGGTGTCATAACAGAGAACAGACACAGGGGCCATACCGGGAGTTAGAAAAGACAGAAAGAGTTAAAGGAGACACACAGACAGACATGTCCCAGAGAGAGGTGTCCCTCCATGCTGACTTTGCTCAGAGACCTGGCACAGGTTAGAAGTTTCATTTCTGTTTTACCTCCACAAAGTGTTCTCTACCAGGAGAACCCAAGGACACCCATATTTCTGACCTGAGTTGGGCCCTGTGGCCTCAGGCCTTGTGGCACCTACAGATGCCATGTTTATTCTGACACCTCTGCCTTCCATGTAATGGAGAGTAATCGTCCCAGGATATCATGGCCCCACAACACCAACCCCTGTATGCTGTGTGAACTTGTAGTCTCCAGACTGGATTCTGAGGCTCATATTCCAAATAAGCCCACTTATGAGAGGATCAGTGAGAGGCACAGAGAGAAATCAGGGACACCAAAAAGCAAAGACATAAACACACAGAGAATGAGCCAGAGGAAGGAGATTGAGAGACTCACAGACACATAAAGAGAGAGAAAAGAGGGCAGAGAAGTGAGAATGATGGAAGGGAGCAGAGAAAAGCACTAAAATTAGACTCCTGAGGGAGAGGCACAAGGACATTGAAAGATGGAGATGTGGGGATGAATTGCAGAGATTCCAAAGAGAACTAGAGAGACCGAGAGGCAGAGCAAGACAGATGATAGATGGATAGATATAGATAGATGATAAATAGGTAGATGATAGATAATAGGTTATAGATACATAGATGATGATTGATTGATTCATTAATAGATGAGACATAGAGATGATGATGATGAAGACAGATAGATAGATAATACATAGAGATACAGAGGCAGACATAGAGAAATCATAGAGAGAGAGAGATGATACATAGATATAGATAATAGATGATTGATGGATAGATAGACAATTGATGGATAAATAGATGATATATAGATATAGATGACAGGTAGAGAATTTGTAGATAGGCACCGAATAGATAAATAGATAGATCGATAGATAATAGATAGAAATATGCAGAAAGTTATGAACAGGACACAAAGTGAGAAACTCAGAATTAAAAAAAGTAACATCAAGTCAACCAATCCAAGGAGAGTCAGAGAGAATAAAACAATCCAAAAAGAGAAAACATATCTAGAGGTGGGGAAGTGAGGTCAGAGACCTAGAGAGACAGAGAAGGTGGAAGGAGGAAATAGACATGAAGAGCGATGGGGTAGAGGGTGAGAGAGAGAGAGAGAGAGCATTAGGTCATAGAACAGGGGAGTGAGTTCTCAGCTCAGGTGAAGGGAGCTGTGACAAAGAAGATCCTCCCTGAGGAAACTGCCTCTTCTCCTTCCAGGTCTATATGAGAAACCTTCTCTCTCAGCCCAGCCGGGCCCCACGGTTCTGGCAGGAGAGAATGTGACCTTGTCCTGCAGCTCCCGGAGCTCCTATGACATGTACCATCTATCCAGGGAAGGGGAGGCCCATGAACGTAGGCTCCCTGCAGGGCCCAAGGTCAACGGAACATTCCAGGCTGACTTTCCTCTGGGCCCTGCCACCCACGGAGGGACCTACAGATGCTTCGGCTCTTTCCATGACTCTCCATACGAGTGGTCAAAGTCAAGTGACCCACTGCTTGTTTCTGTCACAGGTGAGGAAAGCCCATGGCTGTCCCATGTCCTATGATCCTAGAGCCTTAGCTGAGGAGCTTCCTGCTGAGGATGGAGAGAAGCATGGACAGATGCAGAGAGAAGACGCAGCCTCGGTGTGAGGGAGGGATCAGGGCACAGGATGGCCGACAGGGCACCTCCAAACCCTCCTACATGGCCTGCATGGAGGCCCACGGCCAGGGCTCCAGGCACCCAGGCAGATGGAGAAAGCGGTCAGGAGAGACCCAGAGGAGGGAGACTGGGCTCAGTTTGGGGAGATCAGAGGTTCCCTCAGCCCCTCAACCTTACCCATTTCCCAGAAGCCCATCCTGGCCTCTCACCCACACAGAGATGTCATCACCAGCAACCCCTACACCCTTTACTTTTCTTTGAAGAAATATTTATTGAGGATAAATATACCTATATAGCTTACCACTTTTAACATTTTTTTTTGAGGTGGAGTCTAGCTCTGTCCCCTATGATGGAGTGCAGTGGCACAATCTCAGCTCACTGCAACCTCCGCCTCCTGGGTTCAAGCGATTCTCCTGCCTCAGCCACCTGAGTAGCTAGTGCTACAGGCACGCACCACCACGCCAGGCTACTTTTTGTATTTTTAGTAGAGAGGTGGTTTCACCATGTTGGTCGAGCTGGTCTCGAACTCCTGACCACGTGATCCACCCGCATCAGCCTCCCAAAGTGCTGGGATTACAGGCATGGGCCACCAGGCCCAGCCACATTTACCATTTTTAAGTGTAAAGTCTAGTGGTCATAAATACATTTTTATATATATATATATATACATTTTTTTTACCCTCCACCCTTTTCTTCCTGTCCTCCAGTAGCCACCATTCTACTCTCTACCTTCATGAGATCCACCTTTTAGCTCCTGTATATGGGTGAGAAATGGGAATCTTTTTAATGACCTCCAGTTCCATCCATGTGGCTGCAAATGACAGGATGTTATTCTTTCTATGGATGAGTAGTCTCCACTGTGCGTATGTACTACATTCTCTCTATCCATTCACCCACTGATGGGCAGGTAGGTTGACTCCTCATCTTGGCTACTGTGAACAGTGCTGCACCAATCATACGAGTGCAGATATCACTTCGATATGTTGATTTACTTTCCTTTGGATATAAACCCAGTAGTGAAATTGCTGGATACTATGAAAGTTCTCTTTTTTTTTTTTTTTTCTTTTTTGAGAAAGAGTTTCCCTCCTTAGCCCAAGCTGGAGTCAAAGTGGTGCAACGTTGGCTCATTGCAACCTCCGCCTCCTGGGTTCAAATGATTTTCCTGCCTCAGCCTCCCTAGTAGCTGGGATTACAGGTGCACACCACCATGCCTGGCTACTTTTTGGTTTTTTTAGTATAGATGCGGTTTCCCCATGTTGGCTGGGCTGCTCTCAAACTCATGACCTCAACTGAGGTGCCCGCCTCAGTCTCCCAAAGTGCCGGGATTACAGGCATGATCCACCTCACCCAACCTCTTTTTAGTTCTTTAAAGGACTTCCATACTTTTCTCCGTAATGGCTGTACTAATTTACACTCCTCCCAACAGGGTACCAGGGTTCTCCTTTCTCTACCACCTTGCCAGCATTTCTTTTGCCTGTCTTGCAGCTAAAAGCCATTTTATTTTATTTCATTTTATTTTGAGATGGAGTTTTGCTCTTCTCACCCAGGCTGGAGTGCAGTGGCGCTATCTCGGCTCACCACAACCTCCACCTCCCAGGTTCAAGCGATTCTCCTGCCTCAGCCTCCCGAGTAGCTGGAATTACAGGCACACGCCACCACGCCCTACTAATTTTTGTATTTTTAGTAGAGACAGCGTTTCTCTATGTGGGTCAGACTGGTCTCAAACTCCCAACCTTATGAGATTCACCCACCTCAGGTTCTCAAAGTTCTAGGATGACACAAGTGAGCCACCTCACCCGGCCTAAAAGCCATTTTAATGGGGTGAGATGAAAACTCACTTTGATTTTAATTTGCGTTTCTCTGATGATGAGTGATACTGAGCACTTTTTCGTATGTGGGGAAATTTCATGTCTTTTGCTCCTTTTTCAATTAAATCATTTGTTTTATTGAGTTGTTTGAGCTTCTTATATTTCTAGTTATTAATCCCATCTCAGATGCATAGTTTGCACATATTTGCTCCCAATCTGTGGGTTGTCTCTTCACTTTGTTGGTTTATTTTTAGCAGTGCTGAAGTTGCTTAGTTTGAGGTAATCCCAATGGTCTATTTTTGCTTCGATTACTTGTGTTTTGAAGGTTTAAAACAAAATGTCTTCCTTCAGACAAACGTCCTGGAGCATTTCCCCAATATTTTGTTCTACGTGTTTCATAGGTTCAGGCCTTAGACTCACATCTTTAATCCATTTTCATTTGATTTTTGTGTATGGTGACAGGTAGAGTTGCAGTTTCATTCCTCTGCATGTAGATGTCCAGGTTTCCCTGCACTGTTTATTGAAAAGACTGTCCTTTCCTGATTGTGAGTTCTTGGCATCTTTGTCAAAGTCCATTGGATGGGCTGGGCTTGGTGGCTAACACCTGCAATTTCAGCACTTTGGGAGCCCGAGGTGGGTGGATCACCTGAGGCCAGGAGTTCAAGATTAGTCTGGCCAACGTGATGAAACATCGTCTCCACTAAAAATATAAAAATTAGCTGAGCATGGTGGTCAGCACCTGTAATACCACTACTCAGGAATTTGAGGCAAGAGAATGATTGAACCCAGGAGGCTGAGGTTGCAGTGAACCGAGATTGCACCTCTGCACTCCAGCCTGAGTGACAGAGCAAGACTCCATCTCAAAAGAAAAAATAAAAAACCATTGGATGTAAATGCATGGAATATATCTGTGTTATTCATTCTGCTCCGTTGTTCTATGTGCCTTTCTTTATGCCAATGTCATGCTATTTTGCTTACTACAGCTCTGTAACATATTTTGAGATCAGGTAGTGTGATGCTCCTGTTTTCTCTTTATATCTTGAAGTCTCAAGACAGTGGGTGTCATATAAAAAAATTATGGAAAAAAGGATCCCAGGACTCCCAGGGCCCAATATTAGATAAGAGAGTGTTGGCCATGAACCATCCTCAAAGATTTCCACTGAGTGGAGGACAGACACCCTCATTTCCTCACCTCTCTCCTGTCTCATGTTCTAGGAAACCCTTCAAATAGTTGGCCTTCACCCACTGAACCAAGCTCCAAAACCGGTGAGTACAGAACCCTCTTATATCCGCTTTTGGAACCCTGGGGAGGTGGGAACCTTGGATTCAGGCGTTGACTCAGCATCTCACAGCTCTGACATTGTACACTTGTCTTCCACCATCTCCGAACTCCAGATACTCCTACAGCGAAAGGGATCTGGGCCCAACACAGGGCTCAGTGAAATCTCTTCATCTCTCATTTTATGGAGCTGAGACCTCCTACAAGCTAGAAGAATGATTGCCAATCTGACATCCTTCTCAGGAAAAATGCAATGTTTGTTCTACCTGCATTCCTAACTGGAGGATAAATTCCTGGAGACTTGAGAGAGGGAAGGGAAGGGAACATCTGATGAGGGCAAGGTGTTTTAGAGAAGTTCCACTTGCCAAGGAATGAGCTCCTGTAGGTCATGAAGCAACCCTGGCTGACTCCGCAGAGAAAGAGCCTTGCCGTAACAGAGAACAGAGCTCATGCACGCACACTTCGACTCACTGACTCATTCAGCCACGGCCCCATGCTCAGGCTGTGCAGTGTGGAACCTTTTCCTATTGTTGCCATAACAAATTTCCACAAGATTCGTGGGTGAAAACAAAACGGTTTTTTAATTATCTTACAGTGCTGTAGCTCAAAGTAGGAAGTGCATCTTACTGGGCTAAAATCAAGGTGACAGCAAGGCTGCCTTCCCTCTGAGGATTCCAGGCACGAATCTGCTTCTCACTTGTCCCAGCTTCTAAAGGCTCCCAGTTCCTTGGCTCCTGGTCCCCTTCCTCCTTCCTCAAAGCCCACAAAGACTGGTCACATCTCACATGGCATCACTCAGTGCCTTCTTCCTTACCACACCTCTTTCTCTGAGTGCTGCTCTCCCTTCTTCCTCATCTTTTGAAAACTTGGGGATTCTATTGGGTTCACCAAGATGAAAATCCCTCATAATCTCCTGGAAATCATCCAGGATACCCTTGTTTTAAGTTCAGCTGATTAGCAACCATAATTCCATCTGCAATCTTCATTCCTCCTTTCCATGTAAAATAACATATTCACAAGCTGTGGAGGCTAGGACAGGGACATTTTGGGGTGGGACAGCATTCTCCTGCCTTCCACAAACAGTGAACAAGATGCATTTGGCCTCTGCCCTTGGGACACTGATATTGCAGATGGTTAAATGGGAGGGCAGAAAATGAACGCACAAGTGGATCTATAAATGAATGGTCCATTGGGAAGCATCTGTGCATGAAATCTATTTTTTGTTTGTTCTTTTGTTTATTGAGACAGAGTCGCCCTCTGTCTTCCAGGCTACAGTGCAGTGTCACGATCTTGGCTCACTGCAACCTGCGTCTCCTGGATTCAAGTGATTCTCCTGCCTCCGCCTCTCGAGTAGCTGGGATTACAGGCAACTGCCACCGTGCCCGGCTAATTCTTTTTGTATATTTTTTGTAGAGAGGATGTTTCACCACGTTGGCCAAGCTTGTCTGAAACTCCCAACCTCAAGTGATCCGACCGTCTCAGCATGCCAAAGTAATGGGACTACAGGCGTGAGCCACTGTGCCCAGCCAGAATTCAAAATCAATAATAGATAATGCTGAGTGTATGATTTCAGGTGACAAAGAAGGTCTCACTATTCAGATATTTGTGACATTAATGAAAAACACGGAATGAACCCCTGAAAGATTGGCGGAAGGATTTTGCACACACAGCTGTCAGCCATGAAGGCACAAAGGTGAAAACAATCTGATGTGGAAGGAAGAGGCTCTGCCTCAAATGCTGGGAATGAGGTGGGGAGAATGACAAGACGACTGTAGAGAGACGGAGAGCACACTGGGTACACAGGAAACTAAGGAGGAACAAGGAGTGTGTGTTTGACACTCACAGCCATTGGATTCACCTCGGGGTAACCAGGAATCCCTACATGATTAATATGACTGACATGAAAATAAGGGAGGCCCAGGTGCATAACTGGAATCTAGGAGACCGTGGAAAAGGCAATTGCCGCCCCACTGGTGAAATGTGGTGCTGATTTAGACACTAAATGAATGAAGTAGATGGATATAAGATATGTTTGTGAGGTAGAATCATTGACTGGAAAGGCTTACTGGGTTTGATTTTCCTACTTGTTTAATCCTCGCTTAATTAATTTCTTTCTGAGATTTATTCATCCTACACATAAATCAATACCTGGCAAAGGAGTGACAGATATATGAGTGGTGGTGGAAATGAAGAGACTTATTATAGCATAATATACAAGTCTGTGAACAGTGGCTCACGCCTGTAACCTAGCACTGCAGGAGGCCAAGGTGGGTGGATTCCATGAAGTCAGGAGTTCCAGACCAGCCTGGCCAACGTGGTGAAACCCTATCTCTACTAAAAATACAAAAATTAGCCGAGCACGATGGTGCATCCCTGTAATCCCAGCTCCTATTCTGGAGGATGAAGCAGGAGAATGACTTCAACCCAGTAGGTGGAGGTTGCAGTGAGTGGAGATTGCATCACTGCACTCCAGCCTGGGGGACACAAGGAGACTCTATCTCAAAAAATAAAAATAAGAAATACATAAATATAATAAAACACACACGAATGACAAAGGCACCTGAATTCCAATCATGATTTTTCTATTTCTCTATAATTACTTCTTTGATCCTTTATCTTATCCATTAGGCAATGAGCCTAAAACCTCTTCCCTATTTGGCTTTCTGTGAGCATGAGATCATATAGAAAATGTGAAAGCCCGCTGAATCCTCCAGCACAGATCCTGGAATAGAGAAAGTGCTCTGGTCATCACAAAAAAAACTTGCCCACTCACCCAAATCCCCCACCTCACCCCTACTTCCAATCACCTGTGGAGATTCAGATAGACCATGGGGAGGTAAACATTAACACTCCTTGGAGTGAGTCCAGATCTTGGAATCAGAGATCAGTGCCAGCACTAGCTCCTGCTCCCCTTTCCTACTAATTCACAGGAGGACAGGTGGTTTTGAAGCAATAGATGGCCGAGGGGGTGGTCCTTCCCCCAGCCTCTCGGGTAGAACAGCAGCCTAATATGTGTCTCCCGAGATCACAAAGAGCAGCAGGTTTCACACGGGCTTCAACACTATTTCCTGGCCGTTTGACATAAGAGAATTCTATTTCGCTTTTTTTATCTTGATTTCACTTTTATTTTCTTTCCTTGGAGAATGCAAGTTGTTTGATTCAAGAATGCTGTGGATGTAGAAACCCTAAAGCACATTCGCTGTGAATCAATCCCAGTCCAGTCTTCCCAGAGAAGACTCTAAACACCTCCTGGACTGCACCTGGGCCTATGCCAATTCCTATCACTCACCGTCACTCCAGGGAGACAGAACACACAGAGAATACGTTACATAGGCAGGTTCATTACTAACAGATAAGCAGCGAGTGACAACAGAAACCTATATTTCAATGTGACCCAGTCCCTCAAGGCTCAGAAAAGCTCCTCGGGACATATGGAGTCACCCCATTTGCAGTGTAGCTGCGGGAAGCCAGAAAGCAGCCCAGCCTGGGTTTTGTACCCTGGAGCCACAGGAAGCACTCAGCTAAAGCACTGCATGAGGTCCTCCAGGAAGAACAGGAAGACAGCCCAGGGTGTTCTGAGACGTTCCTCCTGATCTCAGGAAGTTGCTGTCTTAGGCCATTTTTGTTGCTCTAAAGGAACACTTGAGCCTCGGTAACTTCTAAAGAAAAGAGATTGGTTTGCCTCACCGTTCTGCAGGCTGTACTGGAAGCATGGCACCAGCATCTATTTCTCGTGACGGCCTCAGGCTGCTCCCACTCTGGCAGAAGGGAAGGAGGGTCTGTCTGTGCAGAGACCACAGAGATCACACGGCAAGAGAGGGAGCAAGGGGGAGGGGGAGTGATGGAGCTTCCAAGCTCTTTTTAACAACCAGCTCTCCGGGAACTAATAGAGGGGGAACTTGCTAACCCCGTCTCCTTGGGACAGCATTGATGTGTTCATGATGGATCCACCTCCATGACCCAAACACCTCTCAAGAGGCCCAACCTCCCACAGTGGGGGTGAAATTTCAATGTGAGGTTTGAAGGGGTCAAACATCTCAACTAAAGTAGTCGTATCCTCAGCACGTTCTATGGTTACTATGAGAGCTATAACTGAAAAAGCAGGAGAAAGCTGGGTCTCCTGCCATCTGGGTGCTTGTCCTAAAGAGGTGTTTTATGTGGTTACCTGTCAATCAAGAAATGCGAGACAATTCATAAAGAGGAACTGCTAAGATTAGCTTCTTATTGGTGTCTCATCTTCTTCCAGGTAACCCCCGACACCTGCACATTCTGATTGGGACCTCAGTGGTCATCATCCTCTTCATCCTCCTCTTCTTTCTCCTTCATCGCTGGTGCTCCAACAAAAAAAGTAAGTCTCATGAAGCAGAGGCCAGAGAGCTCAGGGCCATGTGGGGAAGCAGGATGGGAGCACTCAGGTGTGTGTTCCTCACAAACAGGATGGTCCCTGGCCCAAGGCAGCAGCCACAGAGGCAGGACTTTCTAGAGAGGGCACCAGACTCCCTGTCCCTGCCTTCAACTCACAGACCGTTGCCTGATTCTGAACTGTATCCTCATGTCCCCTGCAGCCACTCACATCCAGGAGAAGGTTCCATGACAGGCAGAAAGTGGGAGACAGAATCAATGGGATGGGAACTCAGAGCTATTCATGGGATGGGTCCTTGAGCTCAGAGAGATAGAATGTCTGAGTCTGCTGTTGGCAACTGAGGGACCTCAGCCACCTATGGTCTCCCCCTGTATGTTGGTATCTGCTTATGAAATGAGGACCCAGAAGTGCCCTCCGAGCTGTTTTGTTGACTTCCGTCTTCTACAGATGCTGCGGTAATGGACCAAGAGTCTGCAGGAAACAGAACAGCGAATAGCGAGGTAGGTACTCCTCGGCCCGGGCTCGTGGCTACTGTTATTCCCAAAGAGTCCTGGAAAATGTGAGCACCCTCCCTCACTCAGCATTTCCCTCTCTCCAGGACTCTGATGAACAAGACCCTCAGGAGGTGACATACACACAGTTGAATCACTGCGTTTTCACACAGAGAAAAATCACTCGCCCTTCTCAGAGGCCCAAGACACCCCCAACAGATATCATCGTGTACACGGAACTTCCAAATGCTGAGTCCAGATCCAAAGTTGTCTCCTGCCCATGAGCACCACAGTCAGGCCTTGAGGGCGTCTTCTAGGGAGACAACAGCCCTGTCTCAAAACCGGGTTGCCAGCTCCCATGTACCAGCAGCTGGAATCTGAAGGCATGAGTCTGCATCTTAGGGCATCGATCTTCCTCACACCACAAATCTGAATGTGCCTCTCACTTGCTTACAAATGTCTAAGGTCCCCACTGCCTGCTGGAGAAAAAACACACTCCTTTGCTTAGCCCACAATTCTCCATTTCACTTGACCCCTGCCCACCTCTCCAACCTAACTGGCTTACTTCCTAGTCTACTTGAGGCTGCAATCACACTGAGGAACTCACAATTCCAAACATACAAGAGGCTCCCTCTTAACGCAGCACTTAGACACGTGTTGTTCCACCTTCCCTCATGCTGTTCCACCTCCCCTCAGACTAGCTTTCAGTCTTCTGTCAGCAGTAAAACTTATATATTTTTTAAAATAACTTCAATGTAGTTTTCCATCCTTCAAATAAACATGTCTGCCCCCAT'
level = 'GEN'
locus = '2DL1'
response = requests.post(url, data={'locus': locus, 'level': level, 'sequence': sequence})
API Results
The API uses an alignment based approach to annotate the exon, intron boundaries of a sequence provided. The results contain the sequence and features of all relevant types in a nested fashion:
- genomic - provides exon intron boundaries, coding sequence and features and translated protein
- coding - generates exon boundaries and translates to protein
- protein - provides signal peptide and mature protein features
The results return in JSON format, with the following keys and values:
| Field | Description |
|---|---|
| feature | Contains all feature information with subkeys genomic, coding and protein |
| feature.coding | A list of coding features, each containing length, number, partial, start, and type. |
| feature.genomic | A list of genomic features, each containing length, number, partial, start, and type. |
| feature.protein | A list of protein features, each containing length, number, partial, start, and type. |
| comments | An optional string that can be atatched to any feature. Contains information about complex annotation, or annotation that should be reviewed. |
| sequence | An object containing sequences for coding, genomic, and protein, along with the codon start for coding. |
| sequence.coding | A string containing the coding sequence. |
| sequence.codon_start | An integer indicating the start position for translating the coding sequence into protein. |
| sequence.genomic | A string containing the genomic sequence. |
| sequence.protein | A string containing the protein sequence. |
Each feature contains the following values:
| Field | Description |
|---|---|
| length | The length of the feature. Will be absent if a feature has no length. |
| number | The number of the feature |
| partial | Sequence stops partway through feature |
| start | The start position of the feature in the sequence. Will be absent if a feature has no length. |
| type | The type of feature, exon/intron/utr |
| comments | When included contains important information on the annotation of the feature, including presence of splice mutations, absent features |
The result of the worked example above for A*01:01:01:01 is given below:
{
"feature":{
"coding":[
{"length":73, "number":"1", "partial":false, "start":1,"type":"exon"},
{"length":270,"number":"2","partial":false,"start":74,"type":"exon"},
{"length":276,"number":"3","partial":false,"start":344,"type":"exon"},
{"length":276,"number":"4","partial":false,"start":620,"type":"exon"},
{"length":117,"number":"5","partial":false,"start":896,"type":"exon"},
{"length":33,"number":"6","partial":false,"start":1013,"type":"exon"},
{"length":48,"number":"7","partial":false,"start":1046,"type":"exon"},
{"length":5,"number":"8","partial":false,"start":1094,"type":"exon"}
],
"genomic":[
{"length":300,"number":"5","partial":false,"start":1,"type":"utr"},
{"length":73,"number":"1","partial":false,"start":301,"type":"exon"},
{"length":130,"number":"1","partial":false,"start":374,"type":"intron"},
{"length":270,"number":"2","partial":false,"start":504,"type":"exon"},
{"length":241,"number":"2","partial":false,"start":774,"type":"intron"},
{"length":276,"number":"3","partial":false,"start":1015,"type":"exon"},
{"length":579,"number":"3","partial":false,"start":1291,"type":"intron"},
{"length":276,"number":"4","partial":false,"start":1870,"type":"exon"},
{"length":102,"number":"4","partial":false,"start":2146,"type":"intron"},
{"length":117,"number":"5","partial":false,"start":2248,"type":"exon"},
{"length":442,"number":"5","partial":false,"start":2365,"type":"intron"},
{"length":33,"number":"6","partial":false,"start":2807,"type":"exon"},
{"length":142,"number":"6","partial":false,"start":2840,"type":"intron"},
{"length":48,"number":"7","partial":false,"start":2982,"type":"exon"},
{"length":169,"number":"7","partial":false,"start":3030,"type":"intron"},
{"length":5,"number":"8","partial":false,"start":3199,"type":"exon"},
{"length":300,"number":"3","partial":false,"start":3204,"type":"utr"}
],
"protein":[
{"length":24,"number":null,"partial":false,"start":1,"type":"signal"},
{"length":342,"number":null,"partial":false,"start":25,"type":"mature"}
]
},
"sequence":{
"coding":"ATGGCCGTCATGGCGCCCCGAACCCTCCTCCTGCTACTCTCGGGGGCCCTGGCCCTGACCCAGACCTGGGCGGGCTCCCACTCCATGAGGTATTTCTTCACATCCGTGTCCCGGCCCGGCCGCGGGGAGCCCCGCTTCATCGCCGTGGGCTACGTGGACGACACGCAGTTCGTGCGGTTCGACAGCGACGCCGCGAGCCAGAAGATGGAGCCGCGGGCGCCGTGGATAGAGCAGGAGGGGCCGGAGTATTGGGACCAGGAGACACGGAATATGAAGGCCCACTCACAGACTGACCGAGCGAACCTGGGGACCCTGCGCGGCTACTACAACCAGAGCGAGGACGGTTCTCACACCATCCAGATAATGTATGGCTGCGACGTGGGGCCGGACGGGCGCTTCCTCCGCGGGTACCGGCAGGACGCCTACGACGGCAAGGATTACATCGCCCTGAACGAGGACCTGCGCTCTTGGACCGCGGCGGACATGGCAGCTCAGATCACCAAGCGCAAGTGGGAGGCGGTCCATGCGGCGGAGCAGCGGAGAGTCTACCTGGAGGGCCGGTGCGTGGACGGGCTCCGCAGATACCTGGAGAACGGGAAGGAGACGCTGCAGCGCACGGACCCCCCCAAGACACATATGACCCACCACCCCATCTCTGACCATGAGGCCACCCTGAGGTGCTGGGCCCTGGGCTTCTACCCTGCGGAGATCACACTGACCTGGCAGCGGGATGGGGAGGACCAGACCCAGGACACGGAGCTCGTGGAGACCAGGCCTGCAGGGGATGGAACCTTCCAGAAGTGGGCGGCTGTGGTGGTGCCTTCTGGAGAGGAGCAGAGATACACCTGCCATGTGCAGCATGAGGGTCTGCCCAAGCCCCTCACCCTGAGATGGGAGCTGTCTTCCCAGCCCACCATCCCCATCGTGGGCATCATTGCTGGCCTGGTTCTCCTTGGAGCTGTGATCACTGGAGCTGTGGTCGCTGCCGTGATGTGGAGGAGGAAGAGCTCAGATAGAAAAGGAGGGAGTTACACTCAGGCTGCAAGCAGTGACAGTGCCCAGGGCTCTGATGTGTCTCTCACAGCTTGTAAAGTGTGA",
"codon_start":1,
"genomic":"CAGGAGCAGAGGGGTCAGGGCGAAGTCCCAGGGCCCCAGGCGTGGCTCTCAGGGTCTCAGGCCCCGAAGGCGGTGTATGGATTGGGGAGTCCCAGCCTTGGGGATTCCCCAACTCCGCAGTTTCTTTTCTCCCTCTCCCAACCTACGTAGGGTCCTTCATCCTGGATACTCACGACGCGGACCCAGTTCTCACTCCCATTGGGTGTCGGGTTTCCAGAGAAGCCAATCAGTGTCGTCGCGGTCGCTGTTCTAAAGTCCGCACGCACCCACCGGGACTCAGATTCTCCCCAGACGCCGAGGATGGCCGTCATGGCGCCCCGAACCCTCCTCCTGCTACTCTCGGGGGCCCTGGCCCTGACCCAGACCTGGGCGGGTGAGTGCGGGGTCGGGAGGGAAACCGCCTCTGCGGGGAGAAGCAAGGGGCCCTCCTGGCGGGGGCGCAGGACCGGGGGAGCCGCGCCGGGAGGAGGGTCGGGCAGGTCTCAGCCACTGCTCGCCCCCAGGCTCCCACTCCATGAGGTATTTCTTCACATCCGTGTCCCGGCCCGGCCGCGGGGAGCCCCGCTTCATCGCCGTGGGCTACGTGGACGACACGCAGTTCGTGCGGTTCGACAGCGACGCCGCGAGCCAGAAGATGGAGCCGCGGGCGCCGTGGATAGAGCAGGAGGGGCCGGAGTATTGGGACCAGGAGACACGGAATATGAAGGCCCACTCACAGACTGACCGAGCGAACCTGGGGACCCTGCGCGGCTACTACAACCAGAGCGAGGACGGTGAGTGACCCCGGCCCGGGGCGCAGGTCACGACCCCTCATCCCCCACGGACGGGCCAGGTCGCCCACAGTCTCCGGGTCCGAGATCCACCCCGAAGCCGCGGGACTCCGAGACCCTTGTCCCGGGAGAGGCCCAGGCGCCTTTACCCGGTTTCATTTTCAGTTTAGGCCAAAAATCCCCCCGGGTTGGTCGGGGCGGGGCGGGGCTCGGGGGACTGGGCTGACCGCGGGGTCGGGGCCAGGTTCTCACACCATCCAGATAATGTATGGCTGCGACGTGGGGCCGGACGGGCGCTTCCTCCGCGGGTACCGGCAGGACGCCTACGACGGCAAGGATTACATCGCCCTGAACGAGGACCTGCGCTCTTGGACCGCGGCGGACATGGCAGCTCAGATCACCAAGCGCAAGTGGGAGGCGGTCCATGCGGCGGAGCAGCGGAGAGTCTACCTGGAGGGCCGGTGCGTGGACGGGCTCCGCAGATACCTGGAGAACGGGAAGGAGACGCTGCAGCGCACGGGTACCAGGGGCCACGGGGCGCCTCCCTGATCGCCTATAGATCTCCCGGGCTGGCCTCCCACAAGGAGGGGAGACAATTGGGACCAACACTAGAATATCACCCTCCCTCTGGTCCTGAGGGAGAGGAATCCTCCTGGGTTTCCAGATCCTGTACCAGAGAGTGACTCTGAGGTTCCGCCCTGCTCTCTGACACAATTAAGGGATAAAATCTCTGAAGGAGTGACGGGAAGACGATCCCTCGAATACTGATGAGTGGTTCCCTTTGACACCGGCAGCAGCCTTGGGCCCGTGACTTTTCCTCTCAGGCCTTGTTCTCTGCTTCACACTCAATGTGTGTGGGGGTCTGAGTCCAGCACTTCTGAGTCTCTCAGCCTCCACTCAGGTCAGGACCAGAAGTCGCTGTTCCCTTCTCAGGGAATAGAAGATTATCCCAGGTGCCTGTGTCCAGGCTGGTGTCTGGGTTCTGTGCTCTCTTCCCCATCCCGGGTGTCCTGTCCATTCTCAAGATGGCCACATGCGTGCTGGTGGAGTGTCCCATGACAGATGCAAAATGCCTGAATTTTCTGACTCTTCCCGTCAGACCCCCCCAAGACACATATGACCCACCACCCCATCTCTGACCATGAGGCCACCCTGAGGTGCTGGGCCCTGGGCTTCTACCCTGCGGAGATCACACTGACCTGGCAGCGGGATGGGGAGGACCAGACCCAGGACACGGAGCTCGTGGAGACCAGGCCTGCAGGGGATGGAACCTTCCAGAAGTGGGCGGCTGTGGTGGTGCCTTCTGGAGAGGAGCAGAGATACACCTGCCATGTGCAGCATGAGGGTCTGCCCAAGCCCCTCACCCTGAGATGGGGTAAGGAGGGAGATGGGGGTGTCATGTCTCTTAGGGAAAGCAGGAGCCTCTCTGGAGACCTTTAGCAGGGTCAGGGCCCCTCACCTTCCCCTCTTTTCCCAGAGCTGTCTTCCCAGCCCACCATCCCCATCGTGGGCATCATTGCTGGCCTGGTTCTCCTTGGAGCTGTGATCACTGGAGCTGTGGTCGCTGCCGTGATGTGGAGGAGGAAGAGCTCAGGTGGAGAAGGGGTGAAGGGTGGGGTCTGAGATTTCTTGTCTCACTGAGGGTTCCAAGCCCCAGCTAGAAATGTGCCCTGTCTCATTACTGGGAAGCACCTTCCACAATCATGGGCCGACCCAGCCTGGGCCCTGTGTGCCAGCACTTACTCTTTTGTAAAGCACCTGTTAAAATGAAGGACAGATTTATCACCTTGATTACGGCGGTGATGGGACCTGATCCCAGCAGTCACAAGTCACAGGGGAAGGTCCCTGAGGACAGACCTCAGGAGGGCTATTGGTCCAGGACCCACACCTGCTTTCTTCATGTTTCCTGATCCCGCCCTGGGTCTGCAGTCACACATTTCTGGAAACTTCTCTGGGGTCCAAGACTAGGAGGTTCCTCTAGGACCTTAAGGCCCTGGCTCCTTTCTGGTATCTCACAGGACATTTTCTTCCCACAGATAGAAAAGGAGGGAGTTACACTCAGGCTGCAAGTAAGTATGAAGGAGGCTGATGCCTGAGGTCCTTGGGATATTGTGTTTGGGAGCCCATGGGGGAGCTCACCCACCCCACAATTCCTCCTCTAGCCACATCTTCTGTGGGATCTGACCAGGTTCTGTTTTTGTTCTACCCCAGGCAGTGACAGTGCCCAGGGCTCTGATGTGTCTCTCACAGCTTGTAAAGGTGAGAGCTTGGAGGGCCTGATGTGTGTTGGGTGTTGGGTGGAACAGTGGACACAGCTGTGCTATGGGGTTTCTTTGCGTTGGATGTATTGAGCATGCGATGGGCTGTTTAAGGTGTGACCCCTCACTGTGATGGATATGAATTTGTTCATGAATATTTTTTTCTATAGTGTGAGACAGCTGCCTTGTGTGGGACTGAGAGGCAAGAGTTGTTCCTGCCCTTCCCTTTGTGACTTGAAGAACCCTGACTTTGTTTCTGCAAAGGCACCTGCATGTGTCTGTGTTCGTGTAGGCATAATGTGAGGAGGTGGGGAGAGCACCCCACCCCCATGTCCACCATGACCCTCTTCCCACGCTGACCTGTGCTCCCTCTCCAATCATCTTTCCTGTTCCAGAGAGGTGGGGCTGAGGTGTCTCCATCTCTGTCTCAACTTCATGGTGCACTGAGCTGTAACTTCTTCCTTCCCTATTAAAA",
"protein":"MAVMAPRTLLLLLSGALALTQTWAGSHSMRYFFTSVSRPGRGEPRFIAVGYVDDTQFVRFDSDAASQKMEPRAPWIEQEGPEYWDQETRNMKAHSQTDRANLGTLRGYYNQSEDGSHTIQIMYGCDVGPDGRFLRGYRQDAYDGKDYIALNEDLRSWTAADMAAQITKRKWEAVHAAEQRRVYLEGRCVDGLRRYLENGKETLQRTDPPKTHMTHHPISDHEATLRCWALGFYPAEITLTWQRDGEDQTQDTELVETRPAGDGTFQKWAAVVVPSGEEQRYTCHVQHEGLPKPLTLRWELSSQPTIPIVGIIAGLVLLGAVITGAVVAAVMWRRKSSDRKGGSYTQAASSDSAQGSDVSLTACKVX"
}
}