ID   M21540; SV 1; linear; genomic DNA; STD; HUM; 4944 BP.
XX
AC   M21540;
XX
DT   22-APR-1989 (Rel. 19, Created)
DT   14-NOV-2006 (Rel. 89, Last updated, Version 7)
XX
DE   Human alpha-1-acid glycoprotein 2 (AGP2) gene, complete cds.
XX
KW   alpha-1 acid glycoprotein; orosomucoid.
XX
OS   Homo sapiens (human)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC   Homo.
XX
RN   [1]
RP   1-4944
RX   DOI; 10.1016/0378-1119(88)90228-4.
RX   PUBMED; 2970990.
RA   Merritt C.M., Board P.G.;
RT   "Structure and characterisation of a duplicated human alpha 1 acid
RT   glycoprotein gene";
RL   Gene 66(1):97-106(1988).
XX
DR   MD5; 5cfd65eccccb518c147111ffe635792b.
DR   EPD; EP60006; HS_ORM2.
DR   Ensembl-Gn; ENSG00000228278; homo_sapiens.
DR   Ensembl-Tr; ENST00000431067; homo_sapiens.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..4944
FT                   /organism="Homo sapiens"
FT                   /map="9q32"
FT                   /mol_type="genomic DNA"
FT                   /db_xref="taxon:9606"
FT   prim_transcript 1594..>4940
FT                   /note="AGP2 mRNA and introns"
FT   exon            <1609..1722
FT                   /gene="ORM2"
FT                   /note="alpha-1-acid glycoprotein 2; G00-120-251"
FT   CDS             join(1609..1722,2138..2280,2499..2569,3274..3381,
FT                   3532..3635,4776..4841)
FT                   /codon_start=1
FT                   /note="alpha-1-acid glycoprotein 2"
FT                   /db_xref="GOA:P19652"
FT                   /db_xref="HGNC:HGNC:8499"
FT                   /db_xref="InterPro:IPR000566"
FT                   /db_xref="InterPro:IPR001500"
FT                   /db_xref="InterPro:IPR012674"
FT                   /db_xref="PDB:3APU"
FT                   /db_xref="PDB:3APV"
FT                   /db_xref="PDB:3APW"
FT                   /db_xref="PDB:3APX"
FT                   /db_xref="UniProtKB/Swiss-Prot:P19652"
FT                   /protein_id="AAA51549.1"
FT                   /translation="MALSWVLTVLSLLPLLEAQIPLCANLVPVPITNATLDRITGKWFY
FT                   IASAFRNEEYNKSVQEIQATFFYFTPNKTEDTIFLREYQTRQNQCFYNSSYLNVQRENG
FT                   TVSRYEGGREHVAHLLFLRDTKTLMFGSYLDDEKNWGLSFYADKPETTKEQLGEFYEAL
FT                   DCLCIPRSDVMYTDWKKDKCEPLEKQHEKERKQEEGES"
FT   intron          1723..2137
FT                   /note="AGP2 intron A"
FT   exon            2138..2280
FT                   /number=2
FT   intron          2281..2498
FT                   /note="AGP2 intron B"
FT   exon            2499..2569
FT                   /number=3
FT   intron          2570..3273
FT                   /note="AGP2 intron C"
FT   exon            3274..3381
FT                   /number=4
FT   intron          3382..3531
FT                   /note="AGP2 intron D"
FT   exon            3532..3635
FT                   /number=5
FT   intron          3636..4775
FT                   /note="AGP2 intron E"
FT   exon            4776..>4841
FT                   /number=6
FT                   /note="alpha-1-acid glycoprotein 2"
XX
SQ   Sequence 4944 BP; 1206 A; 1369 C; 1292 G; 1077 T; 0 other;
     ggatccgctg aaaaatgaaa cagaaatgag tcgtgatggg cagggaggga gaagcaaggg        60
     agacgagaag tggggaacat ggaaggaaaa gccacgtgag gaagaaacca gaggtcaaga       120
     gaaaaagaat catggaggta gaggaagcaa aaaacacaca taacaaagaa tgtggacttt       180
     ggagtcaaac taatgtgagt ccaaacccag gctctctccc aaaccagttt gcggcagatg       240
     gccagtggaa cctcactctc ctcatcagta aaaagggggc agagtgaggg tcctgagagc       300
     tagtacaggg actgtgtgaa gtagacaatg cccagtgttt agcgtaagaa tcagggtcca       360
     gctggtgctc cctaaacagc agctgctgtt cactgttgaa aggcgctctg gaaggccagg       420
     cgcggtggct catgcttgta atcccagcac tgtgggaggc cgaggtgggc ggatcacctg       480
     aggtagggag ttcgagacca gcctgaccaa cgtggagaaa ccccatctct cctaaaaata       540
     caaaattagc caggcgtggt agcacatacc tgtaatccca gcgactcggg aggctgaggc       600
     aagagaattg cttgaaacca gcaggggagg ttgtggtgag ccaagatcga gccattgcac       660
     tccagccagg gcaacaagag gcaaaatggc gaaactccat ctccgagaaa aaaaaaaaaa       720
     aagaatactt tctgaaagta tttattcata caaataaaga cttgacccat aaggtaggaa       780
     cgcaaatggg ccacggaatc actcattcca cagtatacac cgagtgccct tgaagtgctg       840
     ggcactgctc caggattggg ggcatattgg tgaaaagaga agcaagcctg cctgctcaga       900
     tggcagggaa tggggaaaaa cagggagaca gtttcctgtt tgagatgttg ggagtctgct       960
     tcgagtagta tatttactgg aaatagacca ctaacttgga tgtccctttt tggaaatgtg      1020
     cctgcgtcca gggctgggtt ggggccccaa tgaactttgg ctctgacata gctgttgcca      1080
     cactcagtgg aactgaatcc atgtttgcct tcacccggca tccttcaccc caactctccc      1140
     cgccacaaca tacatcccat gccagcctgg ggaccctcaa aggtgcttca tcattaggtt      1200
     tgtggctggg tcctactgaa gtaagtcttg gcactcagag ggataggaat tgaatgaaga      1260
     catgagattc ctctgcggga ggcctctcta ggaaatctgt ggactcacac gtttactaat      1320
     gttgctgcag ccccgcaccc accttggcct tgggcagcca tactctaggg cttttgtaac      1380
     ctctccatgt gaggaactca aattagacct gggtttggag gcggtgctcc gagctggcct      1440
     ttgggggagg ttttgtgcga ggcatttccc aagtgctggc aggattgtgt cacagacaca      1500
     gagtaaactt ttgctgggct ccaagtgacc gcccatagtt tattataaag gtgactgcac      1560
     cctgcagcca ccagcactgc ctggctccac gtgcctcctg gtctcagtat ggcgctgtcc      1620
     tgggttctta cagtcctgag cctcctacct ctgctggaag cccagatccc attgtgtgcc      1680
     aacctagtac cggtgcccat caccaacgcc accctggacc gggtgagtgc ctgggctagc      1740
     cctgtcctga gcacatgggc agctgcctcc cttctctggg cttcccttta cctgctggct      1800
     gtggtcgcac ccccactccc agctctgcct ttttctcttc tgggtcccca gggtgaaatt      1860
     ctcaccagcc caggggactc tggaggcacc ccctgcctcc aaacacagaa gcctcactgc      1920
     agagtccttc acggaggacg gttctgtgct gggcctggag gggctgcctg gggggcaatg      1980
     actgatcctc agggtgagct cctgcatgcg cactgcccac caggggcctc atctccccat      2040
     ctgcaaaatc agggagagat ctgcctgagt ctcctcccag ctgacagtca aagattcagc      2100
     atcaagcccc catcaccagc tccccccttc tccccagatc actggcaagt ggttttatat      2160
     cgcatcggcc tttcgaaacg aggagtacaa taagtcggtt caggagatcc aagcaacctt      2220
     cttttacttt acccccaaca agacagagga cacgatcttt ctcagagagt accagacccg      2280
     gtgagagccc ccattccaat gcacccccga tctcagctgt ctggccagaa gacctgagca      2340
     agtccctcct tcttcctggc cttggccttc ccatgggtgg aaccgggagg gttggcttta      2400
     atctccacca gaactcttgc cccgggactg tgatgggcga ttggccactt ctcctcgata      2460
     acattactgt ttttcttccg ccttctggtt gactttagcc agaaccagtg cttctataac      2520
     tccagttacc tgaatgtcca gcgggagaat gggaccgtct ccagatacgg tgagggccag      2580
     ccctcaggca ggagggttca ccgtgggaac agggcaggcc agcataaggt gggggctgga      2640
     tgtagagccc tggaggcttt gggcacagag aaataaccac taacattttt gagctcttac      2700
     cacgtgctca gaaaaaatcc ctaagaagac actgagagaa ttagatgagg aaacataaga      2760
     acagagacct caaatagttt ccccaaggtc acacagctta taattagaac tagaattgga      2820
     actccaggct ggcttcagat ctgcctctct ctcacgccct ctttaagatc ctttgcaaac      2880
     caatggtaga agcctgtatg ttggagaggt ggtaccttca actatgtccc ccatcaccgc      2940
     agaggtggca catggcaggg atctgatgga gctgaactga catcatttag catcccgagc      3000
     ctcctctctg ggcctcattt tcctcctctg taaaacgggg agaaaggccc tgacagccac      3060
     agtctgtgtg aggctcctga gatctcatgt acagaaagtg cttggcgtgg agctgggcac      3120
     gcagcagggg ctgggcacac ggtggcccaa aggagacccg ggccttcact gatgggcttt      3180
     gtggccccgg acacacctag gactcctcac ctgtaagaca ggcaccattg tgccatccca      3240
     tgttctcacc cagaggctct ttttctcttc cagagggagg ccgagaacat gttgctcacc      3300
     tgctgttcct tagggacacc aagaccttga tgtttggttc ctacctggac gatgagaaga      3360
     actgggggct gtctttctat ggtaggcatg cttagcagcc ccaaactcat gcccctctca      3420
     ggcctcaccc cccattcacc cacccctggg ctggccccta gaaccccagc cctccctggc      3480
     ctccgccggg ccccaccatg tccccagtca gtctccttgc tccccctgca gctgacaagc      3540
     cagagacgac caaggagcaa ctgggagagt tctacgaagc tctcgactgc ttgtgcattc      3600
     ccaggtcaga tgtcatgtac accgactgga aaaaggtaaa cgcaagggat tggacattgc      3660
     ccaccttgtc catggcccaa cttgggcagc cccagaggcc cagagcagga aagctgccag      3720
     gcaaggctgc acagctaggc agatcttctg cttttaggca cctgcctcac tgtagggaca      3780
     gctgagctct acagaggccc aggggtggtg gatgagagcc caggagggag aagtccctgt      3840
     gaaaccaggg aggacctgaa agctaacagg agggaacagc gtgagccacg gggttggggg      3900
     attggcaatt ggaggggacg taatgcgggg agttaccacc tacagacgcg tcccaaaccc      3960
     caggctttca ccccaacctc cactccccgc tcatttttaa tacccgtgca gtggggaatt      4020
     gatactgtgg ttttcaatgt cacccacact gcagcacggc cacagtcacc atcccgattt      4080
     ttgctacaaa tgaaaattac tgtataatga gctccttaac acttttcttt aaacctgtgt      4140
     ttggaagact tgtgttggtg tggccctgtg ccctaatacc tgtgaaatca cagcaccgat      4200
     gagctggttc caatttttaa aatatataca tgcagtactt ccatgactat tcaaagaaaa      4260
     acaattcctt ccatttgcca cctgagatga ccaccaggga tgtgaactac ctcctgcccc      4320
     atccccagcc ccaggatcct gggacagggc ttatgaacgc aaccactgta gtcagctcac      4380
     ttgatccaca gcctggcacc tccactgtct ggctagggag cctcgaatgg gtcccaaggc      4440
     caccctgctc ctcagttaca tcatctgcat agtagtggtg gttgtgagga attcaggagc      4500
     tgcagcataa gggccctgca ggtactatgt gctcagtaaa tgccagtggt tcttaagggt      4560
     ctgagctccc attgtagagg caagtaagct gaggttcaga gacagaaaat gacttgccca      4620
     agatcaccca gctgggaagt gacagtgcca gggttggagc cctggttgag ctggttccac      4680
     aggccagagc tcattctgcc ctctccccgg aagacctccc accctgtccc catgcctctg      4740
     cttctccctc accccaattc cccgctgcct tctaggataa gtgtgagcca ctggagaagc      4800
     agcacgagaa ggagaggaaa caggaggagg gggaatccta gcaggacaca gccttggatc      4860
     aggacagaga cttgggggcc atcctgcccc tccaacccga catgtgtacc tcagcttttt      4920
     ccctcacttg catcaataaa gctt                                             4944
//