Dbfetch
ID BC046610; SV 1; linear; mRNA; STD; MUS; 2991 BP. XX AC BC046610; XX DT 13-FEB-2003 (Rel. 74, Created) DT 24-SEP-2008 (Rel. 97, Last updated, Version 12) XX DE Mus musculus endoplasmic reticulum aminopeptidase 1, mRNA (cDNA clone DE MGC:54451 IMAGE:6397585), complete cds. XX KW MGC. XX OS Mus musculus (house mouse) OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; OC Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae; OC Murinae; Mus; Mus. XX RN [1] RP 1-2991 RX DOI; 10.1073/pnas.242603899. RX PUBMED; 12477932. RG Mammalian Gene Collection Program Team RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., Klausner R.D., RA Collins F.S., Wagner L., Shenmen C.M., Schuler G.D., Altschul S.F., RA Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., Hopkins R.F., Jordan H., RA Moore T., Max S.I., Wang J., Hsieh F., Diatchenko L., Marusina K., RA Farmer A.A., Rubin G.M., Hong L., Stapleton M., Soares M.B., Bonaldo M.F., RA Casavant T.L., Scheetz T.E., Brownstein M.J., Usdin T.B., Toshiyuki S., RA Carninci P., Prange C., Raha S.S., Loquellano N.A., Peters G.J., RA Abramson R.D., Mullahy S.J., Bosak S.A., McEwan P.J., McKernan K.J., RA Malek J.A., Gunaratne P.H., Richards S., Worley K.C., Hale S., Garcia A.M., RA Gay L.J., Hulyk S.W., Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., RA Gibbs R.A., Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., RA Sanchez A., Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G., RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., Rodriguez A.C., RA Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S., Krzywinski M.I., RA Skalska U., Smailus D.E., Schnerch A., Schein J.E., Jones S.J., Marra M.A.; RT "Generation and initial analysis of more than 15,000 full-length human and RT mouse cDNA sequences"; RL Proc. Natl. Acad. Sci. U.S.A. 99(26):16899-16903(2002). XX RN [2] RC NIH-MGC Project URL: http://mgc.nci.nih.gov RP 1-2991 RG NIH MGC Project RA ; RT ; RL Submitted (10-FEB-2003) to the INSDC. RL National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, RL MD 20892-2590, USA XX DR MD5; 44bc2771b5de4f0aed07b176be54de8f. DR Ensembl-Gn; ENSMUSG00000021583; mus_musculus. DR Ensembl-Gn; MGP_129S1SvImJ_G0020768; mus_musculus_129s1svimj. DR Ensembl-Gn; MGP_AJ_G0020723; mus_musculus_aj. DR Ensembl-Gn; MGP_AKRJ_G0020697; mus_musculus_akrj. DR Ensembl-Gn; MGP_BALBcJ_G0020720; mus_musculus_balbcj. DR Ensembl-Gn; MGP_C3HHeJ_G0020514; mus_musculus_c3hhej. DR Ensembl-Gn; MGP_C57BL6NJ_G0021157; mus_musculus_c57bl6nj. DR Ensembl-Gn; MGP_CASTEiJ_G0020032; mus_musculus_casteij. DR Ensembl-Gn; MGP_CBAJ_G0020473; mus_musculus_cbaj. DR Ensembl-Gn; MGP_DBA2J_G0020598; mus_musculus_dba2j. DR Ensembl-Gn; MGP_FVBNJ_G0020578; mus_musculus_fvbnj. DR Ensembl-Gn; MGP_LPJ_G0020671; mus_musculus_lpj. DR Ensembl-Gn; MGP_NODShiLtJ_G0020607; mus_musculus_nodshiltj. DR Ensembl-Gn; MGP_NZOHlLtJ_G0021197; mus_musculus_nzohlltj. DR Ensembl-Gn; MGP_PWKPhJ_G0019785; mus_musculus_pwkphj. DR Ensembl-Gn; MGP_WSBEiJ_G0020089; mus_musculus_wsbeij. DR Ensembl-Tr; ENSMUST00000169114; mus_musculus. DR Ensembl-Tr; MGP_129S1SvImJ_T0037610; mus_musculus_129s1svimj. DR Ensembl-Tr; MGP_AJ_T0037555; mus_musculus_aj. DR Ensembl-Tr; MGP_AKRJ_T0037522; mus_musculus_akrj. DR Ensembl-Tr; MGP_BALBcJ_T0037572; mus_musculus_balbcj. DR Ensembl-Tr; MGP_C3HHeJ_T0037327; mus_musculus_c3hhej. DR Ensembl-Tr; MGP_C57BL6NJ_T0038030; mus_musculus_c57bl6nj. DR Ensembl-Tr; MGP_CASTEiJ_T0037282; mus_musculus_casteij. DR Ensembl-Tr; MGP_CBAJ_T0037232; mus_musculus_cbaj. DR Ensembl-Tr; MGP_DBA2J_T0037378; mus_musculus_dba2j. DR Ensembl-Tr; MGP_FVBNJ_T0037342; mus_musculus_fvbnj. DR Ensembl-Tr; MGP_LPJ_T0037496; mus_musculus_lpj. DR Ensembl-Tr; MGP_NODShiLtJ_T0037338; mus_musculus_nodshiltj. DR Ensembl-Tr; MGP_NZOHlLtJ_T0038085; mus_musculus_nzohlltj. DR Ensembl-Tr; MGP_PWKPhJ_T0036916; mus_musculus_pwkphj. DR Ensembl-Tr; MGP_WSBEiJ_T0036727; mus_musculus_wsbeij. XX CC Contact: MGC help desk CC Email: cgapbs-r@mail.nih.gov CC Tissue Procurement: Jeffrey E. Green, M.D. CC cDNA Library Preparation: Life Technologies, Inc. CC cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) CC DNA Sequencing by: National Institutes of Health Intramural CC Sequencing Center (NISC), CC Gaithersburg, Maryland; CC Web site: http://www.nisc.nih.gov/ CC Contact: nisc_mgc@nhgri.nih.gov CC Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B., CC Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S., CC Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P., CC Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R., CC Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C., CC McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W., CC Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L., CC Young,A., Zhang,L.-H. and Green,E.D. CC Clone distribution: MGC clone distribution information can be found CC through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov CC Series: IRAK Plate: 100 Row: h Column: 13 CC This clone was selected for full length sequencing because it CC passed the following selection criteria: matched mRNA gi: 31542148. CC Differences found between this sequence and the mouse C57BL/6J CC genome (build 36) are described in misc_difference features below. XX FH Key Location/Qualifiers FH FT source 1..2991 FT /organism="Mus musculus" FT /lab_host="DH10B" FT /strain="FVB/N" FT /mol_type="mRNA" FT /clone_lib="NCI_CGAP_Co24" FT /clone="MGC:54451 IMAGE:6397585" FT /tissue_type="Colon, normal. 5 month old male mouse." FT /note="Vector: pCMV-SPORT6" FT /db_xref="taxon:10090" FT gene 1..2991 FT /gene="Erap1" FT /note="synonyms: ERAAP, PILSA, PILSAP" FT CDS 82..2874 FT /codon_start=1 FT /gene="Erap1" FT /product="endoplasmic reticulum aminopeptidase 1" FT /db_xref="GOA:Q9EQH2" FT /db_xref="InterPro:IPR001930" FT /db_xref="InterPro:IPR014782" FT /db_xref="InterPro:IPR024571" FT /db_xref="InterPro:IPR033520" FT /db_xref="InterPro:IPR034016" FT /db_xref="InterPro:IPR042097" FT /db_xref="MGI:MGI:1933403" FT /db_xref="UniProtKB/Swiss-Prot:Q9EQH2" FT /protein_id="AAH46610.1" FT /translation="MPSLLPLVLTFLSVSSPSWCQNSDIESLKASNGDSFPWNNMRLPE FT YMTPIHYDLMIHANLSTLTFWGKTEVEIIASRPTSTIIMHSHHLQISKATLRRGAGEML FT SEEPLKVLEYPAHEQVALLAAQPLLAGSLYTVIIDYAANLSESFHGFYKSTYRTQEGEM FT RILAATQFEPTAARMAFPCFDEPALKASFSIKIKRDPRHLAISNMPLVKSVNVAEGLIE FT DHFDITVKMSTYLVAFIISDFKSVSKMTKSGVKVSVYAVPDKINQADYALDAAVTLLEF FT YEDYFNIPYPLPKQDLAAIPDFQSGAMENWGLTTYRESSLLYDKEKSSASSKLGITMIV FT SHELAHQWFGNLVTMEWWNDLWLNEGFAKFMEFVSVTVTHPELKVEDYFFGKCFNAMEV FT DALNSSHPVSTPVENPAQIREMFDDVSYEKGACILNMLRDYLSADTFKRGIVQYLQKYS FT YKNTKNEDLWNSMMHICPTDGTQTMDGFCSRSQHSSSTSHWRQEVVDVKTMMNTWTLQK FT GFPLITITVSGRNVHMKQEHYMKGSERFPETGYLWHVPLTFITSKSDSVQRFLLKTKTD FT VLILPEAVQWIKFNVGMNGYYIVHYADDGWASLSGLLKEAHTTISSNDRASLINNAFQL FT VSIEKLSIEKALDLTLYLKNETEIMPIFQALNELIPMYKLMEKRDMIEVETQFKDFLLK FT LLKDLIDKQTWTDEGSVSERMLRSQLLLLACVRNYQPCVQRAERYFREWKSSNGNMSIP FT IDVTLAVFAVGAQNTEGWDFLYSKYQSSLSSTEKSQIEFSLCTSKDPEKLQWLLDQSFK FT GEIIKTQEFPHILTLIGRNPVGYPLAWKFLRENWNKLVQKFELGSSSIAHMVMGTTDQF FT STRARLEEVKGFFSSLKENGSQLRCVQQTIETIEENIRWMDKNFDKIRLWLQKEKPELL FT " FT misc_difference 2969..2991 FT /gene="Erap1" FT /note="polyA tail: 23 bases do not align to the mouse FT genome." XX SQ Sequence 2991 BP; 910 A; 615 C; 658 G; 808 T; 0 other; ggcggccaga ctggcggctg tgatgaaggt aatttgagca agaagatagt gtctctgctc 60 ctcaaatagc cacttgcaag catgccctct cttcttcccc tagtattgac atttttatct 120 gtgtcatctc cttcttggtg tcagaacagt gatatagaat ctctaaaagc tagtaatgga 180 gactcattcc cttggaataa tatgcgactt cctgagtata tgaccccgat tcattatgat 240 ctcatgatcc atgcaaacct cagcactctg actttctggg gaaaaacaga agtagaaatc 300 atagctagcc ggcccaccag caccattatt atgcatagtc accacctgca aatatctaag 360 gccaccctca gaagaggagc tggagagatg ctgtctgaag aaccactgaa ggtcctggaa 420 tatcctgctc atgagcaagt tgcactgctg gctgcccagc cgcttcttgc tgggtccctg 480 tacacagtta tcatcgacta tgctgccaac ctgtctgaga gtttccatgg attctataag 540 agcacctaca gaacacagga aggtgaaatg agaatacttg cagcaacaca gtttgaaccc 600 acagctgcta gaatggcttt tccctgcttt gatgaacctg ccctcaaggc aagtttttcc 660 atcaagataa agagggatcc aaggcacctg gccatctcca acatgcccct ggtgaaatct 720 gtgaatgttg ctgaaggact catagaagac cattttgaca tcactgtgaa gatgagtacc 780 tacctagtgg ccttcatcat ttctgatttt aagtctgtga gcaagatgac taagagtgga 840 gtcaaggttt ctgtgtatgc tgtgccagac aagataaatc aagccgatta tgctctggat 900 gctgcagtga ctcttctaga gttttatgag gattatttca acattccata tcctttaccc 960 aagcaagatc ttgctgcaat cccagacttt cagtctggtg ctatggaaaa ctggggactg 1020 accacataca gagagtcctc tctgttatac gataaagaaa agtcttctgc atctagtaag 1080 cttggcatca caatgattgt gtcccatgaa ctggctcacc agtggttcgg gaacctggtc 1140 accatggaat ggtggaatga cctttggctc aatgaaggat ttgccaaatt tatggagttt 1200 gtgtctgtca ctgtgaccca tcctgaacta aaagttgaag actatttctt tggcaagtgt 1260 tttaatgcaa tggaagttga tgcattaaac tcctctcacc ctgtatccac acctgtggag 1320 aatcctgcgc agattcggga gatgtttgat gatgtttctt atgaaaaggg agcttgtatt 1380 ctgaatatgc taagagatta tctgagtgca gatacattta aaagaggaat tgtacagtat 1440 ctccaaaagt atagttataa aaacacaaaa aacgaggacc tgtggaatag catgatgcat 1500 atttgcccta cagatggcac acaaacaatg gatggcttct gctctagaag ccaacactca 1560 tcatcaacct cacactggcg tcaggaggtt gtagatgtaa agaccatgat gaacacatgg 1620 acactgcaga agggctttcc tctgataacc atcactgtga gtgggcggaa tgtgcacatg 1680 aagcaagaac actacatgaa gggttcagaa cgcttcccag agactgggta tttgtggcat 1740 gttccactga cattcattac cagcaaatca gactcagtcc agagattttt gctgaagaca 1800 aagacagatg tgctcatcct cccagaagca gtgcagtgga tcaaatttaa cgtgggaatg 1860 aatggctatt acattgtgca ttacgcggat gatggatggg cttctctgag tggcctttta 1920 aaagaagcgc acacaacaat cagcagtaat gatcgggcaa gtctcatcaa caatgcattt 1980 cagctggtca gcattgaaaa gttatcaata gaaaaagccc tggatttaac cctgtacttg 2040 aaaaatgaaa ctgaaattat gcccatattc caagctttga atgaactcat acctatgtat 2100 aagttaatgg agaaaagaga tatgattgag gtggaaactc aatttaagga cttccttctc 2160 aagttgctga aggaccttat tgataagcag acttggacag atgaaggctc tgtctcagag 2220 aggatgctca ggagccagct cctcctcctc gcatgtgtgc gcaattatca gccctgtgtg 2280 cagagggcag agcgctattt cagagagtgg aagtcatcta atggaaacat gagcatccct 2340 attgatgtga ccttggctgt gtttgctgtc ggggcccaaa acacagaagg ctgggatttt 2400 ctttatagta aatatcagtc ttctttgtct agtactgaga aaagccaaat tgaattttcc 2460 ctctgtacaa gcaaagatcc agaaaaactt cagtggcttc tagatcaaag ttttaaggga 2520 gagataataa aaactcagga atttccacat attctcacac tcattggcag aaacccagtg 2580 ggttatccat tggcctggaa gtttctgagg gaaaattgga ataagcttgt acagaaattt 2640 gaacttggct catcgtccat agctcacatg gtaatgggaa caacagacca gttttccacc 2700 agagcacgtc ttgaagaggt aaaaggattc ttcagttcct tgaaagaaaa cggttctcag 2760 ctccgttgtg ttcaacaaac cattgagacc attgaagaaa acatacgatg gatggataag 2820 aattttgata aaataagact gtggctgcaa aaagaaaagc cagagttgct gtgacaatat 2880 gtttcttgcc aagttccaga atttatgatg agcaaaattt tgttcaattt gagtattttt 2940 aaactaaaga tagttgtttt ggctactaaa aaaaaaaaaa aaaaaaaaaa a 2991 //