ID   BC039902; SV 1; linear; mRNA; STD; HUM; 3348 BP.
XX
AC   BC039902;
XX
DT   21-NOV-2002 (Rel. 73, Created)
DT   15-OCT-2008 (Rel. 97, Last updated, Version 7)
XX
DE   Homo sapiens angiotensin I converting enzyme (peptidyl-dipeptidase A) 2,
DE   mRNA (cDNA clone MGC:47598 IMAGE:5243048), complete cds.
XX
KW   MGC.
XX
OS   Homo sapiens (human)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC   Homo.
XX
RN   [1]
RP   1-3348
RX   DOI; 10.1073/pnas.242603899.
RX   PUBMED; 12477932.
RG   Mammalian Gene Collection Program Team
RA   Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., Klausner R.D.,
RA   Collins F.S., Wagner L., Shenmen C.M., Schuler G.D., Altschul S.F.,
RA   Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., Hopkins R.F., Jordan H.,
RA   Moore T., Max S.I., Wang J., Hsieh F., Diatchenko L., Marusina K.,
RA   Farmer A.A., Rubin G.M., Hong L., Stapleton M., Soares M.B., Bonaldo M.F.,
RA   Casavant T.L., Scheetz T.E., Brownstein M.J., Usdin T.B., Toshiyuki S.,
RA   Carninci P., Prange C., Raha S.S., Loquellano N.A., Peters G.J.,
RA   Abramson R.D., Mullahy S.J., Bosak S.A., McEwan P.J., McKernan K.J.,
RA   Malek J.A., Gunaratne P.H., Richards S., Worley K.C., Hale S., Garcia A.M.,
RA   Gay L.J., Hulyk S.W., Villalon D.K., Muzny D.M., Sodergren E.J., Lu X.,
RA   Gibbs R.A., Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S.,
RA   Sanchez A., Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA   Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., Rodriguez A.C.,
RA   Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S., Krzywinski M.I.,
RA   Skalska U., Smailus D.E., Schnerch A., Schein J.E., Jones S.J., Marra M.A.;
RT   "Generation and initial analysis of more than 15,000 full-length human and
RT   mouse cDNA sequences";
RL   Proc. Natl. Acad. Sci. U.S.A. 99(26):16899-16903(2002).
XX
RN   [2]
RC   NIH-MGC Project URL: http://mgc.nci.nih.gov
RP   1-3348
RG   NIH MGC Project
RA   ;
RT   ;
RL   Submitted (15-NOV-2002) to the INSDC.
RL   National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda,
RL   MD 20892-2590, USA
XX
DR   MD5; d4b1fd4203094787a3d16f621ae72873.
DR   Ensembl-Gn; ENSG00000130234; homo_sapiens.
DR   Ensembl-Tr; ENST00000252519; homo_sapiens.
DR   Ensembl-Tr; ENST00000427411; homo_sapiens.
XX
CC   Contact: MGC help desk
CC   Email: cgapbs-r@mail.nih.gov
CC   Tissue Procurement: Life Technologies, Inc.
CC   cDNA Library Preparation: Life Technologies, Inc.
CC   cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
CC   DNA Sequencing by: National Institutes of Health Intramural
CC   Sequencing Center (NISC),
CC   Gaithersburg, Maryland;
CC   Web site: http://www.nisc.nih.gov/
CC   Contact: nisc_mgc@nhgri.nih.gov
CC   Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
CC   Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
CC   Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
CC   Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
CC   Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
CC   McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
CC   Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
CC   Young,A., Zhang,L.-H. and Green,E.D.
CC   Clone distribution: MGC clone distribution information can be found
CC   through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
CC   Series: IRAK Plate: 82 Row: f Column: 15
CC   This clone was selected for full length sequencing because it
CC   passed the following selection criteria: matched mRNA gi: 11225608.
CC   Differences found between this sequence and the human reference
CC   genome (build 36) are described in misc_difference features below.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..3348
FT                   /organism="Homo sapiens"
FT                   /lab_host="DH10B"
FT                   /mol_type="mRNA"
FT                   /clone_lib="NIH_MGC_121"
FT                   /clone="MGC:47598 IMAGE:5243048"
FT                   /tissue_type="Brain, fetal, whole pooled"
FT                   /note="Vector: pCMV-SPORT6"
FT                   /db_xref="taxon:9606"
FT   gene            1..3348
FT                   /gene="ACE2"
FT                   /note="synonyms: DKFZP434A014, ACEH"
FT   CDS             19..2436
FT                   /codon_start=1
FT                   /gene="ACE2"
FT                   /product="angiotensin I converting enzyme
FT                   (peptidyl-dipeptidase A) 2"
FT                   /db_xref="GOA:Q9BYF1"
FT                   /db_xref="H-InvDB:HIT000095947.12"
FT                   /db_xref="HGNC:HGNC:13557"
FT                   /db_xref="InterPro:IPR001548"
FT                   /db_xref="InterPro:IPR031588"
FT                   /db_xref="PDB:1R42"
FT                   /db_xref="PDB:1R4L"
FT                   /db_xref="PDB:1XJP"
FT                   /db_xref="PDB:2AJF"
FT                   /db_xref="PDB:3D0G"
FT                   /db_xref="PDB:3D0H"
FT                   /db_xref="PDB:3D0I"
FT                   /db_xref="PDB:3KBH"
FT                   /db_xref="PDB:3SCI"
FT                   /db_xref="PDB:3SCJ"
FT                   /db_xref="PDB:3SCK"
FT                   /db_xref="PDB:3SCL"
FT                   /db_xref="PDB:6ACG"
FT                   /db_xref="PDB:6ACJ"
FT                   /db_xref="PDB:6ACK"
FT                   /db_xref="PDB:6CS2"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q9BYF1"
FT                   /protein_id="AAH39902.1"
FT                   /translation="MSSSSWLLLSLVAVTAAQSTIEEQAKTFLDKFNHEAEDLFYQSSL
FT                   ASWNYNTNITEENVQNMNNAGDKWSAFLKEQSTLAQMYPLQEIQNLTVKLQLQALQQNG
FT                   SSVLSEDKSKRLNTILNTMSTIYSTGKVCNPDNPQECLLLEPGLNEIMANSLDYNERLW
FT                   AWESWRSEVGKQLRPLYEEYVVLKNEMARANHYEDYGDYWRGDYEVNGVDGYDYSRGQL
FT                   IEDVEHTFEEIKPLYEHLHAYVRAKLMNAYPSYISPIGCLPAHLLGDMWGRFWTNLYSL
FT                   TVPFGQKPNIDVTDAMVDQAWDAQRIFKEAEKFFVSVGLPNMTQGFWENSMLTDPGNVQ
FT                   KAVCHPTAWDLGKGDFRILMCTKVTMDDFLTAHHEMGHIQYDMAYAAQPFLLRNGANEG
FT                   FHEAVGEIMSLSAATPKHLKSIGLLSPDFQEDNETEINFLLKQALTIVGTLPFTYMLEK
FT                   WRWMVFKGEIPKDQWMKKWWEMKREIVGVVEPVPHDETYCDPASLFHVSNDYSFIRYYT
FT                   RTLYQFQFQEALCQAAKHEGPLHKCDISNSTEAGQKLFNMLRLGKSEPWTLALENVVGA
FT                   KNMNVRPLLNYFEPLFTWLKDQNKNSFVGWSTDWSPYADQSIKVRISLKSALGDKAYEW
FT                   NDNEMYLFRSSVAYAMRQYFLKVKNQMILFGEEDVRVANLKPRISFNFFVTAPKNVSDI
FT                   IPRTEVEKAIRMSRSRINDAFRLNDNSLEFLGIQPTLGPPNQPPVSIWLIVFGVVMGVI
FT                   VVGIVILIFTGIRDRKKKNKARSGENPYASIDISKGENNPGFQNTDDVQTSF"
FT   misc_difference 3306..3348
FT                   /gene="ACE2"
FT                   /note="polyA tail: 43 bases do not align to the human
FT                   genome."
XX
SQ   Sequence 3348 BP; 1049 A; 635 C; 747 G; 917 T; 0 other;
     cttggctcac aggggacgat gtcaagctct tcctggctcc ttctcagcct tgttgctgta        60
     actgctgctc agtccaccat tgaggaacag gccaagacat ttttggacaa gtttaaccac       120
     gaagccgaag acctgttcta tcaaagttca cttgcttctt ggaattataa caccaatatt       180
     actgaagaga atgtccaaaa catgaataat gctggggaca aatggtctgc ctttttaaag       240
     gaacagtcca cacttgccca aatgtatcca ctacaagaaa ttcagaatct cacagtcaag       300
     cttcagctgc aggctcttca gcaaaatggg tcttcagtgc tctcagaaga caagagcaaa       360
     cggttgaaca caattctaaa tacaatgagc accatctaca gtactggaaa agtttgtaac       420
     ccagataatc cacaagaatg cttattactt gaaccaggtt tgaatgaaat aatggcaaac       480
     agtttagact acaatgagag gctctgggct tgggaaagct ggagatctga ggtcggcaag       540
     cagctgaggc cattatatga agagtatgtg gtcttgaaaa atgagatggc aagagcaaat       600
     cattatgagg actatgggga ttattggaga ggagactatg aagtaaatgg ggtagatggc       660
     tatgactaca gccgcggcca gttgattgaa gatgtggaac atacctttga agagattaaa       720
     ccattatatg aacatcttca tgcctatgtg agggcaaagt tgatgaatgc ctatccttcc       780
     tatatcagtc caattggatg cctccctgct catttgcttg gtgatatgtg gggtagattt       840
     tggacaaatc tgtactcttt gacagttccc tttggacaga aaccaaacat agatgttact       900
     gatgcaatgg tggaccaggc ctgggatgca cagagaatat tcaaggaggc cgagaagttc       960
     tttgtatctg ttggtcttcc taatatgact caaggattct gggaaaattc catgctaacg      1020
     gacccaggaa atgttcagaa agcagtctgc catcccacag cttgggacct ggggaagggc      1080
     gacttcagga tccttatgtg cacaaaggtg acaatggacg acttcctgac agctcatcat      1140
     gagatggggc atatccagta tgatatggca tatgctgcac aaccttttct gctaagaaat      1200
     ggagctaatg aaggattcca tgaagctgtt ggggaaatca tgtcactttc tgcagccaca      1260
     cctaagcatt taaaatccat tggtcttctg tcacccgatt ttcaagaaga caatgaaaca      1320
     gaaataaact tcctgctcaa acaagcactc acgattgttg ggactctgcc atttacttac      1380
     atgttagaga agtggaggtg gatggtcttt aaaggggaaa ttcccaaaga ccagtggatg      1440
     aaaaagtggt gggagatgaa gcgagagata gttggggtgg tggaacctgt gccccatgat      1500
     gaaacatact gtgaccccgc atctctgttc catgtttcta atgattactc attcattcga      1560
     tattacacaa ggacccttta ccaattccag tttcaagaag cactttgtca agcagctaaa      1620
     catgaaggcc ctctgcacaa atgtgacatc tcaaactcta cagaagctgg acagaaactg      1680
     ttcaatatgc tgaggcttgg aaaatcagaa ccctggaccc tagcattgga aaatgttgta      1740
     ggagcaaaga acatgaatgt aaggccactg ctcaactact ttgagccctt atttacctgg      1800
     ctgaaagacc agaacaagaa ttcttttgtg ggatggagta ccgactggag tccatatgca      1860
     gaccaaagca tcaaagtgag gataagccta aaatcagctc ttggagataa agcatatgaa      1920
     tggaacgaca atgaaatgta cctgttccga tcatctgttg catatgctat gaggcagtac      1980
     tttttaaaag taaaaaatca gatgattctt tttggggagg aggatgtgcg agtggctaat      2040
     ttgaaaccaa gaatctcctt taatttcttt gtcactgcac ctaaaaatgt gtctgatatc      2100
     attcctagaa ctgaagttga aaaggccatc aggatgtccc ggagccgtat caatgatgct      2160
     ttccgtctga atgacaacag cctagagttt ctggggatac agccaacact tggacctcct      2220
     aaccagcccc ctgtttccat atggctgatt gtttttggag ttgtgatggg agtgatagtg      2280
     gttggcattg tcatcctgat cttcactggg atcagagatc ggaagaagaa aaataaagca      2340
     agaagtggag aaaatcctta tgcctccatc gatattagca aaggagaaaa taatccagga      2400
     ttccaaaaca ctgatgatgt tcagacctcc ttttagaaaa atctatgttt ttcctcttga      2460
     ggtgattttg ttgtatgtaa atgttaattt catggtatag aaaatataag atgataaaga      2520
     tatcattaaa tgtcaaaact atgactctgt tcagaaaaaa aattgtccaa agacaacatg      2580
     gccaaggaga gagcatcttc attgacattg ctttcagtat ttatttctgt ctctggattt      2640
     gacttctgtt ctgtttctta ataaggattt tgtattagag tatattaggg aaagtgtgta      2700
     tttggtctca caggctgttc agggataatc taaatgtaaa tgtctgttga atttctgaag      2760
     ttgaaaacaa ggatatatca ttggagcaag tgttggatct tgtatggaat atggatggat      2820
     cacttgtaag gacagtgcct gggaactggt gtagctgcaa ggattgagaa tggcatgcat      2880
     tagctcactt tcatttaatc cattgtcaag gatgacatgc tttcttcaca gtaactcagt      2940
     tcaagtacta tggtgatttg cctacagtga tgtttggaat cgatcatgct ttcttcaagg      3000
     tgacaggtct aaagagagaa gaatccaggg aacaggtaga ggacattgct ttttcacttc      3060
     caaggtgctt gatcaacatc tccctgacaa cacaaaacta gagccagggg cctccgtgaa      3120
     ctcccagagc atgcctgata gaaactcatt tctactgttc tctaactgtg gagtgaatgg      3180
     aaattccaac tgtatgttca ccctctgaag tgggtaccca gtctcttaaa tcttttgtat      3240
     ttgctcacag tgtttgagca gtgctgagca caaagcagac actcaataaa tgctagattt      3300
     acacaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaa                   3348
//