![]() |
EBI DbfetchID BC007637; SV 1; linear; mRNA; STD; HUM; 1913 BP. XX AC BC007637; XX DT 16-MAY-2001 (Rel. 67, Created) DT 15-OCT-2008 (Rel. 97, Last updated, Version 8) XX DE Homo sapiens chromosome 1 open reading frame 94, mRNA (cDNA clone MGC:15882 DE IMAGE:3529463), complete cds. XX KW MGC. XX OS Homo sapiens (human) OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; OC Homo. XX RN [1] RP 1-1913 RX DOI; 10.1073/pnas.242603899 RX PUBMED; 12477932. RG Mammalian Gene Collection Program Team RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., Klausner R.D., RA Collins F.S., Wagner L., Shenmen C.M., Schuler G.D., Altschul S.F., RA Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., Hopkins R.F., Jordan H., RA Moore T., Max S.I., Wang J., Hsieh F., Diatchenko L., Marusina K., RA Farmer A.A., Rubin G.M., Hong L., Stapleton M., Soares M.B., Bonaldo M.F., RA Casavant T.L., Scheetz T.E., Brownstein M.J., Usdin T.B., Toshiyuki S., RA Carninci P., Prange C., Raha S.S., Loquellano N.A., Peters G.J., RA Abramson R.D., Mullahy S.J., Bosak S.A., McEwan P.J., McKernan K.J., RA Malek J.A., Gunaratne P.H., Richards S., Worley K.C., Hale S., Garcia A.M., RA Gay L.J., Hulyk S.W., Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., RA Gibbs R.A., Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., RA Sanchez A., Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G., RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., Rodriguez A.C., RA Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S., Krzywinski M.I., RA Skalska U., Smailus D.E., Schnerch A., Schein J.E., Jones S.J., Marra M.A.; RT "Generation and initial analysis of more than 15,000 full-length human and RT mouse cDNA sequences"; RL Proc. Natl. Acad. Sci. U.S.A. 99(26):16899-16903(2002). XX RN [2] RC NIH-MGC Project URL: http://mgc.nci.nih.gov RP 1-1913 RG NIH MGC Project RA ; RT ; RL Submitted (10-MAY-2001) to the EMBL/GenBank/DDBJ databases. RL National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, RL MD 20892-2590, USA XX DR ASTD; TRAN00000052645. DR Ensembl-Gn; ENSG00000142698; Homo_sapiens. DR Ensembl-Tr; ENST00000398041; Homo_sapiens. DR H-InvDB; HIT000033347. DR ImaGenes; IMAGp958J24200Q. DR ImaGenes; IOH6973. DR ImaGenes; IRALp962G2323Q. DR ImaGenes; IRAUp969D0739D. XX CC Contact: MGC help desk CC Email: cgapbs-r@mail.nih.gov CC Tissue Procurement: ATCC CC cDNA Library Preparation: Rubin Laboratory CC cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) CC DNA Sequencing by: Genome Sequence Centre, CC BC Cancer Agency, Vancouver, BC, Canada CC info@bcgsc.bc.ca CC Martin Hirst, Thomas Zeng, Ryan Morin, Michelle Moksa, Johnson CC Pang, Diana Mah, Jing Wang, Kieth Fichter, Eric Chuah, Allen CC Delaney, Rob Kirkpatrick, Agnes Baross, Sarah Barber, Mabel CC Brown-John, Steve S. Chand, William Chow, Ryan Babakaiff, Dave CC Wong, Corey Matsuo, Jaclyn Beland, Susan Gibson, Luis delRio, Ruth CC Featherstone, Malachi Griffith, Obi Griffith, Ran Guin, Nancy Liao, CC Kim MacDonald, Mike R. Mayo, Josh Moran, Diana Palmquist, JR CC Santos, Duane Smailus, Jeff Stott, Miranda Tsai, George Yang, CC Jacquie Schein, Asim Siddiqui,Steven Jones, Rob Holt, Marco Marra. CC Clone distribution: MGC clone distribution information can be found CC through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov CC Series: IRAL Plate: 23 Row: g Column: 23 CC This clone was selected for full length sequencing because it CC passed the following selection criteria: matched mRNA gi: 41054803. CC Differences found between this sequence and the human reference CC genome (build 35) are described in misc_difference features below CC and these differences were also compared to chimpanzee genomic CC seqeunces available as of 09/15/2004 00:00:00. XX FH Key Location/Qualifiers FH FT source 1..1913 FT /organism="Homo sapiens" FT /lab_host="DH10B-R" FT /mol_type="mRNA" FT /clone_lib="NIH_MGC_17" FT /clone="MGC:15882 IMAGE:3529463" FT /tissue_type="Muscle, rhabdomyosarcoma" FT /note="Vector: pOTB7" FT /db_xref="taxon:9606" FT gene 1..1913 FT /gene="C1orf94" FT /note="synonym: MGC15882" FT CDS 308..1534 FT /codon_start=1 FT /gene="C1orf94" FT /product="C1orf94 protein" FT /db_xref="GOA:Q6P1W5" FT /db_xref="HGNC:28250" FT /db_xref="UniProtKB/Swiss-Prot:Q6P1W5" FT /protein_id="AAH07637.1" FT /translation="MPVISSRQDCDSATSTVTDILCAAEVKSSKGTEDRGRILGDSNLE FT VSKLLSQFPLKSTETSKVPDNKNVLDKTRVTKDFLQDNLFSGPGPKEPTGLSPFLLLPP FT RPPPARPEKLPELPAQKRQLPVFAKICSKPKADPAVERHHLMEWSPGTKEPKKGQGSLF FT LSQWPQSQKDACGEEGCCDAVGTASLTLPPKKPTCPAEKNLLYEFLGATKNPSGQPRLR FT NKVEVDGPELKFNAPVTVADKNNPKYTGNVFTPHFPTAMTSATLNQPLWLNLNYPPPPV FT FTNHSTFLQYQGLYPQQAARMPYQQALHPQLGCYSQQVMPYNPQQMGQQIFRSSYTPLL FT SYIPFVQPNYPYPQRTPPKMSANPRDPPLMAGDGPQYLFPQGYGFGSTSGGPLMHSPYF FT SSSGNGINF" FT misc_difference 412 FT /gene="C1orf94" FT /note="'A' in cDNA is 'G' in the human genome; no amino FT acid change. The chimpanzee genome agrees with the human FT genomic sequence and not the cDNA." FT misc_difference 440 FT /gene="C1orf94" FT /note="'G' in cDNA is 'C' in the human genome; amino acid FT difference: 'E' in cDNA, 'Q' in the human genome." FT misc_difference 643 FT /gene="C1orf94" FT /note="'G' in cDNA is 'C' in the human genome; amino acid FT difference: 'E' in cDNA, 'D' in the human genome. The FT chimpanzee genome agrees with the human genomic sequence FT and not the cDNA." FT misc_difference 1635 FT /gene="C1orf94" FT /note="'A' in cDNA is 'G' in the human genome." FT misc_difference 1743^1744 FT /gene="C1orf94" FT /note="6 bases in the human genome, ACACAT, are not found FT in cDNA. The chimpanzee genome agrees with the cDNA FT sequence, suggesting that this difference is unlikely to be FT due to an artifact." FT misc_difference 1787 FT /gene="C1orf94" FT /note="'G' in cDNA is 'A' in the human genome." FT misc_difference 1899..1913 FT /gene="C1orf94" FT /note="polyA tail: 15 bases do not align to the human FT genome." XX SQ Sequence 1913 BP; 495 A; 554 C; 457 G; 407 T; 0 other; gagggctcag ccttcccacc agccctggag aggaggaagg gactgactac gccagcaagc 60 tctggagctc agcagtggca aagatgagat ctccttgttg gtggaacagg agttcctaag 120 cctcaccaaa gagcactcga tcctggtcga agagagttct ggggagctgg aggtacccgg 180 cagctctccc gaggggacca gagagctggc tccctgcatt cttgcccctc ctctagtggc 240 aggcagtaat gagcgcccca gagcctccat cattgtcgga gacaagcttc tgaagcagaa 300 ggtggccatg cccgttatca gcagcaggca ggactgtgat tctgccactt ctactgtcac 360 agacattctg tgtgccgccg aggtcaagag cagcaagggg acagaggaca gaggccgcat 420 cctaggtgac tccaacttgg aagtcagcaa gcttctgtcc cagttcccac tgaagtccac 480 tgagacatcc aaggtccctg acaacaagaa tgtgctggac aagacaaggg tcaccaagga 540 cttcctacag gacaacctgt tcagtggccc tggacccaag gagcccacag ggctgagccc 600 atttctgctg ctgcctcccc gacctcctcc tgcacgtcct gagaagctcc ctgagctccc 660 tgctcagaag aggcagctcc cagtgtttgc caagatctgt tccaagccca aggctgaccc 720 tgctgtggag aggcaccact tgatggaatg gagccctggc accaaggagc caaaaaaggg 780 tcaagggagc ctctttctca gccagtggcc ccagagccag aaggacgcct gtggtgagga 840 gggttgctgt gacgcagtgg gcaccgcatc actgaccctg ccgcccaaga aacctacatg 900 tccagccgag aagaacttgc tctatgagtt ccttggggcc accaagaacc caagcgggca 960 gccgagactt cgaaacaaag tggaagtgga tgggccggag ctgaaattta acgcacctgt 1020 gacggttgct gacaagaaca acccgaagta cacagggaat gttttcactc cacactttcc 1080 tacagccatg acctcagcaa ccctgaacca gccactctgg ctcaacctga actatccacc 1140 tccaccagtg ttcacgaatc actctacctt cttgcagtat cagggcctgt acccacagca 1200 ggcagcgagg atgccctatc agcaggcttt gcacccgcag ctgggatgtt actcccaaca 1260 ggtgatgcca tacaacccac agcagatggg acagcagatc ttccgctctt cctacacccc 1320 tctgctgagc tacatccctt ttgtccagcc caattatccc taccctcaga ggacacctcc 1380 aaagatgtct gccaaccccc gagaccctcc cctaatggca ggagatggac cgcagtacct 1440 ctttccccaa ggatatgggt tcggctcgac atccggaggg cccttgatgc acagccccta 1500 tttttcttcc agtgggaatg gcataaactt ttagatctcc tcttctccct tctcctccct 1560 tagcccttgg atcaggacta ggggctctga tttttggatt ctgcaaaagc ttggtatgaa 1620 gtttggaaaa gcaaagttct gaccaggtca cagacaaaac agcaagacca gattcatcta 1680 ttggccaaca ctgacacaaa aatagccctc ctcacacatg gcacaagcta cacacacaca 1740 cacgaccctc atattcatac ttgcttgctc aaccacttat gcatctgtat ttagctaaca 1800 tgagtgattt ttgtttttgt ttttgttggt aaaatagaag taagacactt aattttagaa 1860 agtttgtatt ttatgataaa agtatgagct acttgaaaaa aaaaaaaaaa aaa 1913 // ![]() |