
EBI Dbfetch

ID   BC007637; SV 1; linear; mRNA; STD; HUM; 1913 BP.
XX
AC   BC007637;
XX
DT   16-MAY-2001 (Rel. 67, Created)
DT   15-OCT-2008 (Rel. 97, Last updated, Version 8)
XX
DE   Homo sapiens chromosome 1 open reading frame 94, mRNA (cDNA clone MGC:15882
DE   IMAGE:3529463), complete cds.
XX
KW   MGC.
XX
OS   Homo sapiens (human)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC   Homo.
XX
RN   [1]
RP   1-1913
RX   DOI; 10.1073/pnas.242603899
RX   PUBMED; 12477932.
RG   Mammalian Gene Collection Program Team
RA   Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., Klausner R.D.,
RA   Collins F.S., Wagner L., Shenmen C.M., Schuler G.D., Altschul S.F.,
RA   Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., Hopkins R.F., Jordan H.,
RA   Moore T., Max S.I., Wang J., Hsieh F., Diatchenko L., Marusina K.,
RA   Farmer A.A., Rubin G.M., Hong L., Stapleton M., Soares M.B., Bonaldo M.F.,
RA   Casavant T.L., Scheetz T.E., Brownstein M.J., Usdin T.B., Toshiyuki S.,
RA   Carninci P., Prange C., Raha S.S., Loquellano N.A., Peters G.J.,
RA   Abramson R.D., Mullahy S.J., Bosak S.A., McEwan P.J., McKernan K.J.,
RA   Malek J.A., Gunaratne P.H., Richards S., Worley K.C., Hale S., Garcia A.M.,
RA   Gay L.J., Hulyk S.W., Villalon D.K., Muzny D.M., Sodergren E.J., Lu X.,
RA   Gibbs R.A., Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S.,
RA   Sanchez A., Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA   Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., Rodriguez A.C.,
RA   Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S., Krzywinski M.I.,
RA   Skalska U., Smailus D.E., Schnerch A., Schein J.E., Jones S.J., Marra M.A.;
RT   "Generation and initial analysis of more than 15,000 full-length human and
RT   mouse cDNA sequences";
RL   Proc. Natl. Acad. Sci. U.S.A. 99(26):16899-16903(2002).
XX
RN   [2]
RC   NIH-MGC Project URL: http://mgc.nci.nih.gov
RP   1-1913
RG   NIH MGC Project
RA   ;
RT   ;
RL   Submitted (10-MAY-2001) to the EMBL/GenBank/DDBJ databases.
RL   National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda,
RL   MD 20892-2590, USA
XX
DR   ASTD; TRAN00000052645.
DR   Ensembl-Gn; ENSG00000142698; Homo_sapiens.
DR   Ensembl-Tr; ENST00000398041; Homo_sapiens.
DR   H-InvDB; HIT000033347.
DR   ImaGenes; IMAGp958J24200Q.
DR   ImaGenes; IOH6973.
DR   ImaGenes; IRALp962G2323Q.
DR   ImaGenes; IRAUp969D0739D.
XX
CC   Contact: MGC help desk
CC   Email: cgapbs-r@mail.nih.gov
CC   Tissue Procurement: ATCC
CC   cDNA Library Preparation: Rubin Laboratory
CC   cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
CC   DNA Sequencing by: Genome Sequence Centre,
CC   BC Cancer Agency, Vancouver, BC, Canada
CC   info@bcgsc.bc.ca
CC   Martin Hirst, Thomas Zeng, Ryan Morin, Michelle Moksa, Johnson
CC   Pang, Diana Mah, Jing Wang, Kieth Fichter, Eric Chuah, Allen
CC   Delaney, Rob Kirkpatrick, Agnes Baross, Sarah Barber, Mabel
CC   Brown-John, Steve S. Chand, William Chow, Ryan Babakaiff, Dave
CC   Wong, Corey Matsuo, Jaclyn Beland, Susan Gibson, Luis delRio, Ruth
CC   Featherstone, Malachi Griffith, Obi Griffith, Ran Guin, Nancy Liao,
CC   Kim MacDonald, Mike R. Mayo, Josh Moran, Diana Palmquist, JR
CC   Santos, Duane Smailus, Jeff Stott, Miranda Tsai, George Yang,
CC   Jacquie Schein, Asim Siddiqui, Steven Jones, Rob Holt, Marco Marra.
CC   Clone distribution: MGC clone distribution information can be found
CC   through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
CC   Series: IRAL Plate: 23 Row: g Column: 23
CC   This clone was selected for full length sequencing because it
CC   passed the following selection criteria: matched mRNA gi: 41054803.
CC   Differences found between this sequence and the human reference
CC   genome (build 35) are described in misc_difference features below
CC   and these differences were also compared to chimpanzee genomic
CC   sequences available as of 09/15/2004 00:00:00.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..1913
FT                   /organism="Homo sapiens"
FT                   /lab_host="DH10B-R"
FT                   /mol_type="mRNA"
FT                   /clone_lib="NIH_MGC_17"
FT                   /clone="MGC:15882 IMAGE:3529463"
FT                   /tissue_type="Muscle, rhabdomyosarcoma"
FT                   /note="Vector: pOTB7"
FT                   /db_xref="taxon:9606"
FT   gene            1..1913
FT                   /gene="C1orf94"
FT                   /note="synonym: MGC15882"
FT   CDS             308..1534
FT                   /codon_start=1
FT                   /gene="C1orf94"
FT                   /product="C1orf94 protein"
FT                   /db_xref="GOA:Q6P1W5"
FT                   /db_xref="HGNC:28250"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q6P1W5"
FT                   /protein_id="AAH07637.1"
FT                   /translation="MPVISSRQDCDSATSTVTDILCAAEVKSSKGTEDRGRILGDSNLE
FT                   VSKLLSQFPLKSTETSKVPDNKNVLDKTRVTKDFLQDNLFSGPGPKEPTGLSPFLLLPP
FT                   RPPPARPEKLPELPAQKRQLPVFAKICSKPKADPAVERHHLMEWSPGTKEPKKGQGSLF
FT                   LSQWPQSQKDACGEEGCCDAVGTASLTLPPKKPTCPAEKNLLYEFLGATKNPSGQPRLR
FT                   NKVEVDGPELKFNAPVTVADKNNPKYTGNVFTPHFPTAMTSATLNQPLWLNLNYPPPPV
FT                   FTNHSTFLQYQGLYPQQAARMPYQQALHPQLGCYSQQVMPYNPQQMGQQIFRSSYTPLL
FT                   SYIPFVQPNYPYPQRTPPKMSANPRDPPLMAGDGPQYLFPQGYGFGSTSGGPLMHSPYF
FT                   SSSGNGINF"
FT   misc_difference 412
FT                   /gene="C1orf94"
FT                   /note="'A' in cDNA is 'G' in the human genome; no amino
FT                   acid change. The chimpanzee genome agrees with the human
FT                   genomic sequence and not the cDNA."
FT   misc_difference 440
FT                   /gene="C1orf94"
FT                   /note="'G' in cDNA is 'C' in the human genome; amino acid
FT                   difference: 'E' in cDNA, 'Q' in the human genome."
FT   misc_difference 643
FT                   /gene="C1orf94"
FT                   /note="'G' in cDNA is 'C' in the human genome; amino acid
FT                   difference: 'E' in cDNA, 'D' in the human genome. The
FT                   chimpanzee genome agrees with the human genomic sequence
FT                   and not the cDNA."
FT   misc_difference 1635
FT                   /gene="C1orf94"
FT                   /note="'A' in cDNA is 'G' in the human genome."
FT   misc_difference 1743^1744
FT                   /gene="C1orf94"
FT                   /note="6 bases in the human genome, ACACAT, are not found
FT                   in cDNA. The chimpanzee genome agrees with the cDNA
FT                   sequence, suggesting that this difference is unlikely to be
FT                   due to an artifact."
FT   misc_difference 1787
FT                   /gene="C1orf94"
FT                   /note="'G' in cDNA is 'A' in the human genome."
FT   misc_difference 1899..1913
FT                   /gene="C1orf94"
FT                   /note="polyA tail: 15 bases do not align to the human
FT                   genome."
XX
SQ   Sequence 1913 BP; 495 A; 554 C; 457 G; 407 T; 0 other;
     gagggctcag ccttcccacc agccctggag aggaggaagg gactgactac gccagcaagc        60
     tctggagctc agcagtggca aagatgagat ctccttgttg gtggaacagg agttcctaag       120
     cctcaccaaa gagcactcga tcctggtcga agagagttct ggggagctgg aggtacccgg       180
     cagctctccc gaggggacca gagagctggc tccctgcatt cttgcccctc ctctagtggc       240
     aggcagtaat gagcgcccca gagcctccat cattgtcgga gacaagcttc tgaagcagaa       300
     ggtggccatg cccgttatca gcagcaggca ggactgtgat tctgccactt ctactgtcac       360
     agacattctg tgtgccgccg aggtcaagag cagcaagggg acagaggaca gaggccgcat       420
     cctaggtgac tccaacttgg aagtcagcaa gcttctgtcc cagttcccac tgaagtccac       480
     tgagacatcc aaggtccctg acaacaagaa tgtgctggac aagacaaggg tcaccaagga       540
     cttcctacag gacaacctgt tcagtggccc tggacccaag gagcccacag ggctgagccc       600
     atttctgctg ctgcctcccc gacctcctcc tgcacgtcct gagaagctcc ctgagctccc       660
     tgctcagaag aggcagctcc cagtgtttgc caagatctgt tccaagccca aggctgaccc       720
     tgctgtggag aggcaccact tgatggaatg gagccctggc accaaggagc caaaaaaggg       780
     tcaagggagc ctctttctca gccagtggcc ccagagccag aaggacgcct gtggtgagga       840
     gggttgctgt gacgcagtgg gcaccgcatc actgaccctg ccgcccaaga aacctacatg       900
     tccagccgag aagaacttgc tctatgagtt ccttggggcc accaagaacc caagcgggca       960
     gccgagactt cgaaacaaag tggaagtgga tgggccggag ctgaaattta acgcacctgt      1020
     gacggttgct gacaagaaca acccgaagta cacagggaat gttttcactc cacactttcc      1080
     tacagccatg acctcagcaa ccctgaacca gccactctgg ctcaacctga actatccacc      1140
     tccaccagtg ttcacgaatc actctacctt cttgcagtat cagggcctgt acccacagca      1200
     ggcagcgagg atgccctatc agcaggcttt gcacccgcag ctgggatgtt actcccaaca      1260
     ggtgatgcca tacaacccac agcagatggg acagcagatc ttccgctctt cctacacccc      1320
     tctgctgagc tacatccctt ttgtccagcc caattatccc taccctcaga ggacacctcc      1380
     aaagatgtct gccaaccccc gagaccctcc cctaatggca ggagatggac cgcagtacct      1440
     ctttccccaa ggatatgggt tcggctcgac atccggaggg cccttgatgc acagccccta      1500
     tttttcttcc agtgggaatg gcataaactt ttagatctcc tcttctccct tctcctccct      1560
     tagcccttgg atcaggacta ggggctctga tttttggatt ctgcaaaagc ttggtatgaa      1620
     gtttggaaaa gcaaagttct gaccaggtca cagacaaaac agcaagacca gattcatcta      1680
     ttggccaaca ctgacacaaa aatagccctc ctcacacatg gcacaagcta cacacacaca      1740
     cacgaccctc atattcatac ttgcttgctc aaccacttat gcatctgtat ttagctaaca      1800
     tgagtgattt ttgtttttgt ttttgttggt aaaatagaag taagacactt aattttagaa      1860
     agtttgtatt ttatgataaa agtatgagct acttgaaaaa aaaaaaaaaa aaa             1913
//


  