ID   BC029252; SV 1; linear; mRNA; STD; MUS; 1020 BP.
XX
AC   BC029252;
XX
DT   25-MAY-2002 (Rel. 71, Created)
DT   24-SEP-2008 (Rel. 97, Last updated, Version 11)
XX
DE   Mus musculus ribonuclease H2, large subunit, mRNA (cDNA clone MGC:36700
DE   IMAGE:3666709), complete cds.
XX
KW   MGC.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC   Murinae; Mus; Mus.
XX
RN   [1]
RP   1-1020
RX   DOI; 10.1073/pnas.242603899.
RX   PUBMED; 12477932.
RG   Mammalian Gene Collection Program Team
RA   Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., Klausner R.D.,
RA   Collins F.S., Wagner L., Shenmen C.M., Schuler G.D., Altschul S.F.,
RA   Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., Hopkins R.F., Jordan H.,
RA   Moore T., Max S.I., Wang J., Hsieh F., Diatchenko L., Marusina K.,
RA   Farmer A.A., Rubin G.M., Hong L., Stapleton M., Soares M.B., Bonaldo M.F.,
RA   Casavant T.L., Scheetz T.E., Brownstein M.J., Usdin T.B., Toshiyuki S.,
RA   Carninci P., Prange C., Raha S.S., Loquellano N.A., Peters G.J.,
RA   Abramson R.D., Mullahy S.J., Bosak S.A., McEwan P.J., McKernan K.J.,
RA   Malek J.A., Gunaratne P.H., Richards S., Worley K.C., Hale S., Garcia A.M.,
RA   Gay L.J., Hulyk S.W., Villalon D.K., Muzny D.M., Sodergren E.J., Lu X.,
RA   Gibbs R.A., Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S.,
RA   Sanchez A., Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA   Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., Rodriguez A.C.,
RA   Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S., Krzywinski M.I.,
RA   Skalska U., Smailus D.E., Schnerch A., Schein J.E., Jones S.J., Marra M.A.;
RT   "Generation and initial analysis of more than 15,000 full-length human and
RT   mouse cDNA sequences";
RL   Proc. Natl. Acad. Sci. U.S.A. 99(26):16899-16903(2002).
XX
RN   [2]
RC   NIH-MGC Project URL: http://mgc.nci.nih.gov
RP   1-1020
RG   NIH MGC Project
RA   ;
RT   ;
RL   Submitted (01-MAY-2002) to the INSDC.
RL   National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda,
RL   MD 20892-2590, USA
XX
DR   MD5; 44f6b432b80f159f22b1623fcee934f4.
DR   Ensembl-Gn; ENSMUSG00000052926; mus_musculus.
DR   Ensembl-Gn; MGP_129S1SvImJ_G0033882; mus_musculus_129s1svimj.
DR   Ensembl-Gn; MGP_AJ_G0033865; mus_musculus_aj.
DR   Ensembl-Gn; MGP_AKRJ_G0033791; mus_musculus_akrj.
DR   Ensembl-Gn; MGP_BALBcJ_G0033858; mus_musculus_balbcj.
DR   Ensembl-Gn; MGP_C3HHeJ_G0033568; mus_musculus_c3hhej.
DR   Ensembl-Gn; MGP_C57BL6NJ_G0034378; mus_musculus_c57bl6nj.
DR   Ensembl-Gn; MGP_CASTEiJ_G0032898; mus_musculus_casteij.
DR   Ensembl-Gn; MGP_CBAJ_G0033544; mus_musculus_cbaj.
DR   Ensembl-Gn; MGP_DBA2J_G0033697; mus_musculus_dba2j.
DR   Ensembl-Gn; MGP_FVBNJ_G0033645; mus_musculus_fvbnj.
DR   Ensembl-Gn; MGP_LPJ_G0033789; mus_musculus_lpj.
DR   Ensembl-Gn; MGP_NODShiLtJ_G0033687; mus_musculus_nodshiltj.
DR   Ensembl-Gn; MGP_NZOHlLtJ_G0034395; mus_musculus_nzohlltj.
DR   Ensembl-Gn; MGP_PWKPhJ_G0032602; mus_musculus_pwkphj.
DR   Ensembl-Gn; MGP_WSBEiJ_G0033011; mus_musculus_wsbeij.
DR   Ensembl-Tr; ENSMUST00000065049; mus_musculus.
DR   Ensembl-Tr; MGP_129S1SvImJ_T0089767; mus_musculus_129s1svimj.
DR   Ensembl-Tr; MGP_AJ_T0089849; mus_musculus_aj.
DR   Ensembl-Tr; MGP_AKRJ_T0089788; mus_musculus_akrj.
DR   Ensembl-Tr; MGP_BALBcJ_T0089784; mus_musculus_balbcj.
DR   Ensembl-Tr; MGP_C3HHeJ_T0089353; mus_musculus_c3hhej.
DR   Ensembl-Tr; MGP_C57BL6NJ_T0090306; mus_musculus_c57bl6nj.
DR   Ensembl-Tr; MGP_CASTEiJ_T0089902; mus_musculus_casteij.
DR   Ensembl-Tr; MGP_CBAJ_T0089288; mus_musculus_cbaj.
DR   Ensembl-Tr; MGP_DBA2J_T0089491; mus_musculus_dba2j.
DR   Ensembl-Tr; MGP_FVBNJ_T0089337; mus_musculus_fvbnj.
DR   Ensembl-Tr; MGP_LPJ_T0089527; mus_musculus_lpj.
DR   Ensembl-Tr; MGP_NODShiLtJ_T0089358; mus_musculus_nodshiltj.
DR   Ensembl-Tr; MGP_NZOHlLtJ_T0090609; mus_musculus_nzohlltj.
DR   Ensembl-Tr; MGP_PWKPhJ_T0089327; mus_musculus_pwkphj.
DR   Ensembl-Tr; MGP_WSBEiJ_T0088399; mus_musculus_wsbeij.
XX
CC   Contact: MGC help desk
CC   Email: cgapbs-r@mail.nih.gov
CC   Tissue Procurement: Lothar Hennighausen Ph.D., Robin Humphreys
CC   cDNA Library Preparation: Life Technologies, Inc.
CC   cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
CC   DNA Sequencing by: Baylor College of Medicine Human Genome
CC   Sequencing Center
CC   Center code: BCM-HGSC
CC   Web site: http://www.hgsc.bcm.tmc.edu/cdna/
CC   Contact: amg@bcm.tmc.edu
CC   Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H.,
CC   Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati,
CC   A.N., Gibbs, R.A.
CC   Clone distribution: MGC clone distribution information can be found
CC   through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
CC   Series: IRAK Plate: 61 Row: d Column: 12
CC   This clone was selected for full length sequencing because it
CC   passed the following selection criteria: Hexamer frequency ORF
CC   analysis.
CC   Differences found between this sequence and the mouse C57BL/6J
CC   genome (build 36) are described in misc_difference features below.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..1020
FT                   /organism="Mus musculus"
FT                   /lab_host="DH10B"
FT                   /strain="mix FVB/N, C57BL/6J"
FT                   /mol_type="mRNA"
FT                   /clone_lib="NCI_CGAP_Mam5"
FT                   /clone="MGC:36700 IMAGE:3666709"
FT                   /tissue_type="Mammary tumor. WAP-TGF alpha model. 7 months
FT                   old, gross tissue."
FT                   /note="Vector: pCMV-SPORT6"
FT                   /db_xref="taxon:10090"
FT   gene            1..1020
FT                   /gene="Rnaseh2a"
FT                   /note="synonyms: RNHL, RNHIA, RNASEHI"
FT   CDS             56..820
FT                   /codon_start=1
FT                   /gene="Rnaseh2a"
FT                   /product="Rnaseh2a protein"
FT                   /db_xref="GOA:Q05C51"
FT                   /db_xref="InterPro:IPR001352"
FT                   /db_xref="InterPro:IPR004649"
FT                   /db_xref="InterPro:IPR012337"
FT                   /db_xref="InterPro:IPR023160"
FT                   /db_xref="InterPro:IPR024567"
FT                   /db_xref="InterPro:IPR036397"
FT                   /db_xref="MGI:MGI:1916974"
FT                   /db_xref="UniProtKB/TrEMBL:Q05C51"
FT                   /protein_id="AAH29252.1"
FT                   /translation="MDLSELERDNTGRCRLSSPVPAVCLKEPCVLGVDEAGRGPVLGPM
FT                   VYAICYCPLSRLADLEALKVADSKTLTENERERLFAKMEEDGDFVGWALDVLSPNLIST
FT                   SMLGRVKYNLNSLSHDTAAGLIQYALDQNVNVTQVFVDTVGMPETYQARLQQHFPGIEV
FT                   TVKAKADSLFPVVSAASIFAKVARDKAVKNWQFVENLQDLDSDYGSGYPNDPKTKAWLR
FT                   KHVDPVFGFPQFVRFSWSTAQAILEKEAEDVI"
FT   misc_difference 241
FT                   /gene="Rnaseh2a"
FT                   /note="'T' in cDNA is 'C' in the mouse genome."
FT   misc_difference 421
FT                   /gene="Rnaseh2a"
FT                   /note="'A' in cDNA is 'T' in the mouse genome."
FT   misc_difference 748
FT                   /gene="Rnaseh2a"
FT                   /note="'C' in cDNA is 'T' in the mouse genome."
FT   misc_difference 959
FT                   /gene="Rnaseh2a"
FT                   /note="'A' in cDNA is 'G' in the mouse genome."
FT   misc_difference 1007..1020
FT                   /gene="Rnaseh2a"
FT                   /note="polyA tail: 14 bases do not align to the mouse
FT                   genome."
XX
SQ   Sequence 1020 BP; 246 A; 264 C; 290 G; 220 T; 0 other;
     gggacctgcg cttgcagtgt tgttttctac aattgttggc tgtcgcagag gcagcatgga        60
     tctcagcgag ctggagaggg acaatacggg tcgttgtcgt ctgagttctc ctgtacctgc       120
     tgtgtgtctc aaggagccgt gcgttctggg cgtggatgaa gcgggccgag gccctgtgct       180
     tggtcccatg gtctacgcca tctgttactg ccccctgtct cgcttggcag atctggaggc       240
     tctgaaagtg gcagactcta agaccttgac agagaacgag cgggagaggc tctttgcgaa       300
     aatggaggag gatggagact ttgtgggttg ggctttggac gtcctgtctc caaacctgat       360
     ctctaccagc atgcttgggc gagtcaagta caacctcaac tccctgtcac acgatacagc       420
     agcggggctg atacagtacg cactggacca gaatgtgaat gtcactcagg tatttgtgga       480
     cactgtagga atgccagaga cataccaggc tcgattacaa cagcactttc ccgggataga       540
     ggtgacagtc aaggccaaag ctgactccct gttccctgtg gtcagtgctg ccagcatctt       600
     tgccaaggtg gcccgagaca aggctgtgaa gaactggcag tttgtggaaa atttacagga       660
     tctggactcc gattatggct caggctatcc caatgatccc aagaccaaag cctggctgag       720
     gaaacatgtg gaccctgtgt ttggcttccc ccagtttgta cggttcagtt ggagcacagc       780
     ccaggccatc ctggagaagg aggcagaaga tgtcatttga agctgaggag gatcctgaaa       840
     gaccgggaaa aatcacctcc tacttcagcc agggcccgca gacctgccgc cctcaggccc       900
     cccacagata cttccaggag cgaggcctgg aggcagccag cagcctctag gagcccacat       960
     gtgcaccacc tgcccttcta acccaggaat aaaagctgtt caagaaaaaa aaaaaaaaaa      1020
//