Dbfetch

ID   BC038158; SV 1; linear; mRNA; STD; MUS; 1080 BP.
XX
AC   BC038158;
XX
DT   24-SEP-2002 (Rel. 73, Created)
DT   24-SEP-2008 (Rel. 97, Last updated, Version 12)
XX
DE   Mus musculus ribonuclease H2, large subunit, mRNA (cDNA clone MGC:48163
DE   IMAGE:1493985), complete cds.
XX
KW   MGC.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC   Murinae; Mus; Mus.
XX
RN   [1]
RP   1-1080
RX   DOI; 10.1073/pnas.242603899.
RX   PUBMED; 12477932.
RG   Mammalian Gene Collection Program Team
RA   Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., Klausner R.D.,
RA   Collins F.S., Wagner L., Shenmen C.M., Schuler G.D., Altschul S.F.,
RA   Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., Hopkins R.F., Jordan H.,
RA   Moore T., Max S.I., Wang J., Hsieh F., Diatchenko L., Marusina K.,
RA   Farmer A.A., Rubin G.M., Hong L., Stapleton M., Soares M.B., Bonaldo M.F.,
RA   Casavant T.L., Scheetz T.E., Brownstein M.J., Usdin T.B., Toshiyuki S.,
RA   Carninci P., Prange C., Raha S.S., Loquellano N.A., Peters G.J.,
RA   Abramson R.D., Mullahy S.J., Bosak S.A., McEwan P.J., McKernan K.J.,
RA   Malek J.A., Gunaratne P.H., Richards S., Worley K.C., Hale S., Garcia A.M.,
RA   Gay L.J., Hulyk S.W., Villalon D.K., Muzny D.M., Sodergren E.J., Lu X.,
RA   Gibbs R.A., Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S.,
RA   Sanchez A., Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA   Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., Rodriguez A.C.,
RA   Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S., Krzywinski M.I.,
RA   Skalska U., Smailus D.E., Schnerch A., Schein J.E., Jones S.J., Marra M.A.;
RT   "Generation and initial analysis of more than 15,000 full-length human and
RT   mouse cDNA sequences";
RL   Proc. Natl. Acad. Sci. U.S.A. 99(26):16899-16903(2002).
XX
RN   [2]
RC   NIH-MGC Project URL: http://mgc.nci.nih.gov
RP   1-1080
RG   NIH MGC Project
RA   ;
RT   ;
RL   Submitted (20-SEP-2002) to the INSDC.
RL   National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda,
RL   MD 20892-2590, USA
XX
DR   MD5; 15d169d71eff48b267c112afbd4b233f.
DR   Ensembl-Gn; ENSMUSG00000052926; mus_musculus.
DR   Ensembl-Gn; MGP_129S1SvImJ_G0033882; mus_musculus_129s1svimj.
DR   Ensembl-Gn; MGP_AJ_G0033865; mus_musculus_aj.
DR   Ensembl-Gn; MGP_AKRJ_G0033791; mus_musculus_akrj.
DR   Ensembl-Gn; MGP_BALBcJ_G0033858; mus_musculus_balbcj.
DR   Ensembl-Gn; MGP_C3HHeJ_G0033568; mus_musculus_c3hhej.
DR   Ensembl-Gn; MGP_C57BL6NJ_G0034378; mus_musculus_c57bl6nj.
DR   Ensembl-Gn; MGP_CASTEiJ_G0032898; mus_musculus_casteij.
DR   Ensembl-Gn; MGP_CBAJ_G0033544; mus_musculus_cbaj.
DR   Ensembl-Gn; MGP_DBA2J_G0033697; mus_musculus_dba2j.
DR   Ensembl-Gn; MGP_FVBNJ_G0033645; mus_musculus_fvbnj.
DR   Ensembl-Gn; MGP_LPJ_G0033789; mus_musculus_lpj.
DR   Ensembl-Gn; MGP_NODShiLtJ_G0033687; mus_musculus_nodshiltj.
DR   Ensembl-Gn; MGP_NZOHlLtJ_G0034395; mus_musculus_nzohlltj.
DR   Ensembl-Gn; MGP_PWKPhJ_G0032602; mus_musculus_pwkphj.
DR   Ensembl-Gn; MGP_WSBEiJ_G0033011; mus_musculus_wsbeij.
DR   Ensembl-Tr; ENSMUST00000109736; mus_musculus.
DR   Ensembl-Tr; ENSMUST00000109738; mus_musculus.
DR   Ensembl-Tr; ENSMUST00000147812; mus_musculus.
DR   Ensembl-Tr; MGP_129S1SvImJ_T0089767; mus_musculus_129s1svimj.
DR   Ensembl-Tr; MGP_AJ_T0089849; mus_musculus_aj.
DR   Ensembl-Tr; MGP_AKRJ_T0089788; mus_musculus_akrj.
DR   Ensembl-Tr; MGP_BALBcJ_T0089784; mus_musculus_balbcj.
DR   Ensembl-Tr; MGP_C3HHeJ_T0089353; mus_musculus_c3hhej.
DR   Ensembl-Tr; MGP_C57BL6NJ_T0090306; mus_musculus_c57bl6nj.
DR   Ensembl-Tr; MGP_CASTEiJ_T0089902; mus_musculus_casteij.
DR   Ensembl-Tr; MGP_CBAJ_T0089288; mus_musculus_cbaj.
DR   Ensembl-Tr; MGP_DBA2J_T0089491; mus_musculus_dba2j.
DR   Ensembl-Tr; MGP_FVBNJ_T0089337; mus_musculus_fvbnj.
DR   Ensembl-Tr; MGP_LPJ_T0089527; mus_musculus_lpj.
DR   Ensembl-Tr; MGP_NODShiLtJ_T0089358; mus_musculus_nodshiltj.
DR   Ensembl-Tr; MGP_NZOHlLtJ_T0090609; mus_musculus_nzohlltj.
DR   Ensembl-Tr; MGP_PWKPhJ_T0089327; mus_musculus_pwkphj.
DR   Ensembl-Tr; MGP_WSBEiJ_T0088399; mus_musculus_wsbeij.
XX
CC   Contact: MGC help desk
CC   Email: cgapbs-r@mail.nih.gov
CC   Tissue Procurement: Marcello Bento Soares, Ph.D.
CC   cDNA Library Preparation: M. Bento Soares, University of Iowa
CC   cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
CC   DNA Sequencing by: Institute for Systems Biology
CC   http://www.systemsbiology.org
CC   contact: amadan@systemsbiology.org
CC   Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
CC   Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
CC   Clone distribution: MGC clone distribution information can be found
CC   through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
CC   Series: IRAK Plate: 83 Row: i Column: 16.
CC   Differences found between this sequence and the mouse C57BL/6J
CC   genome (build 36) are described in misc_difference features below.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..1080
FT                   /organism="Mus musculus"
FT                   /lab_host="DH10B"
FT                   /strain="C57BL/6J"
FT                   /mol_type="mRNA"
FT                   /clone_lib="Soares_mammary_gland_NbMMG"
FT                   /clone="MGC:48163 IMAGE:1493985"
FT                   /tissue_type="Mammary gland"
FT                   /note="Vector: pT7T3-Pac"
FT                   /db_xref="taxon:10090"
FT   gene            1..1080
FT                   /gene="Rnaseh2a"
FT                   /note="synonyms: RNHL, RNHIA, RNASEHI"
FT   misc_difference 17
FT                   /gene="Rnaseh2a"
FT                   /note="'G' in cDNA is 'A' in the mouse genome."
FT   CDS             100..1005
FT                   /codon_start=1
FT                   /gene="Rnaseh2a"
FT                   /product="ribonuclease H2, large subunit"
FT                   /db_xref="GOA:Q9CWY8"
FT                   /db_xref="InterPro:IPR001352"
FT                   /db_xref="InterPro:IPR004649"
FT                   /db_xref="InterPro:IPR012337"
FT                   /db_xref="InterPro:IPR023160"
FT                   /db_xref="InterPro:IPR024567"
FT                   /db_xref="MGI:MGI:1916974"
FT                   /db_xref="PDB:3KIO"
FT                   /db_xref="PDB:3P5J"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q9CWY8"
FT                   /protein_id="AAH38158.1"
FT                   /translation="MDLSELERDNTGRCRLSSPVPAVCLKEPCVLGVDEAGRGPVLGPM
FT                   VYAICYCPLSRLADLEALKVADSKTLTENERERLFAKMEEDGDFVGWALDVLSPNLIST
FT                   SMLGRVKYNLNSLSHDTAAGLIQYALDQNVNVTQVFVDTVGMPETYQARLQQHFPGIEV
FT                   TVKAKADSLFPVVSAASIFAKVARDKAVKNWQFVENLQDLDSDYGSGYPNDPKTKAWLR
FT                   KHVDPVFGFPQFVRFSWSTAQAILEKEAEDVIWEDSEAEEDPERPGKITSYFSQGPQTC
FT                   RPQAPHRYFQERGLEAASSL"
FT   misc_difference 285
FT                   /gene="Rnaseh2a"
FT                   /note="'T' in cDNA is 'C' in the mouse genome; no amino
FT                   acid change."
FT   misc_difference 465
FT                   /gene="Rnaseh2a"
FT                   /note="'A' in cDNA is 'T' in the mouse genome; no amino
FT                   acid change."
FT   misc_difference 792
FT                   /gene="Rnaseh2a"
FT                   /note="'C' in cDNA is 'T' in the mouse genome; no amino
FT                   acid change."
FT   misc_difference 1014
FT                   /gene="Rnaseh2a"
FT                   /note="'A' in cDNA is 'G' in the mouse genome."
FT   misc_difference 1062..1080
FT                   /gene="Rnaseh2a"
FT                   /note="polyA tail: 19 bases do not align to the mouse
FT                   genome."
XX
SQ   Sequence 1080 BP; 264 A; 279 C; 308 G; 229 T; 0 other;
     gtcagcatca agagccgctg cagttttcgc ggaaaacgcg cgctgggacc tgcgcttgca        60
     gtgttgtttt ctacaattgt tggctgtcgc agaggcagca tggatctcag cgagctggag       120
     agggacaata cgggtcgttg tcgtctgagt tctcctgtac ctgctgtgtg tctcaaggag       180
     ccgtgcgttc tgggcgtgga tgaagcgggc cgaggccctg tgcttggtcc catggtctac       240
     gccatctgtt actgccccct gtctcgcttg gcagatctgg aggctctgaa agtggcagac       300
     tctaagacct tgacagagaa cgagcgggag aggctctttg cgaaaatgga ggaggatgga       360
     gactttgtgg gttgggcttt ggacgtcctg tctccaaacc tgatctctac cagcatgctt       420
     gggcgagtca agtacaacct caactccctg tcacacgata cagcagcggg gctgatacag       480
     tacgcactgg accagaatgt gaatgtcact caggtatttg tggacactgt aggaatgcca       540
     gagacatacc aggctcgatt acaacagcac tttcccggga tagaggtgac agtcaaggcc       600
     aaagctgact ccctgttccc tgtggtcagt gctgccagca tctttgccaa ggtggcccga       660
     gacaaggctg tgaagaactg gcagtttgtg gaaaatttac aggatctgga ctccgattat       720
     ggctcaggct atcccaatga tcccaagacc aaagcctggc tgaggaaaca tgtggaccct       780
     gtgtttggct tcccccagtt tgtacggttc agttggagca cagcccaggc catcctggag       840
     aaggaggcag aagatgtcat ttgggaggac tcagaagctg aggaggatcc tgaaagaccg       900
     ggaaaaatca cctcctactt cagccagggc ccgcagacct gccgccctca ggccccccac       960
     agatacttcc aggagcgagg cctggaggca gccagcagcc tctaggagcc cacatgtgca      1020
     ccacctgccc ttctaaccca ggaataaaag ctgttcaaga aaaaaaaaaa aaaaaaaaaa      1080
//