Dbfetch

ID   BC052743; SV 1; linear; mRNA; STD; MUS; 5075 BP.
XX
AC   BC052743;
XX
DT   21-MAY-2003 (Rel. 75, Created)
DT   24-SEP-2008 (Rel. 97, Last updated, Version 8)
XX
DE   Mus musculus protein tyrosine phosphatase, receptor type, O, mRNA (cDNA
DE   clone MGC:63404 IMAGE:6834312), complete cds.
XX
KW   MGC.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC   Murinae; Mus; Mus.
XX
RN   [1]
RP   1-5075
RX   DOI; 10.1073/pnas.242603899.
RX   PUBMED; 12477932.
RG   Mammalian Gene Collection Program Team
RA   Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., Klausner R.D.,
RA   Collins F.S., Wagner L., Shenmen C.M., Schuler G.D., Altschul S.F.,
RA   Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., Hopkins R.F., Jordan H.,
RA   Moore T., Max S.I., Wang J., Hsieh F., Diatchenko L., Marusina K.,
RA   Farmer A.A., Rubin G.M., Hong L., Stapleton M., Soares M.B., Bonaldo M.F.,
RA   Casavant T.L., Scheetz T.E., Brownstein M.J., Usdin T.B., Toshiyuki S.,
RA   Carninci P., Prange C., Raha S.S., Loquellano N.A., Peters G.J.,
RA   Abramson R.D., Mullahy S.J., Bosak S.A., McEwan P.J., McKernan K.J.,
RA   Malek J.A., Gunaratne P.H., Richards S., Worley K.C., Hale S., Garcia A.M.,
RA   Gay L.J., Hulyk S.W., Villalon D.K., Muzny D.M., Sodergren E.J., Lu X.,
RA   Gibbs R.A., Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S.,
RA   Sanchez A., Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA   Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., Rodriguez A.C.,
RA   Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S., Krzywinski M.I.,
RA   Skalska U., Smailus D.E., Schnerch A., Schein J.E., Jones S.J., Marra M.A.;
RT   "Generation and initial analysis of more than 15,000 full-length human and
RT   mouse cDNA sequences";
RL   Proc. Natl. Acad. Sci. U.S.A. 99(26):16899-16903(2002).
XX
RN   [2]
RC   NIH-MGC Project URL: http://mgc.nci.nih.gov
RP   1-5075
RG   NIH MGC Project
RA   ;
RT   ;
RL   Submitted (19-MAY-2003) to the INSDC.
RL   National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda,
RL   MD 20892-2590, USA
XX
DR   MD5; d14bb82944d121fe8f831602c389c84b.
DR   Ensembl-Gn; ENSMUSG00000030223; mus_musculus.
DR   Ensembl-Tr; ENSMUST00000077115; mus_musculus.
XX
CC   Contact: MGC help desk
CC   Email: cgapbs-r@mail.nih.gov
CC   Tissue Procurement: Dr. Jim Lin, University of Iowa
CC   cDNA Library Preparation: M. Bento Soares, University of Iowa
CC   cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
CC   DNA Sequencing by: University of Iowa, Dr. M. Bento Soares and Dr.
CC   Thomas L. Casavant.
CC   Web site: http://genome.uiowa.edu
CC   Contact: bento-soares@uiowa.edu; tom-casavant@uiowa.edu
CC   Bonaldo,M.F., Akabogu,I.,  Bair,T., Bair,J., Crouch,K., Davis,A.,
CC   Fishler,K., Keppel,C., Kucaba,T., Lebeck,M., Melo,A., Schaefer,K.,
CC   Scheetz,T., Smith,C., Snir,E., Tack,D., Trout,K., Walters,J.,
CC   Casavant,T., Soares,M.B.
CC   Clone distribution: MGC clone distribution information can be found
CC   through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
CC   Series:  Plate:  Row:  Column: 0
CC   This clone was selected for full length sequencing because it
CC   passed the following selection criteria: matched mRNA gi: 47059068.
CC   Differences found between this sequence and the mouse C57BL/6J
CC   genome (build 36) are described in misc_difference features below.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..5075
FT                   /organism="Mus musculus"
FT                   /lab_host="DH10B"
FT                   /strain="C57BL/6"
FT                   /mol_type="mRNA"
FT                   /clone_lib="NIH_BMAP_FY0"
FT                   /clone="MGC:63404 IMAGE:6834312"
FT                   /tissue_type="Brain, mouse, 13.5,14.5,16.5,17.5 dpc"
FT                   /note="Vector: pYX-ASC"
FT                   /db_xref="taxon:10090"
FT   gene            1..5075
FT                   /gene="Ptpro"
FT                   /note="synonyms: GLEPP1, PTP-BK, PTP-U2, PTPROt, PTP-phi"
FT   CDS             222..3902
FT                   /codon_start=1
FT                   /gene="Ptpro"
FT                   /product="protein tyrosine phosphatase, receptor type, O"
FT                   /db_xref="GOA:E9Q612"
FT                   /db_xref="InterPro:IPR000242"
FT                   /db_xref="InterPro:IPR000387"
FT                   /db_xref="InterPro:IPR003595"
FT                   /db_xref="InterPro:IPR003961"
FT                   /db_xref="InterPro:IPR013783"
FT                   /db_xref="InterPro:IPR016130"
FT                   /db_xref="InterPro:IPR029021"
FT                   /db_xref="MGI:MGI:1097152"
FT                   /db_xref="UniProtKB/Swiss-Prot:E9Q612"
FT                   /protein_id="AAH52743.1"
FT                   /translation="MGHLPRGTLGGRRLLPLLGLFVLLKIVTTFHVAVQDDNNIVVSLE
FT                   ASDIVSPASVYVVRVAGESKNYFFEFEEFNSTLPPPVVFKATYHGLYYIITLVVVNGNV
FT                   VTKPSRSITVLTKPLPVTSVSIYDYKPSPETGVLFEIHYPEKYNVFSRVNISYWEGRDF
FT                   RTMLYKDFFKGKTVFNHWLPGLCYSNITFQLVSEATFNKSTLVEYSGVSHEPKQHRTAP
FT                   YPPRNISVRFVNLNKNNWEEPSGSFPEDSFIKPPQDSIGRDRRFHFPEETPETPPSNVS
FT                   SGSPPSNVSSAWPDPNSTDYESTSQPFWWDSASAAPENEEDFVSALPADYDTETTLDRT
FT                   EKPTADPFSAFPVQMTLSWLPPKPPTAFDGFNILIEREENFTDYLTVDEEAHEFVAELK
FT                   EPGKYKLSVTTFSSSGACETRKSQSAKSLSFYISPTGEWIEELTEKPQHVSVHVLSSTT
FT                   ALMSWTSSQENYNSTIVSVVSLTCQKQKESQRLEKQYCTQVNSSKPVIENLVPGAQYQV
FT                   VMYLRKGPLIGPPSDPVTFAIVPTGIKDLMLYPLGPTAVVLSWTRPILGVFRKYVVEMF
FT                   YFNPTTMTSEWTTYYEIAATVSLTASVRIASLLPAWYYNFRVTMVTWGDPELSCCDSST
FT                   ISFITAPVAPEITSVEYFNSLLYISWTYGDATTDLSHSRMLHWMVVAEGRKKIKKSVTR
FT                   NVMTAILSLPPGDIYNLSVTACTERGSNTSLPRLVKLEPAPPKSLFAVNKTQTSVTLLW
FT                   VEEGVADFFEVFCQQLGSGHNGKLQEPVAVSSHVVTISSLLPATAYNCSVTSFSHDTPS
FT                   VPTFIAVSTMVTEVNPNVVVISVLAILSTLLIGLLLVTLVILRKKHLQMARECGAGTFV
FT                   NFASLEREGKLPYSWRRSVFALLTLLPSCLWTDYLLAFYINPWSKNGLKKRKLTNPVQL
FT                   DDFDSYIKDMAKDSDYKFSLQFEELKLIGLDIPHFAADLPLNRCKNRYTNILPYDFSRV
FT                   RLVSMNEEEGADYINANYIPGYNSPQEYIATQGPLPETRNDFWKMVLQQKSHIIVMLTQ
FT                   CNEKRRVKCDHYWPFTEEPIAYGDITVEMVSEEEEEDWASRHFRINYADEAQDVMHFNY
FT                   TAWPDHGVPPANAAESILQFVFTVRQQAAKSKGPMIIHCSAGVGRTGTFIALDRLLQHI
FT                   RDHEFVDILGLVSEMRSHRMSMVQTEEQYIFIHQCVQLMWLRKKQQFCISDVIYENVSK
FT                   S"
FT   misc_difference 3771
FT                   /gene="Ptpro"
FT                   /note="'C' in cDNA is 'T' in the mouse genome; amino acid
FT                   difference: 'H' in cDNA, 'Y' in the mouse genome."
FT   misc_difference 5056..5075
FT                   /gene="Ptpro"
FT                   /note="polyA tail: 20 bases do not align to the mouse
FT                   genome."
XX
SQ   Sequence 5075 BP; 1322 A; 1282 C; 1220 G; 1251 T; 0 other;
     cattaaacaa cgccggctcc aagcacgttc tagcaggact cgggcacaga agaggagagc        60
     gatccatccg ggggcacagg acgacctgcc aggcgacatg cggccgcccg gagccgcccg       120
     ggcgcccgaa gtctgaggct gccagccggg actggtcatt gtaagcgcca cggagaactg       180
     ctggcgctgc cgttccctcc ccggtccctg gtcgcgccgc gatggggcac ctgcctaggg       240
     gaacgctcgg gggccgccgc ctgctacctc tgctcgggct ctttgtgctg ctcaagattg       300
     ttacgacgtt ccacgtggct gtgcaagatg acaacaatat cgttgtgtct ttagaagctt       360
     ctgatatagt cagcccagca tctgtgtatg ttgtgagggt agctggcgaa tcaaaaaact       420
     atttcttcga atttgaggaa tttaacagca cattgcctcc tcctgtggtc tttaaggcca       480
     catatcacgg tctttattac ataatcactc tggtggtagt caacgggaat gtggtcacca       540
     aaccatccag atcaatcacc gtgttgacaa aacccttgcc tgtaaccagt gtgtctatct       600
     atgactataa accttctcct gagacaggag tcctgtttga aatccattat ccggaaaaat       660
     acaatgtgtt cagcagagtg aacatcagct actgggaagg gagggacttc aggacgatgc       720
     tgtacaaaga tttctttaag gggaaaacgg tgtttaatca ctggctacca ggactatgtt       780
     acagtaacat cactttccag ctggtatcgg aggcaacttt taataaaagt acccttgtgg       840
     agtacagtgg tgtgagccat gaacccaaac agcacagaac agcaccatat ccacctcgaa       900
     acatctctgt tcgctttgtc aacttgaaca agaacaactg ggaggagccg agcgggagct       960
     tccccgagga ctcgttcatc aaaccaccgc aagattcaat aggaagagac agacgcttcc      1020
     atttccccga agaaactccg gagactcccc ccagcaatgt gtcctccggt tctcccccca      1080
     gcaatgtgtc ctccgcttgg cctgacccga atagcacaga ctatgaaagc acatcacagc      1140
     ccttctggtg ggacagtgca tccgcggccc ctgaaaacga ggaggacttt gtcagtgcgc      1200
     tgccagcaga ctatgacact gagaccacac tcgataggac agagaagccc acagccgacc      1260
     ctttctctgc cttccctgtg cagatgactc tgagctggtt accacccaaa ccgcccacag      1320
     cctttgatgg cttcaatatc ctcatagaga gggaagagaa ctttactgac tatttgacag      1380
     tggatgaaga agcccatgaa ttcgttgcag aactgaagga gcctgggaaa tacaaactct      1440
     cagtgacaac ctttagctcc tcgggggcct gtgagactcg gaaaagccag tcagcaaaat      1500
     cgctcagctt ctacatcagc cccacaggcg agtggattga agaactgacc gagaaacctc      1560
     agcatgtgag tgtccacgtc ttaagctcaa ccaccgcctt gatgtcgtgg acatcttctc      1620
     aggagaacta caacagcacc attgtgtctg tggtgtcctt gacctgccag aaacagaagg      1680
     agagccagcg gctggagaag cagtattgta cccaggtgaa ctcaagcaaa cctgtaattg      1740
     agaacctggt tcctggtgcc cagtaccagg ttgtgatgta cttaagaaaa ggccctttga      1800
     ttgggccacc ttccgatcct gtgacatttg cgattgttcc cactgggatc aaagatttaa      1860
     tgctctaccc cttgggtccc actgctgtcg tgctgagctg gacccgacct atcctgggag      1920
     tcttcagaaa atacgtggtt gaaatgttct acttcaaccc caccaccatg acctcagagt      1980
     ggacgaccta ctatgagata gcagccacag tttccttaac ggcatccgtg agaatagcaa      2040
     gcctattgcc agcgtggtac tacaacttcc gcgtaaccat ggtgacatgg ggagatccag      2100
     agctgagctg ttgtgacagt tccaccatca gcttcataac agcccccgtt gctccagaaa      2160
     tcacgtctgt ggagtacttc aacagcctgc tgtacatcag ttggacctat ggggatgcca      2220
     ccactgacct gtcccactct agaatgctgc actggatggt cgttgcagaa gggaggaaga      2280
     aaattaaaaa gagtgtgaca cgcaatgtca tgacggccat ccttagcctg cctccaggag      2340
     atatctacaa tctgtctgtc acggcctgca ctgagagagg gagcaacacc tccttgcccc      2400
     gccttgtcaa gctcgaacca gcccctccga agtcactctt cgcagtgaac aaaacacaga      2460
     cgtcagtgac cctgctgtgg gttgaggagg gtgttgctga tttctttgaa gtcttctgtc      2520
     agcagctcgg ctctggccac aatggcaaac tccaggagcc agtagctgtg tcgtcccacg      2580
     tggtgaccat ctccagcctc ctcccagcca ctgcctacaa ctgcagtgtc accagcttca      2640
     gccacgacac tcccagtgtc cctacattca tagctgtctc cacaatggtt acagaggtga      2700
     accctaacgt ggtggtgatc tcggtgctgg ccatcctcag cacactttta attggactgc      2760
     tcttagtgac ccttgtcatt ctccgaaaga agcacctgca gatggctagg gaatgtggag      2820
     ctggcacgtt tgttaacttt gcatccttag aaagggaagg gaaactcccc tacagttggc      2880
     gtaggagtgt gtttgctctc ttaaccctgc tgccttcatg tctttggact gactatcttt      2940
     tggcatttta tattaaccct tggagtaaaa atggcttaaa gaagaggaaa ctaacaaacc      3000
     ccgttcagct ggatgacttc gattcttaca tcaaggatat ggccaaggac tcggactata      3060
     aattctctct tcagtttgag gagttgaagt tgattggact ggatattccg cactttgctg      3120
     cagatctacc gctgaaccga tgtaaaaacc gctacacaaa catcctgccg tatgacttta      3180
     gccgggtgag gctagtctcc atgaacgaag aggaaggagc agactacatt aatgccaact      3240
     atattcctgg atacaactca ccccaggagt acattgccac ccagggtccc ctgccagaaa      3300
     ccagaaatga cttctggaag atggtcctac aacagaagtc ccacatcatc gtcatgctca      3360
     ctcaatgcaa tgagaaaagg agggtaaaat gtgaccacta ctggccattc acagaagaac      3420
     ccattgctta tggggacatc accgtggaga tggtctctga ggaagaggag gaggactggg      3480
     ccagtagaca cttccggatc aactatgcgg acgaagcgca ggacgtgatg cattttaact      3540
     acacagcctg gcccgatcac ggcgtgcctc cagcaaacgc cgccgagagc atcctgcagt      3600
     ttgtgttcac agtgcgacag caagccgcca agagcaaagg gcccatgatc atccactgca      3660
     gtgcgggtgt gggacggaca ggaaccttca ttgccctgga caggctcctg caacacattc      3720
     gagatcatga atttgtggac atcttagggc tggtatcaga gatgcgctca caccgaatgt      3780
     caatggtaca gacagaggag cagtacattt ttatccatca gtgtgtgcag ctgatgtggc      3840
     tgaggaagaa gcaacagttc tgcatcagcg acgtcatcta cgagaacgtc agcaaatcct      3900
     agttcggaat ctggagcggt gaggacatga tacctgcgca tcctcccttg cttccagagt      3960
     ttttataggg gcttttagtc actttgctaa atcgagtccc tgctgtgcag tatatggaca      4020
     aggaggagat ttacctccta gaaccaagaa gaccttaagg agcctacagt atcatttgga      4080
     gtcttttcac ttctagatgt ttgatggacc aaattcagta gaattccaga aaggtcacca      4140
     atgctccttg aggggcagga agcgcaatac ttcctgatgg ctcagttggc tctttgctgc      4200
     tcgggttggg ttgatttttt ttttaagtgc aataatttct gtataattgt gattttttac      4260
     attcacaatt cagaatgttg aattgttatc tgcccacacc atcgatcagt gccacagcct      4320
     tggaaataac caacatcatt acggagtttt atctcatccc tggaaaggca tggctacaca      4380
     ggaagacgtg gtgatgaatc tccttcaaag aaactgagcg agtgtgctcc tcgctccaac      4440
     cttgaagcta cgggtcagga gtaggcaggc agagggaaga gagcggttct actcacccat      4500
     cagccagctt atcttttcct atttcaaata tgaaaacctg tgtttcaaag tagagtagga      4560
     aaaacattat atgtctgact tgtcatagtg ggtttcttcc ttgttgacag ttgggagttt      4620
     cttccgtggc tcttttgaac taatgtcacg gtcctttttg aacagccaga gttgattcaa      4680
     cagttttgag tctgactctg aactctgaga tatcttggtg cctagccttc acgtttattt      4740
     ttctctgtgt ccacctgtgt acacaaaaac agtttctcca agagctatag gctgtaaaaa      4800
     tgcactttct ttctccaccc agaggagctg ggaattcaga agtcacgaca acaacaaact      4860
     cagcatgtaa ggacaggtat tggacaccat cccttaagaa cattgttgag tgtctctatg      4920
     gattccgaca ggagattctc tgggtcttct ttgcatttct gttgtcagag tcattttaac      4980
     ctgtgtagct agtggcatta tattctttgg attttgtatg attaaagtac atgattgtgt      5040
     gttgtgacca tgaaaaaaaa aaaaaaaaaa aaaaa                                 5075
//