spacer

EBI Dbfetch

ID   BC044848; SV 1; linear; mRNA; STD; MUS; 3611 BP.
XX
AC   BC044848;
XX
DT   30-JAN-2003 (Rel. 74, Created)
DT   24-SEP-2008 (Rel. 97, Last updated, Version 6)
XX
DE   Mus musculus desmoglein 2, mRNA (cDNA clone IMAGE:5355674), complete cds.
XX
KW   .
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi; Muroidea;
OC   Muridae; Murinae; Mus; Mus.
XX
RN   [1]
RP   1-3611
RX   DOI; 10.1073/pnas.242603899.
RX   PUBMED; 12477932.
RG   Mammalian Gene Collection Program Team
RA   Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., Klausner R.D.,
RA   Collins F.S., Wagner L., Shenmen C.M., Schuler G.D., Altschul S.F.,
RA   Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., Hopkins R.F., Jordan H.,
RA   Moore T., Max S.I., Wang J., Hsieh F., Diatchenko L., Marusina K.,
RA   Farmer A.A., Rubin G.M., Hong L., Stapleton M., Soares M.B., Bonaldo M.F.,
RA   Casavant T.L., Scheetz T.E., Brownstein M.J., Usdin T.B., Toshiyuki S.,
RA   Carninci P., Prange C., Raha S.S., Loquellano N.A., Peters G.J.,
RA   Abramson R.D., Mullahy S.J., Bosak S.A., McEwan P.J., McKernan K.J.,
RA   Malek J.A., Gunaratne P.H., Richards S., Worley K.C., Hale S., Garcia A.M.,
RA   Gay L.J., Hulyk S.W., Villalon D.K., Muzny D.M., Sodergren E.J., Lu X.,
RA   Gibbs R.A., Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S.,
RA   Sanchez A., Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA   Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., Rodriguez A.C.,
RA   Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S., Krzywinski M.I.,
RA   Skalska U., Smailus D.E., Schnerch A., Schein J.E., Jones S.J., Marra M.A.;
RT   "Generation and initial analysis of more than 15,000 full-length human and
RT   mouse cDNA sequences";
RL   Proc. Natl. Acad. Sci. U.S.A. 99(26):16899-16903(2002).
XX
RN   [2]
RC   NIH-MGC Project URL: http://mgc.nci.nih.gov
RP   1-3611
RG   NIH MGC Project
RA   ;
RT   ;
RL   Submitted (23-JAN-2003) to the INSDC.
RL   National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda,
RL   MD 20892-2590, USA
XX
DR   MD5; 8a01ccf07f090db232edd96844aa69be.
DR   Ensembl-Gn; ENSMUSG00000044393; mus_musculus.
DR   Ensembl-Tr; ENSMUST00000120102; mus_musculus.
XX
CC   Contact: MGC help desk
CC   Email: cgapbs-r@mail.nih.gov
CC   Tissue Procurement: Jeffrey Green M.D.
CC   cDNA Library Preparation: Life Technologies, Inc.
CC   cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
CC   DNA Sequencing by: Baylor College of Medicine Human Genome
CC   Sequencing Center
CC   Center code: BCM-HGSC
CC   Web site: http://www.hgsc.bcm.tmc.edu/cdna/
CC   Contact: amg@bcm.tmc.edu
CC   Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H.,
CC   Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati,
CC   A.N., Gibbs, R.A.
CC   Clone distribution: MGC clone distribution information can be found
CC   through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
CC   Series: IRAK Plate: 54 Row: b Column: 12
CC   This clone was selected for full length sequencing because it
CC   passed the following selection criteria: matched mRNA gi: 22779878
CC   This clone has the following problem: The cds is short compared to
CC   the longest cds in the locus.
CC   Differences found between this sequence and the mouse C57BL/6J
CC   genome (build 33) are described in misc_difference features below.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..3611
FT                   /organism="Mus musculus"
FT                   /lab_host="DH10B"
FT                   /strain="FVB/N"
FT                   /mol_type="mRNA"
FT                   /clone_lib="NCI_CGAP_Mam6"
FT                   /clone="IMAGE:5355674"
FT                   /tissue_type="Mammary tumor. C3(1)-Tag model. Infiltrating
FT                   ductal carcinoma. 5 month old virgin mouse."
FT                   /note="Vector: pCMV-SPORT6"
FT                   /db_xref="taxon:10090"
FT   gene            1..3611
FT                   /gene="Dsg2"
FT   misc_difference 33
FT                   /gene="Dsg2"
FT                   /note="'G' in cDNA is 'C' in the mouse genome."
FT   CDS             131..1213
FT                   /codon_start=1
FT                   /gene="Dsg2"
FT                   /product="Dsg2 protein"
FT                   /db_xref="GOA:Q811I1"
FT                   /db_xref="InterPro:IPR002126"
FT                   /db_xref="InterPro:IPR009122"
FT                   /db_xref="InterPro:IPR015919"
FT                   /db_xref="InterPro:IPR020894"
FT                   /db_xref="MGI:MGI:1196466"
FT                   /db_xref="UniProtKB/TrEMBL:Q811I1"
FT                   /protein_id="AAH44848.1"
FT                   /translation="MARSPGDRCALLLLVQLLAVVCLDFGNGLHLEVFSPRNEGKPFPK
FT                   HTHLVRQKRAWITAPVALREGEDLSRKNPIAKIHSDLAEEKGIKITYKYTGKGITEPPF
FT                   GIFVFDRNTGELNITSILDREETPYFLLTGYALDSRGNNLEKPLELRIKVLDINDNEPV
FT                   FTQEVFVGSIEELSAAHTLVMKITATDADDPETLNAKVSYRIVSQEPANSHMFYLNKDT
FT                   GEIYTTSFTLDREEHSSYSLTVEARDGNGQITDKPVQQAQVQIRILDVNDNIPVVENKM
FT                   YEGTVEENQVNVEVMRIKVTDADEVGSDNWLANFTFASGNEGGYFHIETDTQTNEGIVT
FT                   LVKVSCKCPPPGWDCGYRKD"
FT   misc_difference 211
FT                   /gene="Dsg2"
FT                   /note="'T' in cDNA is 'C' in the mouse genome; no amino
FT                   acid change."
FT   misc_difference 1877
FT                   /gene="Dsg2"
FT                   /note="'A' in cDNA is 'G' in the mouse genome."
FT   misc_difference 1975
FT                   /gene="Dsg2"
FT                   /note="'A' in cDNA is 'C' in the mouse genome."
FT   misc_difference 1992
FT                   /gene="Dsg2"
FT                   /note="'C' in cDNA is 'T' in the mouse genome."
FT   misc_difference 2583
FT                   /gene="Dsg2"
FT                   /note="'A' in cDNA is 'G' in the mouse genome."
FT   misc_difference 2939
FT                   /gene="Dsg2"
FT                   /note="'G' in cDNA is 'T' in the mouse genome."
FT   misc_difference 3200
FT                   /gene="Dsg2"
FT                   /note="'T' in cDNA is 'C' in the mouse genome."
FT   misc_difference 3346
FT                   /gene="Dsg2"
FT                   /note="'T' in cDNA is 'A' in the mouse genome."
FT   misc_difference 3432
FT                   /gene="Dsg2"
FT                   /note="'A' in cDNA is 'G' in the mouse genome."
FT   misc_difference 3590..3611
FT                   /gene="Dsg2"
FT                   /note="polyA tail: 22 bases do not align to the mouse
FT                   genome."
XX
SQ   Sequence 3611 BP; 1104 A; 730 C; 828 G; 949 T; 0 other;
     cccgggcaca cctggaaccg caccccgggt ccggcagagt cagagaaggg cggccccggg        60
     agggacctgc ccaggaggat ccgcagggcg ccggcgaggc ccggaggcga gggcgcggcg       120
     gatcgaggcg atggcgcgga gcccgggtga ccggtgcgcc ctgctgctgc tggtgcagct       180
     gctggcggtg gtctgcttgg actttggaaa tggacttcac ttagaggtct tcagcccaag       240
     aaatgaaggc aaaccgttcc ctaagcacac tcacttggtt cgtcaaaaga gggcctggat       300
     cactgcccct gtggctctgc gggagggcga agacctgtcc agaaagaacc cgattgccaa       360
     gatacactct gaccttgcag aagaaaaagg gataaaaatc acgtacaagt acactgggaa       420
     gggaattaca gaaccgcctt tcggcatatt cgtctttgat agaaacacag gagaactgaa       480
     catcactagc attcttgacc gggaagaaac accatatttt ctgctgacag gctatgcatt       540
     ggactccaga ggaaacaacc tggaaaagcc cttggaacta cgcatcaaag ttctggacat       600
     caatgacaac gagccagtgt tcacacagga ggtctttgtt gggtccattg aggaattgag       660
     tgcagcacat acacttgtga tgaaaatcac cgccacagat gcagatgacc cggagactct       720
     gaatgctaaa gtctcctaca gaattgtctc tcaggagcct gcaaatagtc atatgttcta       780
     cctaaataaa gacacggggg agatctatac gaccagtttt actttggaca gagaggaaca       840
     cagcagctat tccttgacgg tggaagcaag agatggtaac gggcagataa cagacaagcc       900
     agtccagcaa gctcaagttc agatccgtat attggatgtc aatgacaata tacctgtggt       960
     agaaaacaaa atgtatgagg ggacagtgga agaaaaccag gtcaatgtag aagtcatgcg      1020
     gatcaaagtg accgatgcag atgaagtggg ctctgataac tggctagcaa actttacatt      1080
     tgcatcagga aatgaagggg gctatttcca cattgagact gacacacaga ctaatgaagg      1140
     gattgtgacc cttgtcaagg taagttgtaa atgtccacca ccagggtggg actgtggtta      1200
     taggaaagac taacatgctt tgagtcccat actctcacag gtgctttaca gggacggaca      1260
     tcagttggta ccagcattgc tcagaatcag gaacagcctt catagcacaa gtcatataga      1320
     aagaagcagt tacactacag ttcagagtca gatcaggaac tcagatccca ggcacacaac      1380
     cttgcatagc acacttccaa gcagtgtgag cttgatgggc cagggctcag ttacttgcct      1440
     ttcagagaaa aatctattgt gattttactt gcaagtattg taagggctgc agtttgaaag      1500
     gaagtaacct gctctcctgt gcctcctgat tatcaatgaa ggcaaaattg aattaagtag      1560
     tcgtgctgaa catatgaaat tcagggtata gctcagtggt agatggcttg cctagcatgt      1620
     gtaaggtcct gggttcaacc cccagtgcca aataagttaa aatttggctg gatatagtga      1680
     cttgtacata taatcccagc acttgaaagg aggattgttg aaagtccaag gacaacctag      1740
     gctacattgt aagcctgggt tataaagtga gacctctagc caatcaatgg ccatggatga      1800
     actgggttta cctgaaaggg gggctggaaa tgaaagagac agggggcggg gctagagaag      1860
     gatgaagcca agacaaagtt ctggtcaagg ctccaagttt aatattcagc actgtgttta      1920
     tatagggaaa gcccacagac ccatcctttg tttcagctgg tttatgttgc aaagaaagat      1980
     cagcagggag tctataattc taagttcctg taggaaagtg cacatgcaac aaaggcagat      2040
     gactccacct aagtcattaa ggttcaatgt ggcaaacgct caaggtctgc tcaccctgct      2100
     caaggccgag aaaagagaac aattctccct ttttttgttt tttaaagatg agcagtaaca      2160
     gtaactggac aggggcacaa gatttacttg ttttctataa tcacatctag ggtaaagagt      2220
     ggcccagtga ctgttcctgt cttaggtcag tgtctatatt gctccttctt acccgtcatg      2280
     gagaacaaac atagcattga tatatgactt tgccttaggt cgataggagg tcaagaacac      2340
     tcttacctgt ctctgactac aggctttcta tagcacactc tttcccattc ggaccaaagc      2400
     catgaaggac atgtgactat gcactcaatg catttcttca tagcaattga taactgtcat      2460
     ggtgaatagc tgagcttcct gaaagggatc ctctgtgaag gcaaccaatc tgtttaagat      2520
     acaaggtcca gtagttaaaa gcaacagatt ataattaatg atcaaagcag tgtagaaatc      2580
     aaagttgtaa gccctgggga agtggaaaac caggactcaa accatccctg attctgtcct      2640
     ttttctcttt tccttttggc tagttcttct cgaactttag ccatagaatc ctttactatt      2700
     cctgaattat caatgtaaaa gtaacattct tcatggaggg ctgtgcgtag tcctccttgt      2760
     tgaaggaata aaagatcaag tcctctcctg ttttgaagga ctacctcaga caaggaggtt      2820
     agagattctt ataaatggga aatagaggtt tctatctttt ctatatctgt atctattgcc      2880
     attctaatgc actcatattt ttttttcttg caaaaccagg gcagaagtgc tagtagctgc      2940
     tcctcccaca ctcagattac cttgtcatcc aaattttaaa tttttaaaat aaaaagcatg      3000
     atttaaatga ttttgtggta tattcagata tagttcccct tttttgtctt ttatgaagat      3060
     attgttttat gaagataagt tccatgggtc tgttccacaa tatcttgtcc tttaggataa      3120
     taagaaaata tgggtaacat ccatttgcca taattggtta ggtacaagtc ctcgaagatt      3180
     aacactatta tatggtacat gaagagactg aggacgctga gagcaagtct ttacaacttg      3240
     atgtgcatat tttctagaaa taccaaattg ttgtctcaaa ctgttactat tttgatgatg      3300
     taaagaatga gagtgttcag ctaagtgttc ctgtaaggcc tataatctgc ctgatataca      3360
     aaactgctat accattttcc aggggtccag gcaatccagt atgagctctt aaatgtccta      3420
     taaagtaagg aacagtacat atttctcaaa ttaagccata tttgtataaa taactgtaaa      3480
     aattgagaat tagcagtatc taaaaaagga acaatttcaa gcaattgtaa accatgagct      3540
     atatattggc tatcagtata caaattaaaa acttgttttt tgagcatttc aaaaaaaaaa      3600
     aaaaaaaaaa a                                                           3611
//


spacer
spacer