Dbfetch

ID   BC012252; SV 1; linear; mRNA; STD; MUS; 1977 BP.
XX
AC   BC012252;
XX
DT   09-AUG-2001 (Rel. 68, Created)
DT   24-SEP-2008 (Rel. 97, Last updated, Version 10)
XX
DE   Mus musculus solute carrier family 35 (CMP-sialic acid transporter), member
DE   1, mRNA (cDNA clone MGC:18789 IMAGE:4190458), complete cds.
XX
KW   MGC.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC   Murinae; Mus; Mus.
XX
RN   [1]
RP   1-1977
RX   DOI; 10.1073/pnas.242603899.
RX   PUBMED; 12477932.
RG   Mammalian Gene Collection Program Team
RA   Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., Klausner R.D.,
RA   Collins F.S., Wagner L., Shenmen C.M., Schuler G.D., Altschul S.F.,
RA   Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., Hopkins R.F., Jordan H.,
RA   Moore T., Max S.I., Wang J., Hsieh F., Diatchenko L., Marusina K.,
RA   Farmer A.A., Rubin G.M., Hong L., Stapleton M., Soares M.B., Bonaldo M.F.,
RA   Casavant T.L., Scheetz T.E., Brownstein M.J., Usdin T.B., Toshiyuki S.,
RA   Carninci P., Prange C., Raha S.S., Loquellano N.A., Peters G.J.,
RA   Abramson R.D., Mullahy S.J., Bosak S.A., McEwan P.J., McKernan K.J.,
RA   Malek J.A., Gunaratne P.H., Richards S., Worley K.C., Hale S., Garcia A.M.,
RA   Gay L.J., Hulyk S.W., Villalon D.K., Muzny D.M., Sodergren E.J., Lu X.,
RA   Gibbs R.A., Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S.,
RA   Sanchez A., Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA   Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., Rodriguez A.C.,
RA   Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S., Krzywinski M.I.,
RA   Skalska U., Smailus D.E., Schnerch A., Schein J.E., Jones S.J., Marra M.A.;
RT   "Generation and initial analysis of more than 15,000 full-length human and
RT   mouse cDNA sequences";
RL   Proc. Natl. Acad. Sci. U.S.A. 99(26):16899-16903(2002).
XX
RN   [2]
RC   NIH-MGC Project URL: http://mgc.nci.nih.gov
RP   1-1977
RG   NIH MGC Project
RA   ;
RT   ;
RL   Submitted (06-AUG-2001) to the INSDC.
RL   National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda,
RL   MD 20892-2590, USA
XX
DR   MD5; b22c81fe7148f76a9befac3f7c123421.
DR   Ensembl-Gn; ENSMUSG00000028293; mus_musculus.
DR   Ensembl-Gn; MGP_129S1SvImJ_G0028129; mus_musculus_129s1svimj.
DR   Ensembl-Gn; MGP_AJ_G0028087; mus_musculus_aj.
DR   Ensembl-Gn; MGP_AKRJ_G0028050; mus_musculus_akrj.
DR   Ensembl-Gn; MGP_BALBcJ_G0028098; mus_musculus_balbcj.
DR   Ensembl-Gn; MGP_C3HHeJ_G0027827; mus_musculus_c3hhej.
DR   Ensembl-Gn; MGP_C57BL6NJ_G0028540; mus_musculus_c57bl6nj.
DR   Ensembl-Gn; MGP_CASTEiJ_G0027282; mus_musculus_casteij.
DR   Ensembl-Gn; MGP_CBAJ_G0027799; mus_musculus_cbaj.
DR   Ensembl-Gn; MGP_DBA2J_G0027943; mus_musculus_dba2j.
DR   Ensembl-Gn; MGP_FVBNJ_G0027909; mus_musculus_fvbnj.
DR   Ensembl-Gn; MGP_LPJ_G0028051; mus_musculus_lpj.
DR   Ensembl-Gn; MGP_NODShiLtJ_G0027942; mus_musculus_nodshiltj.
DR   Ensembl-Gn; MGP_NZOHlLtJ_G0028599; mus_musculus_nzohlltj.
DR   Ensembl-Gn; MGP_PWKPhJ_G0027001; mus_musculus_pwkphj.
DR   Ensembl-Gn; MGP_WSBEiJ_G0027358; mus_musculus_wsbeij.
DR   Ensembl-Tr; ENSMUST00000029970; mus_musculus.
DR   Ensembl-Tr; MGP_129S1SvImJ_T0064997; mus_musculus_129s1svimj.
DR   Ensembl-Tr; MGP_AJ_T0065028; mus_musculus_aj.
DR   Ensembl-Tr; MGP_AKRJ_T0064967; mus_musculus_akrj.
DR   Ensembl-Tr; MGP_BALBcJ_T0064947; mus_musculus_balbcj.
DR   Ensembl-Tr; MGP_C3HHeJ_T0064637; mus_musculus_c3hhej.
DR   Ensembl-Tr; MGP_C57BL6NJ_T0065437; mus_musculus_c57bl6nj.
DR   Ensembl-Tr; MGP_CASTEiJ_T0064986; mus_musculus_casteij.
DR   Ensembl-Tr; MGP_CBAJ_T0064586; mus_musculus_cbaj.
DR   Ensembl-Tr; MGP_DBA2J_T0064709; mus_musculus_dba2j.
DR   Ensembl-Tr; MGP_FVBNJ_T0064657; mus_musculus_fvbnj.
DR   Ensembl-Tr; MGP_LPJ_T0064841; mus_musculus_lpj.
DR   Ensembl-Tr; MGP_NODShiLtJ_T0064660; mus_musculus_nodshiltj.
DR   Ensembl-Tr; MGP_NZOHlLtJ_T0065653; mus_musculus_nzohlltj.
DR   Ensembl-Tr; MGP_PWKPhJ_T0064506; mus_musculus_pwkphj.
DR   Ensembl-Tr; MGP_WSBEiJ_T0063887; mus_musculus_wsbeij.
XX
CC   Contact: MGC help desk
CC   Email: cgapbs-r@mail.nih.gov
CC   Tissue Procurement: Jeffrey E. Green, M.D.
CC   cDNA Library Preparation: Life Technologies, Inc.
CC   cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
CC   DNA Sequencing by: Baylor College of Medicine Human Genome
CC   Sequencing Center
CC   Center code: BCM-HGSC
CC   Web site: http://www.hgsc.bcm.tmc.edu/cdna/
CC   Contact: amg@bcm.tmc.edu
CC   Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H.,
CC   Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati,
CC   A.N., Gibbs, R.A.
CC   Clone distribution: MGC clone distribution information can be found
CC   through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
CC   Series: IRAK Plate: 24 Row: h Column: 2
CC   This clone was selected for full length sequencing because it
CC   passed the following selection criteria: matched mRNA gi: 31560518.
CC   Differences found between this sequence and the mouse C57BL/6J
CC   genome (build 36) are described in misc_difference features below.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..1977
FT                   /organism="Mus musculus"
FT                   /lab_host="DH10B"
FT                   /strain="FVB/N"
FT                   /mol_type="mRNA"
FT                   /clone_lib="NCI_CGAP_SG2"
FT                   /clone="MGC:18789 IMAGE:4190458"
FT                   /tissue_type="Salivary gland, 10 week old female mouse"
FT                   /note="Vector: pCMV-SPORT6"
FT                   /db_xref="taxon:10090"
FT   gene            1..1977
FT                   /gene="Slc35a1"
FT   misc_difference 45
FT                   /gene="Slc35a1"
FT                   /note="'C' in cDNA is 'T' in the mouse genome."
FT   CDS             95..1105
FT                   /codon_start=1
FT                   /gene="Slc35a1"
FT                   /product="solute carrier family 35 (CMP-sialic acid
FT                   transporter), member 1"
FT                   /db_xref="GOA:Q61420"
FT                   /db_xref="InterPro:IPR007271"
FT                   /db_xref="MGI:MGI:1345622"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q61420"
FT                   /protein_id="AAH12252.1"
FT                   /translation="MAPARENVSLFFKLYCLAVMTLVAAAYTVALRYTRTTAEELYFST
FT                   TAVCITEVIKLLISVGLLAKETGSLGRFKASLSENVLGSPKELAKLSVPSLVYAVQNNM
FT                   AFLALSNLDAAVYQVTYQLKIPCTALCTVLMLNRTLSKLQWISVFMLCGGVTLVQWKPA
FT                   QASKVVVAQNPLLGFGAIAIAVLCSGFAGVYFEKVLKSSDTSLWVRNIQMYLSGIVVTL
FT                   AGTYLSDGAEIQEKGFFYGYTYYVWFVIFLASVGGLYTSVVVKYTDNIMKGFSAAAAIV
FT                   LSTIASVLLFGLQITLSFALGALLVCVSIYLYGLPRQDTTSIQQEATSKERIIGV"
FT   misc_difference 146
FT                   /gene="Slc35a1"
FT                   /note="'G' in cDNA is 'A' in the mouse genome; amino acid
FT                   difference: 'A' in cDNA, 'T' in the mouse genome."
FT   misc_difference 590
FT                   /gene="Slc35a1"
FT                   /note="'T' in cDNA is 'A' in the mouse genome; amino acid
FT                   difference: 'S' in cDNA, 'T' in the mouse genome."
FT   misc_difference 1346^1347
FT                   /gene="Slc35a1"
FT                   /note="2 bases in the mouse genome, GA, are not found in
FT                   cDNA."
FT   misc_difference 1414^1415
FT                   /gene="Slc35a1"
FT                   /note="12 bases in the mouse genome are not found in cDNA."
FT   misc_difference 1941..1977
FT                   /gene="Slc35a1"
FT                   /note="polyA tail: 37 bases do not align to the mouse
FT                   genome."
XX
SQ   Sequence 1977 BP; 559 A; 374 C; 461 G; 583 T; 0 other;
     cgcttcctcc tcgcgcgtgt ggtgcggcgg gctctctagg ccggcgcgtc tctatggccg        60
     caggggcgtc agttccgcag actctctcgg caccatggct ccggcgagag aaaatgtcag       120
     tttattcttc aagctgtact gcttggcggt gatgactctg gtggctgccg cttacaccgt       180
     agctttaaga tacacaagga caacagctga agaactctac ttctcaacca ctgccgtgtg       240
     tatcacagaa gtgataaagt tactgataag tgttggcctg ttagctaagg aaactggcag       300
     tttgggtaga tttaaagcct cattaagtga aaatgtcttg gggagcccca aggaactggc       360
     gaagttgagt gtgccatcac tagtgtatgc tgtgcagaac aacatggcct tcctggctct       420
     cagtaatctg gatgcagcag tgtaccaggt gacctatcaa ctgaagatcc cctgcactgc       480
     tttatgtact gttttaatgt taaatcgaac actcagcaaa ttacagtgga tttccgtctt       540
     catgctgtgt ggtggggtca cactcgtaca gtggaaacca gcccaagctt caaaagtcgt       600
     ggtagcgcag aatccattgt taggctttgg tgctatagct attgctgtat tgtgctctgg       660
     atttgcagga gtttattttg aaaaagtctt aaagagttcc gacacttccc tttgggtgag       720
     aaacattcag atgtatctgt cagggatcgt tgtgacgtta gctggtacct acttgtcaga       780
     tggagctgaa attcaagaaa aaggattctt ctatggctac acgtattatg tctggtttgt       840
     tatcttcctt gctagtgtgg gaggcctcta cacgtcagtg gtggtgaagt atacagacaa       900
     catcatgaaa ggcttctctg ctgccgcagc cattgttctt tctaccattg cttcagtcct       960
     actgtttgga ttacagataa cactttcatt tgcactggga gctcttcttg tgtgtgtttc      1020
     catatatctc tatgggttac ccagacaaga tactacatcc attcaacaag aagcaacttc      1080
     aaaagagaga atcattggtg tgtgatttga atctcaagag attcctataa ggacttaaac      1140
     tgttgataat aaattagagc cttaagtcaa ccccagatgg taggttaaat aatgtcaaca      1200
     aaataattgt atgacataag aatcaagaag aaaactctga atgaaatgct aaaacagatt      1260
     taatttgggt gtgtttggtg tcaagttata ttatttcaaa atgaaggact ttatatatat      1320
     gagagagaga gagagagaga gagagaaaga gagagagaga tcttgtacac agagcatgga      1380
     ggtgccatgg tatttttgtg tgtgtgtgtg tgtgatgtac acagagcacg gaggcaatga      1440
     tgcaggacca gagagcatca ggtgaaacta taatattcaa gcaacgaggt ttaagaccgt      1500
     gtctgagggt tacagtgcca aagccatttc tgtacacact gttctcttgt tcaggtacct      1560
     ggagaggaag gctagccttc ttctccagtc catggatagt actttgtccc catagcagtg      1620
     aggatctagc ttctcttctc agagtgagga aggagtaaga cagttgaaca cacctcaggg      1680
     tgaatctact tcgtagctca gacactgact tctgggtgaa gactgagaac tctagtgatg      1740
     catttgtgcc atttactgtc tggttcctat ttctttgctg tcagctgata tacttttcag      1800
     aaaattttat aagctgcttt tatactttct tttttataaa gtatggttac ctgttgggct      1860
     ctcaatttgt gactttcagt gattttaaaa tatttctata atgttaatgg ggaaatccag      1920
     caataaactt atttctacca aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaa         1977
//