 |
EBI Dbfetch
ID BC064670; SV 1; linear; mRNA; STD; VRT; 3982 BP.
XX
AC BC064670;
XX
DT 24-DEC-2003 (Rel. 78, Created)
DT 16-JUL-2006 (Rel. 88, Last updated, Version 9)
XX
DE Danio rerio carbohydrate (chondroitin) synthase 1, mRNA (cDNA clone
DE MGC:64191 IMAGE:6798417), complete cds.
XX
KW MGC.
XX
OS Danio rerio (zebrafish)
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Cypriniformes;
OC Cyprinidae; Danio.
XX
RN [1]
RP 1-3982
RX DOI; 10.1073/pnas.242603899.
RX PUBMED; 12477932.
RG Mammalian Gene Collection Program Team
RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., Klausner R.D.,
RA Collins F.S., Wagner L., Shenmen C.M., Schuler G.D., Altschul S.F.,
RA Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., Hopkins R.F., Jordan H.,
RA Moore T., Max S.I., Wang J., Hsieh F., Diatchenko L., Marusina K.,
RA Farmer A.A., Rubin G.M., Hong L., Stapleton M., Soares M.B., Bonaldo M.F.,
RA Casavant T.L., Scheetz T.E., Brownstein M.J., Usdin T.B., Toshiyuki S.,
RA Carninci P., Prange C., Raha S.S., Loquellano N.A., Peters G.J.,
RA Abramson R.D., Mullahy S.J., Bosak S.A., McEwan P.J., McKernan K.J.,
RA Malek J.A., Gunaratne P.H., Richards S., Worley K.C., Hale S., Garcia A.M.,
RA Gay L.J., Hulyk S.W., Villalon D.K., Muzny D.M., Sodergren E.J., Lu X.,
RA Gibbs R.A., Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S.,
RA Sanchez A., Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., Rodriguez A.C.,
RA Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S., Krzywinski M.I.,
RA Skalska U., Smailus D.E., Schnerch A., Schein J.E., Jones S.J., Marra M.A.;
RT "Generation and initial analysis of more than 15,000 full-length human and
RT mouse cDNA sequences";
RL Proc. Natl. Acad. Sci. U.S.A. 99(26):16899-16903(2002).
XX
RN [2]
RC NIH-MGC Project URL: http://mgc.nci.nih.gov
RP 1-3982
RG NIH MGC Project
RA ;
RT ;
RL Submitted (22-DEC-2003) to the EMBL/GenBank/DDBJ databases.
RL National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda,
RL MD 20892-2590, USA
XX
DR Ensembl-Gn; ENSDARG00000042232; Danio_rerio.
DR Ensembl-Gn; ENSDARG00000059061; Danio_rerio.
DR Ensembl-Gn; ENSDARG00000071235; Danio_rerio.
DR Ensembl-Gn; ENSDARG00000079027; Danio_rerio.
DR Ensembl-Tr; ENSDART00000061912; Danio_rerio.
DR Ensembl-Tr; ENSDART00000082068; Danio_rerio.
DR Ensembl-Tr; ENSDART00000092131; Danio_rerio.
DR Ensembl-Tr; ENSDART00000104536; Danio_rerio.
DR ImaGenes; IMAGp998E0814313Q.
DR ImaGenes; IRAKp961O09117Q.
DR ImaGenes; IRBOp991G108D.
XX
CC Contact: MGC help desk
CC Email: cgapbs-r@mail.nih.gov
CC Tissue Procurement: Dr. Chi-Bin Chien
CC cDNA Library Preparation: Invitrogen Corp
CC cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
CC DNA Sequencing by: Sequencing Group at the Stanford Human Genome
CC Center, Stanford University School of Medicine, Stanford, CA 94305
CC Web site: http://www-shgc.stanford.edu
CC Contact: (Dickson, Mark) mcd@paxil.stanford.edu
CC Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
CC R. M.
CC Clone distribution: MGC clone distribution information can be found
CC through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
CC Series: IRAK Plate: 117 Row: o Column: 9
CC This clone was selected for full length sequencing because it
CC passed the following selection criteria: matched mRNA gi: 40352709.
XX
FH Key Location/Qualifiers
FH
FT source 1..3982
FT /organism="Danio rerio"
FT /lab_host="DH10B"
FT /mol_type="mRNA"
FT /clone_lib="NCI_CGAP_ZEmb3"
FT /clone="MGC:64191 IMAGE:6798417"
FT /tissue_type="Embryo, zebrafish whole embryo 72 hpf"
FT /note="Vector: pCMV-SPORT6.1"
FT /db_xref="taxon:7955"
FT gene 1..3982
FT /gene="chys1"
FT /note="synonyms: Chsy1, fc27h05, zgc:64191, fc27h06,
FT wu:fc27h06, wu:fc27h05"
FT CDS 286..2691
FT /codon_start=1
FT /gene="chys1"
FT /product="carbohydrate (chondroitin) synthase 1"
FT /db_xref="GOA:Q6P296"
FT /db_xref="InterPro:IPR008428"
FT /db_xref="UniProtKB/TrEMBL:Q6P296"
FT /db_xref="ZFIN:ZDB-GENE-030131-3127"
FT /protein_id="AAH64670.1"
FT /translation="MAGRSRRSWFSVLLGLVVGFTLASRLILPRASELKKAGQKRRASG
FT GGAGGCGGGGGMMMMMKKEYGGVLMPGTEKPSVPQSSSFLFVGVMTAQKYLNNRAVAAY
FT RTWAKTIPGKVEFFSSEGSDTTIPIPIVALQNVDDSYPPQKKSFMMLKYMHDHYLDQFE
FT WFMRADDDVYIKSEKLESFLRSLNSSEAIFLGQTGMGARDELGKLALEPGENFCMGGPG
FT VIMSREVLRRMVPHIRQCLQEMYTTHEDVEVGRCVRRFAGVQCVWSYEMQQLFYENYEP
FT NKKGYIRDLHNSKIHRAITLHPNKNPPYQYRLHSYMLSRKIAELRHRTIQLHREIVQMS
FT RYSNTEVHREDLQLGMPPSFMRFHPHQREEVLEWEFLTGKYIFSSSDGQPPRRGIDSSQ
FT KMALDDIIMQVMEMINANAKTRGRVIDFKEIQYGYRRVNPMYGAEYVLDLLLLYKKHKG
FT KTMTVPVRRHAYLQQTFSKIQFLEEEEMDARVLAARINQDSDSLSFLSNSLKMLVPFKL
FT SSPGIEQHEPKEKKINILVPLAGRYEIFLRFMANFEKICLIPNQNVKLLILLFSTDNNT
FT ERIKQIELMREYRMKYPKADMEIKPVSGPFSRALALEVGSAHFTNDSLLFYCDVDLLFT
FT PDFLTRCRGNTILGEQTYFPIIFSQYDPKVVYAGKVPSDNHYVFTSKTGLWRHYGFGIV
FT CVYKGDLVKAGGFDVSIQGWGLEDVDLFNKFVQSGIKLFRSTDTGIVHVHHPVVCDPNL
FT DPKQYKMCLGSKASSHGSTQQLAELWLEKNNHSFRKISNSNNESMRTE"
XX
SQ Sequence 3982 BP; 1044 A; 939 C; 988 G; 1011 T; 0 other;
cgagaatgag gatgcagccg cacttctttc cgcaacaaac cgttaattgc tcctgactac 60
ccaccgactc gggctcgaaa tcacggagat acctttacga accccgtcat ggctcttccg 120
gagcggatag tgtgtgatta gatggcagcg gttggtgatt tatggggacc ggagcggtga 180
ccatgctcgg acccaaaccc agcgttgtgg acgggactgc agcagcagca acgggcggcg 240
tttaataaac aaaactctga caaattaaag cgactagtag ctgaaatggc aggaaggagt 300
cgcagatctt ggtttagcgt gttgctgggg ctggtggtcg ggttcacact ggcgtccagg 360
ctcatcctgc cccgggccag cgagctgaag aaagcgggtc agaagcggcg ggcgagcggc 420
ggcggcgcgg gaggctgcgg aggcggcgga gggatgatga tgatgatgaa gaaggagtac 480
ggaggggtct tgatgcccgg taccgagaag ccctcagtgc cccagtctag cagctttctg 540
tttgtgggcg tcatgaccgc gcagaagtac ctgaataacc gcgccgtggc agcatatagg 600
acctgggcca agaccatccc gggcaaggtg gagtttttct ccagtgaagg ttctgacacc 660
accatcccca tccccattgt ggctctgcag aacgtggatg actcgtaccc tccgcagaag 720
aagtccttca tgatgctcaa atacatgcat gaccactatt tggaccagtt cgagtggttc 780
atgcgagccg acgacgacgt ctacatcaaa agcgaaaagc tggagagttt cctaaggagt 840
ctcaacagca gcgaggccat ctttctgggc cagacgggca tgggagcccg cgacgaactg 900
ggcaaactgg cgctggagcc gggggagaac ttctgcatgg gcggccctgg tgttatcatg 960
agccgcgagg tgctgcgcag gatggttccc cacatccgcc agtgcctgca ggagatgtac 1020
accacccatg aggacgtgga ggtgggccgc tgcgtgcgca gattcgccgg agtgcagtgt 1080
gtttggtcct acgagatgca gcagctgttc tatgagaatt acgagccaaa caagaagggc 1140
tacatccgtg atctccacaa cagtaagatc cacagagcca taacccttca ccccaacaag 1200
aacccacctt accagtatcg gttgcacagc tacatgttga gccggaagat tgcagagctg 1260
cgccaccgca ccattcagct ccatcgtgag attgtgcaaa tgagccgcta cagcaacacc 1320
gaggtgcaca gggaagacct tcaactgggc atgcctccat ccttcatgcg ctttcatccc 1380
caccaacgtg aggaggtgct ggaatgggaa ttcctcacgg gaaagtacat tttctcatcc 1440
tcagatggac agcctccccg tcgaggcatt gactcctcgc agaagatggc gctggacgac 1500
atcatcatgc aagttatgga gatgattaac gccaatgcca agacacgcgg aagagtcatt 1560
gacttcaaag agatccaata tggctaccga agggtcaacc ctatgtatgg tgcagagtac 1620
gtactggatc tgctgctgct ctacaagaag cacaagggga agactatgac agtgcccgta 1680
aggaggcatg cgtatcttca gcagaccttc agcaagatcc agttcttgga agaagaagag 1740
atggacgctc gagtcctggc tgccaggatc aaccaagact ccgactcgct gtctttcttg 1800
tccaactccc tcaagatgct cgtcccgttc aagttgagca gcccaggcat cgaacaacac 1860
gaacccaaag agaagaagat caacatcctt gtgccgctgg ccggacgcta cgagattttt 1920
ctgcgcttca tggccaactt cgagaagatc tgcctcattc ccaaccagaa tgtcaagctt 1980
ctgattctgc ttttcagcac ggacaacaac actgagcgca tcaagcaaat cgagctcatg 2040
agggagtatc gcatgaagta ccccaaagcc gatatggaga tcaaaccggt ttccggacca 2100
ttctcccgtg ctctggccct ggaggtcggc tcggcccact ttaccaatga ttctttactc 2160
ttctactgtg acgtggactt actgtttact cctgacttct tgactcgatg tcgtgggaat 2220
accatacttg gggaacagac atactttcct attattttca gccagtatga cccaaaggtt 2280
gtctacgcag ggaaggtgcc cagcgacaac cactacgtgt tcacatccaa aacaggcctg 2340
tggagacact atggttttgg tattgtctgc gtctacaaag gagacctggt caaagctggt 2400
ggttttgacg tctccatcca aggttgggga ctggaggacg ttgacctttt caacaagttt 2460
gtccaatccg ggatcaagct gttccgcagt acggataccg gcatcgttca cgtgcatcac 2520
cctgtggttt gcgaccccaa tcttgatccc aagcagtaca aaatgtgcct gggatccaaa 2580
gcatcctcgc acggctccac gcagcaactc gcagagctat ggttggagaa aaacaaccac 2640
agcttccgaa agatctcgaa ctccaataat gaatcaatga ggacagaata gacctggcaa 2700
gtgcacattg tccatccgtc ttcctcttaa tggtgttttt taaaactacc ttatttattt 2760
tttacaggaa aaagacttct gtggatatat tttgaaataa taactggttt tgtaaactgt 2820
acaggagctg gccgtcacct tcagacttga tcttttttta agtagcgttt caaataacac 2880
aacagtgact taaagaggac gatctatttt acaaatccaa tactgagcgt tatatttatg 2940
gcacctcaaa aacagactgc tgtggacctc gccagaaaaa aaatgctttg aaatggagtc 3000
gaagagggag tttggacacg gacttgtgtg gtgtctggca agatggtaaa tgtttatttg 3060
tgaatgttta cagagagcga tgaacaagtg attcattctt ggtgaagggg atctatagaa 3120
taacaatctt aattctgaca attaagggcc attgatcaac agcttctttt cggttcgttt 3180
tttccagttt acttgtctgg agttgggagg gggatccata ataacactgt ttcgttttta 3240
tcgcttttga tttattgttt ttattgactt ctgtgaatga aactgctggc tgttgatctc 3300
tcctactcat tcccactgtc taatttattg agcgttcagt cttcggggct gaggttgtgt 3360
atattagaga ttgtataacc gtttcagtca tggatagtgt ttttgttccc tctctcgatc 3420
ttgaattact ttaattacca aagactttct ttcttttttt tttttttttg cacagttgct 3480
gtaggtgata tttgttcaag agtgcttgtt tttgtgtgtg cgtgcgtgta tgagtccaaa 3540
tgcaaggcag tagagcagga aaaaacatgt aatgtactgt aggatggagg taccgatgac 3600
tattcaacat ggccctatga aagtatgccg acaacaagcc acttaaagag attttctcat 3660
ttcatatgat cagatttcat ttactcctct tggttattta aattttttga ttcagttggc 3720
ctttctcctg tgattgagcc atgcaccctg tggggacaac ggccgtaacg cggccccgcc 3780
ttcagattcg aggacgatct cccacagtag tctctctttc tgttgtcttt tgtttttctc 3840
tggggcgttt tcatacttca gttctttgtt agcccccata caactgggac aaagttttat 3900
atttaataaa atggaaaatg aaaaagaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3960
aaaaaaaaaa aaaaaaaaaa aa 3982
//
 |