Dbfetch

ID   BC093036; SV 1; linear; mRNA; STD; HUM; 2711 BP.
XX
AC   BC093036;
XX
DT   15-APR-2005 (Rel. 83, Created)
DT   15-OCT-2008 (Rel. 97, Last updated, Version 7)
XX
DE   Homo sapiens vacuolar protein sorting 35 homolog (S. cerevisiae), mRNA
DE   (cDNA clone MGC:110953 IMAGE:30340379), complete cds.
XX
KW   MGC.
XX
OS   Homo sapiens (human)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC   Homo.
XX
RN   [1]
RP   1-2711
RX   DOI; 10.1073/pnas.242603899.
RX   PUBMED; 12477932.
RG   Mammalian Gene Collection Program Team
RA   Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., Klausner R.D.,
RA   Collins F.S., Wagner L., Shenmen C.M., Schuler G.D., Altschul S.F.,
RA   Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., Hopkins R.F., Jordan H.,
RA   Moore T., Max S.I., Wang J., Hsieh F., Diatchenko L., Marusina K.,
RA   Farmer A.A., Rubin G.M., Hong L., Stapleton M., Soares M.B., Bonaldo M.F.,
RA   Casavant T.L., Scheetz T.E., Brownstein M.J., Usdin T.B., Toshiyuki S.,
RA   Carninci P., Prange C., Raha S.S., Loquellano N.A., Peters G.J.,
RA   Abramson R.D., Mullahy S.J., Bosak S.A., McEwan P.J., McKernan K.J.,
RA   Malek J.A., Gunaratne P.H., Richards S., Worley K.C., Hale S., Garcia A.M.,
RA   Gay L.J., Hulyk S.W., Villalon D.K., Muzny D.M., Sodergren E.J., Lu X.,
RA   Gibbs R.A., Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S.,
RA   Sanchez A., Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA   Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., Rodriguez A.C.,
RA   Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S., Krzywinski M.I.,
RA   Skalska U., Smailus D.E., Schnerch A., Schein J.E., Jones S.J., Marra M.A.;
RT   "Generation and initial analysis of more than 15,000 full-length human and
RT   mouse cDNA sequences";
RL   Proc. Natl. Acad. Sci. U.S.A. 99(26):16899-16903(2002).
XX
RN   [2]
RC   NIH-MGC Project URL: http://mgc.nci.nih.gov
RP   1-2711
RG   NIH MGC Project
RA   ;
RT   ;
RL   Submitted (04-APR-2005) to the INSDC.
RL   National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda,
RL   MD 20892-2590, USA
XX
DR   MD5; 34caaa564b052511f287c7944b7e2641.
DR   Ensembl-Gn; ENSG00000069329; homo_sapiens.
DR   Ensembl-Tr; ENST00000299138; homo_sapiens.
XX
CC   Contact: MGC help desk
CC   Email: cgapbs-r@mail.nih.gov
CC   Tissue Procurement: Dr. Stefan Hansson
CC   cDNA Library Preparation: Michael Brownstein /  Ted Usdin
CC   Laboratory
CC   cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
CC   DNA Sequencing by: Sequencing Group at the Stanford Human Genome
CC   Center, Stanford University School of Medicine, Stanford, CA  94305
CC   Web site:       http://www-shgc.stanford.edu
CC   Contact:  (Dickson, Mark) mcd@paxil.stanford.edu
CC   Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
CC   R. M.
CC   Clone distribution: MGC clone distribution information can be found
CC   through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
CC   Series: IRAK Plate: 198 Row: i Column: 8
CC   This clone was selected for full length sequencing because it
CC   passed the following selection criteria: matched mRNA gi: 41352714.
CC   Differences found between this sequence and the human reference
CC   genome (build 36) are described in misc_difference features below
CC   and these differences were also compared to chimpanzee genome
CC   (build 1).
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2711
FT                   /organism="Homo sapiens"
FT                   /lab_host="DH10B"
FT                   /mol_type="mRNA"
FT                   /clone_lib="NIH_MGC_147"
FT                   /clone="MGC:110953 IMAGE:30340379"
FT                   /tissue_type="Placenta, normal"
FT                   /note="Vector: pBluescriptR"
FT                   /db_xref="taxon:9606"
FT   gene            1..2711
FT                   /gene="VPS35"
FT                   /note="synonyms: DKFZp434E1211, DKFZp434P1672, MEM3"
FT   CDS             46..2436
FT                   /codon_start=1
FT                   /gene="VPS35"
FT                   /product="vacuolar protein sorting 35 homolog (S.
FT                   cerevisiae)"
FT                   /db_xref="GOA:Q96QK1"
FT                   /db_xref="H-InvDB:HIT000334527.10"
FT                   /db_xref="HGNC:HGNC:13487"
FT                   /db_xref="InterPro:IPR005378"
FT                   /db_xref="InterPro:IPR011989"
FT                   /db_xref="InterPro:IPR016024"
FT                   /db_xref="PDB:2R17"
FT                   /db_xref="PDB:5F0J"
FT                   /db_xref="PDB:5F0K"
FT                   /db_xref="PDB:5F0L"
FT                   /db_xref="PDB:5F0M"
FT                   /db_xref="PDB:5F0P"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q96QK1"
FT                   /protein_id="AAH93036.1"
FT                   /translation="MPTTQQSPQDEQEKLLDEAIQAVKVQSFQMKRCLDKNKLMDALKH
FT                   ASNMLGELRTSMLSPKSYYELYMAISDELHYLEVYLTDEFAKGRKVADLYELVQYAGNI
FT                   IPRLYLLITVGVVYVKSFPQSRKDILKDLVEMCRGVQHPLRGLFLRNYLLQCTRNILPD
FT                   EGEPTDEETTGDISDSMDFVLLNFAEMNKLWVRMQHQGHSRDREKRERERQELRILVGT
FT                   NLVRLSQLEGVNVERYKQIVLTGILEQVVNCRDALAQEYLMECIIQVFPDEFHLQTLNP
FT                   FLRACAELHQNVNVKNIIIALIDRLALFAHREDGPGIPADIKLFDIFSQQVATVIQSRQ
FT                   DMPSEDVVSLQVSLINLAMKCYPDRVDYVDKVLETTVEIFNKLNLEHIATSSAVSKELT
FT                   RLLKIPVDTYNNILTVLKLKHFHPLFEYFDYESRKSMSCYVLSNVLDYNTEIVSQDQVD
FT                   SIMNLVSTLIQDQPDQPVEDPDPEDFADEQSLVGRFIHLLRSEDPDQQYLILNTARKHF
FT                   GAGGNQRIRFTLPPLVFAAYQLAFRYKENSKVDDKWEKKCQKIFSFAHQTISALIKAEL
FT                   AELPLRLFLQGALAAGEIGFENHETVAYEFMSQAFSLYEDEISDSKAQLAAITLIIGTF
FT                   ERMKCFSEENHEPLRTQCALAASKLLKKPDQGRAVSTCAHLFWSGRNTDKNGEELHGGK
FT                   RVMECLKKALKIANQCMDPSLQVQLFIEILNRYIYFYEKENDAVTIQVLNQLIQKIRED
FT                   LPNLESSEETEQINKHFHNTLEHLRLRRESPESEGPIYEGLIL"
FT   misc_difference 1983
FT                   /gene="VPS35"
FT                   /note="'T' in cDNA is 'C' in the human genome; no amino
FT                   acid change. The chimpanzee genome agrees with the human
FT                   genomic sequence and not the cDNA."
XX
SQ   Sequence 2711 BP; 817 A; 543 C; 594 G; 757 T; 0 other;
     acgcgcgggg cgggtgctgc ttgctgcagg ctctggggag tcgccatgcc tacaacacag        60
     cagtcccctc aggatgagca ggaaaagctc ttggatgaag ccatacaggc tgtgaaggtc       120
     cagtcattcc aaatgaagag atgcctggac aaaaacaagc ttatggatgc tctaaaacat       180
     gcttctaata tgcttggtga actccggact tctatgttat caccaaagag ttactatgaa       240
     ctttatatgg ccatttctga tgaactgcac tacttggagg tctacctgac agatgagttt       300
     gctaaaggaa ggaaagtggc agatctctac gaacttgtac agtatgctgg aaacattatc       360
     ccaaggcttt accttttgat cacagttgga gttgtatatg tcaagtcatt tcctcagtcc       420
     aggaaggata ttttgaaaga tttggtagaa atgtgccgtg gtgtgcaaca tcccttgagg       480
     ggtctgtttc ttcgaaatta ccttcttcag tgtaccagaa atatcttacc tgatgaagga       540
     gagccaacag atgaagaaac aactggtgac atcagtgatt ccatggattt tgtactgctc       600
     aactttgcag aaatgaacaa gctctgggtg cgaatgcagc atcagggaca tagccgagat       660
     agagaaaaaa gagaacgaga aagacaagaa ctgagaattt tagtgggaac aaatttggtg       720
     cgcctcagtc agttggaagg tgtaaatgtg gaacgttaca aacagattgt tttgactggc       780
     atattggagc aagttgtaaa ctgtagggat gctttggctc aagaatatct catggagtgt       840
     attattcagg ttttccctga tgaatttcac ctccagactt tgaatccttt tcttcgggcc       900
     tgtgctgagt tacaccagaa tgtaaatgtg aagaacataa tcattgcttt aattgataga       960
     ttagctttat ttgctcaccg tgaagatgga cctggaatcc cagcggatat taaacttttt      1020
     gatatatttt cacagcaggt ggctacagtg atacagtcta gacaagacat gccttcagag      1080
     gatgttgtat ctttacaagt ctctctgatt aatcttgcca tgaaatgtta ccctgatcgt      1140
     gtggactatg ttgataaagt tctagaaaca acagtggaga tattcaataa gctcaacctt      1200
     gaacatattg ctaccagtag tgcagtttca aaggaactca ccagactttt gaaaatacca      1260
     gttgacactt acaacaatat tttaacagtc ttgaaattaa aacattttca cccactcttt      1320
     gagtactttg actacgagtc cagaaagagc atgagttgtt atgtgcttag taatgttctg      1380
     gattataaca cagaaattgt ctctcaagac caggtggatt ccataatgaa tttggtatcc      1440
     acgttgattc aagatcagcc agatcaacct gtagaagacc ctgatccaga agattttgct      1500
     gatgagcaga gccttgtggg ccgcttcatt catctgctgc gctctgagga ccctgaccag      1560
     cagtacttga ttttgaacac agcacgaaaa cattttggag ctggtggaaa tcagcggatt      1620
     cgcttcacac tgccaccttt ggtatttgca gcttaccagc tggcttttcg atataaagag      1680
     aattctaaag tggatgacaa atgggaaaag aaatgccaga agattttttc atttgcccac      1740
     cagactatca gtgctttgat caaagcagag ctggcagaat tgcccttaag actttttctt      1800
     caaggagcac tagctgctgg ggaaattggt tttgaaaatc atgagacagt cgcatatgaa      1860
     ttcatgtccc aggcattttc tctgtatgaa gatgaaatca gcgattccaa agcacagcta      1920
     gctgccatca ccttgatcat tggcactttt gaaaggatga agtgcttcag tgaagagaat      1980
     catgaacctc tgaggactca gtgtgccctt gctgcatcca aacttctaaa gaaacctgat      2040
     cagggccgag ctgtgagcac ctgtgcacat ctcttctggt ctggcagaaa cacggacaaa      2100
     aatggggagg agcttcacgg aggcaagagg gtaatggagt gcctaaaaaa agctctaaaa      2160
     atagcaaatc agtgcatgga cccctctcta caagtgcagc tttttataga aattctgaac      2220
     agatatatct atttttatga aaaggaaaat gatgcggtaa caattcaggt tttaaaccag      2280
     cttatccaaa agattcgaga agacctcccg aatcttgaat ccagtgaaga aacagagcag      2340
     attaacaaac attttcataa cacactggag catttgcgct tgcggcggga atcaccagaa      2400
     tccgaggggc caatttatga aggtctcatc ctttaaaaag gaaatagctc accatactcc      2460
     tttccatgta catccagtga gggttttatt acgctaggtt tcccttccat agattgtgcc      2520
     tttcagaaat gctgaggtag gtttcccatt tcttacctgt gatgtgtttt acccagcacc      2580
     tccggacact caccttcagg accttaataa aattattcac ttggtaagtg ttcaagtctt      2640
     tctgatcacc ccaagtagca tgactgatct gcaatttaaa attcctgtga tctgtaaaaa      2700
     aaaaaaaaaa a                                                           2711
//