Dbfetch

ID   BC002414; SV 2; linear; mRNA; STD; HUM; 2682 BP.
XX
AC   BC002414;
XX
DT   09-MAR-2001 (Rel. 67, Created)
DT   15-OCT-2008 (Rel. 97, Last updated, Version 15)
XX
DE   Homo sapiens vacuolar protein sorting 35 homolog (S. cerevisiae), mRNA
DE   (cDNA clone MGC:2587 IMAGE:3162255), complete cds.
XX
KW   MGC.
XX
OS   Homo sapiens (human)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC   Homo.
XX
RN   [1]
RP   1-2682
RX   DOI; 10.1073/pnas.242603899.
RX   PUBMED; 12477932.
RG   Mammalian Gene Collection Program Team
RA   Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., Klausner R.D.,
RA   Collins F.S., Wagner L., Shenmen C.M., Schuler G.D., Altschul S.F.,
RA   Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., Hopkins R.F., Jordan H.,
RA   Moore T., Max S.I., Wang J., Hsieh F., Diatchenko L., Marusina K.,
RA   Farmer A.A., Rubin G.M., Hong L., Stapleton M., Soares M.B., Bonaldo M.F.,
RA   Casavant T.L., Scheetz T.E., Brownstein M.J., Usdin T.B., Toshiyuki S.,
RA   Carninci P., Prange C., Raha S.S., Loquellano N.A., Peters G.J.,
RA   Abramson R.D., Mullahy S.J., Bosak S.A., McEwan P.J., McKernan K.J.,
RA   Malek J.A., Gunaratne P.H., Richards S., Worley K.C., Hale S., Garcia A.M.,
RA   Gay L.J., Hulyk S.W., Villalon D.K., Muzny D.M., Sodergren E.J., Lu X.,
RA   Gibbs R.A., Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S.,
RA   Sanchez A., Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA   Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., Rodriguez A.C.,
RA   Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S., Krzywinski M.I.,
RA   Skalska U., Smailus D.E., Schnerch A., Schein J.E., Jones S.J., Marra M.A.;
RT   "Generation and initial analysis of more than 15,000 full-length human and
RT   mouse cDNA sequences";
RL   Proc. Natl. Acad. Sci. U.S.A. 99(26):16899-16903(2002).
XX
RN   [2]
RC   NIH-MGC Project URL: http://mgc.nci.nih.gov
RP   1-2682
RG   NIH MGC Project
RA   ;
RT   ;
RL   Submitted (05-FEB-2001) to the INSDC.
RL   National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda,
RL   MD 20892-2590, USA
XX
DR   MD5; 8e0d190b138424d97aa8a409c35078a3.
DR   Ensembl-Gn; ENSG00000069329; homo_sapiens.
DR   Ensembl-Tr; ENST00000299138; homo_sapiens.
XX
CC   On Oct 8, 2003 this sequence version replaced gi:12803212.
CC   Contact: MGC help desk
CC   Email: cgapbs-r@mail.nih.gov
CC   Tissue Procurement: ATCC
CC   cDNA Library Preparation: Rubin Laboratory
CC   cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
CC   DNA Sequencing by: National Institutes of Health Intramural
CC   Sequencing Center (NISC),
CC   Gaithersburg, Maryland;
CC   Web site: http://www.nisc.nih.gov/
CC   Contact: nisc_mgc@nhgri.nih.gov
CC   Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
CC   Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
CC   Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
CC   Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
CC   Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
CC   McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
CC   Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
CC   Young,A., Zhang,L.-H. and Green,E.D.
CC   Clone distribution: MGC clone distribution information can be found
CC   through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
CC   Series: IRAL Plate: 5 Row: a Column: 7
CC   This clone was selected for full length sequencing because it
CC   passed the following selection criteria: matched mRNA gi: 41352714.
CC   Differences found between this sequence and the human reference
CC   genome (build 36) are described in misc_difference features below
CC   and these differences were also compared to chimpanzee genome
CC   (build 1).
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2682
FT                   /organism="Homo sapiens"
FT                   /lab_host="DH10B-R"
FT                   /mol_type="mRNA"
FT                   /clone_lib="NIH_MGC_19"
FT                   /clone="MGC:2587 IMAGE:3162255"
FT                   /tissue_type="Brain, neuroblastoma"
FT                   /note="Vector: pOTB7"
FT                   /db_xref="taxon:9606"
FT   gene            1..2682
FT                   /gene="VPS35"
FT                   /note="synonyms: DKFZp434E1211, DKFZp434P1672, MEM3"
FT   CDS             7..2397
FT                   /codon_start=1
FT                   /gene="VPS35"
FT                   /product="vacuolar protein sorting 35 homolog (S.
FT                   cerevisiae)"
FT                   /db_xref="GOA:Q96QK1"
FT                   /db_xref="H-InvDB:HIT000030875.16"
FT                   /db_xref="HGNC:HGNC:13487"
FT                   /db_xref="InterPro:IPR005378"
FT                   /db_xref="InterPro:IPR016024"
FT                   /db_xref="PDB:2R17"
FT                   /db_xref="PDB:5F0J"
FT                   /db_xref="PDB:5F0K"
FT                   /db_xref="PDB:5F0L"
FT                   /db_xref="PDB:5F0M"
FT                   /db_xref="PDB:5F0P"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q96QK1"
FT                   /protein_id="AAH02414.1"
FT                   /translation="MPTTQQSPQDEQEKLLDEAIQAVKVQSFQMKRCLDKNKLMDALKH
FT                   ASNMLGELRTSMLSPKSYYELYMAISDELHYLEVYLTDEFAKGRKVADLYELVQYAGNI
FT                   IPRLYLLITVGVVYVKSFPQSRKDILKDLVEMCRGVQHPLRGLFLRNYLLQCTRNILPD
FT                   EGEPTDEETTGDISDSMDFVLLNFAEMNKLWVRMQHQGHSRDREKRERERQELRILVGT
FT                   NLVRLSQLEGVNVERYKQIVLTGILEQVVNCRDALAQEYLMECIIQVFPDEFHLQTLNP
FT                   FLRACAELHQNVNVKNIIIALIDRLALFAHREDGPGIPADIKLFDIFSQQVATVIQSRQ
FT                   DMPSEDVVSLQVSLINLAMKCYPDRVDYVDKVLETTVEIFNKLNLEHIATSSAVSKELT
FT                   RLLKIPVDTYNNILTVLKLKHFHPLFEYFDYESRKSMSCYVLSNVLDYNTEIVSQDQVD
FT                   SIMNLVSTLIQDQPDQPVEDPDPEDFADEQSLVGRFIHLLRSEDPDQQYLILNTARKHF
FT                   GAGGNQRIRFTLPPLVFAAYQLAFRYKENSKVDDKWEKKCQKIFSFAHQTISALIKAEL
FT                   AELPLRLFLQGALAAGEIGFENHETVAYEFMSQAFSLYEDEISDSKAQLAAITLIIGTF
FT                   ERMKCFSEENHEPLRTQCALAASKLLKKPDQGRAVSTCAHLFWSGRNTDKNGEELHGGK
FT                   RVMECLKKALKIANQCMDPSLQVQLFIEILNRYIYFYEKENDAVTIQVLNQLIQKIRED
FT                   LPNLESSEETEQINKHFHNTLEHLRLRRESPESEGPIYEGLIL"
FT   misc_difference 1944
FT                   /gene="VPS35"
FT                   /note="'T' in cDNA is 'C' in the human genome; no amino
FT                   acid change. The chimpanzee genome agrees with the human
FT                   genomic sequence and not the cDNA."
FT   misc_difference 2678..2682
FT                   /gene="VPS35"
FT                   /note="polyA tail: 5 bases do not align to the human
FT                   genome."
XX
SQ   Sequence 2682 BP; 824 A; 533 C; 575 G; 750 T; 0 other;
     gtcgccatgc ctacaacaca gcagtcccct caggatgagc aggaaaagct cttggatgaa        60
     gccatacagg ctgtgaaggt ccagtcattc caaatgaaga gatgcctgga caaaaacaag       120
     cttatggatg ctctaaaaca tgcttctaat atgcttggtg aactccggac ttctatgtta       180
     tcaccaaaga gttactatga actttatatg gccatttctg atgaactgca ctacttggag       240
     gtctacctga cagatgagtt tgctaaagga aggaaagtgg cagatctcta cgaacttgta       300
     cagtatgctg gaaacattat cccaaggctt taccttttga tcacagttgg agttgtatat       360
     gtcaagtcat ttcctcagtc caggaaggat attttgaaag atttggtaga aatgtgccgt       420
     ggtgtgcaac atcccttgag gggtctgttt cttcgaaatt accttcttca gtgtaccaga       480
     aatatcttac ctgatgaagg agagccaaca gatgaagaaa caactggtga catcagtgat       540
     tccatggatt ttgtactgct caactttgca gaaatgaaca agctctgggt gcgaatgcag       600
     catcagggac atagccgaga tagagaaaaa agagaacgag aaagacaaga actgagaatt       660
     ttagtgggaa caaatttggt gcgcctcagt cagttggaag gtgtaaatgt ggaacgttac       720
     aaacagattg ttttgactgg catattggag caagttgtaa actgtaggga tgctttggct       780
     caagaatatc tcatggagtg tattattcag gttttccctg atgaatttca cctccagact       840
     ttgaatcctt ttcttcgggc ctgtgctgag ttacaccaga atgtaaatgt gaagaacata       900
     atcattgctt taattgatag attagcttta tttgctcacc gtgaagatgg acctggaatc       960
     ccagcggata ttaaactttt tgatatattt tcacagcagg tggctacagt gatacagtct      1020
     agacaagaca tgccttcaga ggatgttgta tctttacaag tctctctgat taatcttgcc      1080
     atgaaatgtt accctgatcg tgtggactat gttgataaag ttctagaaac aacagtggag      1140
     atattcaata agctcaacct tgaacatatt gctaccagta gtgcagtttc aaaggaactc      1200
     accagacttt tgaaaatacc agttgacact tacaacaata ttttaacagt cttgaaatta      1260
     aaacattttc acccactctt tgagtacttt gactacgagt ccagaaagag catgagttgt      1320
     tatgtgctta gtaatgttct ggattataac acagaaattg tctctcaaga ccaggtggat      1380
     tccataatga atttggtatc cacgttgatt caagatcagc cagatcaacc tgtagaagac      1440
     cctgatccag aagattttgc tgatgagcag agccttgtgg gccgcttcat tcatctgctg      1500
     cgctctgagg accctgacca gcagtacttg attttgaaca cagcacgaaa acattttgga      1560
     gctggtggaa atcagcggat tcgcttcaca ctgccacctt tggtatttgc agcttaccag      1620
     ctggcttttc gatataaaga gaattctaaa gtggatgaca aatgggaaaa gaaatgccag      1680
     aagatttttt catttgccca ccagactatc agtgctttga tcaaagcaga gctggcagaa      1740
     ttgcccttaa gactttttct tcaaggagca ctagctgctg gggaaattgg ttttgaaaat      1800
     catgagacag tcgcatatga attcatgtcc caggcatttt ctctgtatga agatgaaatc      1860
     agcgattcca aagcacagct agctgccatc accttgatca ttggcacttt tgaaaggatg      1920
     aagtgcttca gtgaagagaa tcatgaacct ctgaggactc agtgtgccct tgctgcatcc      1980
     aaacttctaa agaaacctga tcagggccga gctgtgagca cctgtgcaca tctcttctgg      2040
     tctggcagaa acacggacaa aaatggggag gagcttcacg gaggcaagag ggtaatggag      2100
     tgcctaaaaa aagctctaaa aatagcaaat cagtgcatgg acccctctct acaagtgcag      2160
     ctttttatag aaattctgaa cagatatatc tatttttatg aaaaggaaaa tgatgcggta      2220
     acaattcagg ttttaaacca gcttatccaa aagattcgag aagacctccc gaatcttgaa      2280
     tccagtgaag aaacagagca gattaacaaa cattttcata acacactgga gcatttgcgc      2340
     ttgcggcggg aatcaccaga atccgagggg ccaatttatg aaggtctcat cctttaaaaa      2400
     ggaaatagct caccatactc ctttccatgt acatccagtg agggttttat tacgctaggt      2460
     ttcccttcca tagattgtgc ctttcagaaa tgctgaggta ggtttcccat ttcttacctg      2520
     tgatgtgttt tacccagcac ctccggacac tcaccttcag gaccttaata aaattattca      2580
     cttggtaagt gttcaagtct ttctgatcac cccaagtagc atgactgatc tgcaatttaa      2640
     aattcctgtg atctgtaaaa aaaaaaaaaa aaaaaaaaaa aa                         2682
//