Dbfetch

ID   BC010362; SV 1; linear; mRNA; STD; HUM; 2698 BP.
XX
AC   BC010362;
XX
DT   13-JUL-2001 (Rel. 68, Created)
DT   15-OCT-2008 (Rel. 97, Last updated, Version 8)
XX
DE   Homo sapiens vacuolar protein sorting 35 homolog (S. cerevisiae), mRNA
DE   (cDNA clone MGC:13402 IMAGE:4249949), complete cds.
XX
KW   MGC.
XX
OS   Homo sapiens (human)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC   Homo.
XX
RN   [1]
RP   1-2698
RX   DOI; 10.1073/pnas.242603899.
RX   PUBMED; 12477932.
RG   Mammalian Gene Collection Program Team
RA   Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., Klausner R.D.,
RA   Collins F.S., Wagner L., Shenmen C.M., Schuler G.D., Altschul S.F.,
RA   Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., Hopkins R.F., Jordan H.,
RA   Moore T., Max S.I., Wang J., Hsieh F., Diatchenko L., Marusina K.,
RA   Farmer A.A., Rubin G.M., Hong L., Stapleton M., Soares M.B., Bonaldo M.F.,
RA   Casavant T.L., Scheetz T.E., Brownstein M.J., Usdin T.B., Toshiyuki S.,
RA   Carninci P., Prange C., Raha S.S., Loquellano N.A., Peters G.J.,
RA   Abramson R.D., Mullahy S.J., Bosak S.A., McEwan P.J., McKernan K.J.,
RA   Malek J.A., Gunaratne P.H., Richards S., Worley K.C., Hale S., Garcia A.M.,
RA   Gay L.J., Hulyk S.W., Villalon D.K., Muzny D.M., Sodergren E.J., Lu X.,
RA   Gibbs R.A., Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S.,
RA   Sanchez A., Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA   Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., Rodriguez A.C.,
RA   Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S., Krzywinski M.I.,
RA   Skalska U., Smailus D.E., Schnerch A., Schein J.E., Jones S.J., Marra M.A.;
RT   "Generation and initial analysis of more than 15,000 full-length human and
RT   mouse cDNA sequences";
RL   Proc. Natl. Acad. Sci. U.S.A. 99(26):16899-16903(2002).
XX
RN   [2]
RC   NIH-MGC Project URL: http://mgc.nci.nih.gov
RP   1-2698
RG   NIH MGC Project
RA   ;
RT   ;
RL   Submitted (09-JUL-2001) to the INSDC.
RL   National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda,
RL   MD 20892-2590, USA
XX
DR   MD5; ee17e11f4905d0b573eeb37e8dd2982d.
DR   Ensembl-Gn; ENSG00000069329; homo_sapiens.
DR   Ensembl-Tr; ENST00000299138; homo_sapiens.
XX
CC   Contact: MGC help desk
CC   Email: cgapbs-r@mail.nih.gov
CC   Tissue Procurement: CLONTECH
CC   cDNA Library Preparation: CLONTECH Laboratories, Inc.
CC   cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
CC   DNA Sequencing by: Institute for Systems Biology
CC   http://www.systemsbiology.org
CC   contact: amadan@systemsbiology.org
CC   Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
CC   Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
CC   Clone distribution: MGC clone distribution information can be found
CC   through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
CC   Series: IRAL Plate: 19 Row: i Column: 6
CC   This clone was selected for full length sequencing because it
CC   passed the following selection criteria: matched mRNA gi: 41352714.
CC   Differences found between this sequence and the human reference
CC   genome (build 36) are described in misc_difference features below
CC   and these differences were also compared to chimpanzee genome
CC   (build 1).
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2698
FT                   /organism="Homo sapiens"
FT                   /lab_host="DH10B"
FT                   /mol_type="mRNA"
FT                   /clone_lib="NIH_MGC_83"
FT                   /clone="MGC:13402 IMAGE:4249949"
FT                   /tissue_type="Prostate"
FT                   /note="Vector: pDNR-LIB"
FT                   /db_xref="taxon:9606"
FT   gene            1..2698
FT                   /gene="VPS35"
FT                   /note="synonyms: DKFZp434E1211, DKFZp434P1672, MEM3"
FT   misc_difference 1..3
FT                   /gene="VPS35"
FT                   /note="3 bases at the 5' end do not align to the human
FT                   genome."
FT   CDS             21..2411
FT                   /codon_start=1
FT                   /gene="VPS35"
FT                   /product="vacuolar protein sorting 35 homolog (S.
FT                   cerevisiae)"
FT                   /db_xref="GOA:Q96QK1"
FT                   /db_xref="H-InvDB:HIT000034898.15"
FT                   /db_xref="HGNC:HGNC:13487"
FT                   /db_xref="InterPro:IPR005378"
FT                   /db_xref="InterPro:IPR016024"
FT                   /db_xref="InterPro:IPR042491"
FT                   /db_xref="PDB:2R17"
FT                   /db_xref="PDB:5F0J"
FT                   /db_xref="PDB:5F0K"
FT                   /db_xref="PDB:5F0L"
FT                   /db_xref="PDB:5F0M"
FT                   /db_xref="PDB:5F0P"
FT                   /db_xref="PDB:5OSH"
FT                   /db_xref="PDB:5OSI"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q96QK1"
FT                   /protein_id="AAH10362.1"
FT                   /translation="MPTTQQSPQDEQEKLLDEAIQAVKVQSFQMKRCLDKNKLMDALKH
FT                   ASNMLGELRTSMLSPKSYYELYMAISDELHYLEVYLTDEFAKGRKVADLYELVQYAGNI
FT                   IPRLYLLITVGVVYVKSFPQSRKDILKDLVEMCRGVQHPLRGLFLRNYLLQCTRNILPD
FT                   EGEPTDEETTGDISDSMDFVLLNFAEMNKLWVRMQHQGHSRDREKRERERQELRILVGT
FT                   NLVRLSQLEGVNVERYKQIVLTGILEQVVNCRDALAQEYLMECIIQVFPDEFHLQTLNP
FT                   FLRACAELHQNVNVKNIIIALIDRLALFAHREDGPGIPADIKLFDIFSQQVATVIQSRQ
FT                   DMPSEDVVSLQVSLINLAMKCYPDRVDYVDKVLETTVEIFNKLNLEHIATSSAVSKELT
FT                   RLLKIPVDTYNNILTVLKLKHFHPLFEYFDYESRKSMSCYVLSNVLDYNTEIVFQDQVD
FT                   SIMNLVSTLIQDQPDQPVEDPDPEDFADEQSLVGRFIHLLRSEDPDQQYLILNTARKHF
FT                   GAGGNQRIRFTLPPLVFAAYQLAFRYKENSKVDDKWEKKCQKIFSFAHQTISALIKAEL
FT                   AELPLRLFLQGALAAGEIGFENHETVAYEFMSQAFSLYEDEISDSKAQLAAITLIIGTF
FT                   ERMKCFSEENHEPLRTQCALAASKLLKKPDQGRAVSTCAHLFWSGRNTDKNGEELHGGK
FT                   RVMECLKKALKIANQCMDPSLQVQLFIEILNRYIYFYEKENDAVTIQVLNQLIQKIRED
FT                   LPNLESSEETEQINKHFHNTLEHLRLRRESPESEGPIYEGLIL"
FT   misc_difference 1378
FT                   /gene="VPS35"
FT                   /note="'T' in cDNA is 'C' in the human genome; amino acid
FT                   difference: 'F' in cDNA, 'S' in the human genome. The
FT                   chimpanzee genome agrees with the human genomic sequence
FT                   and not the cDNA."
FT   misc_difference 1958
FT                   /gene="VPS35"
FT                   /note="'T' in cDNA is 'C' in the human genome; no amino
FT                   acid change. The chimpanzee genome agrees with the human
FT                   genomic sequence and not the cDNA."
FT   misc_difference 2670..2698
FT                   /gene="VPS35"
FT                   /note="polyA tail: 29 bases do not align to the human
FT                   genome."
XX
SQ   Sequence 2698 BP; 827 A; 535 C; 584 G; 752 T; 0 other;
     gggggctctg gggagtcgcc atgcctacaa cacagcagtc ccctcaggat gagcaggaaa        60
     agctcttgga tgaagccata caggctgtga aggtccagtc attccaaatg aagagatgcc       120
     tggacaaaaa caagcttatg gatgctctaa aacatgcttc taatatgctt ggtgaactcc       180
     ggacttctat gttatcacca aagagttact atgaacttta tatggccatt tctgatgaac       240
     tgcactactt ggaggtctac ctgacagatg agtttgctaa aggaaggaaa gtggcagatc       300
     tctacgaact tgtacagtat gctggaaaca ttatcccaag gctttacctt ttgatcacag       360
     ttggagttgt atatgtcaag tcatttcctc agtccaggaa ggatattttg aaagatttgg       420
     tagaaatgtg ccgtggtgtg caacatccct tgaggggtct gtttcttcga aattaccttc       480
     ttcagtgtac cagaaatatc ttacctgatg aaggagagcc aacagatgaa gaaacaactg       540
     gtgacatcag tgattccatg gattttgtac tgctcaactt tgcagaaatg aacaagctct       600
     gggtgcgaat gcagcatcag ggacatagcc gagatagaga aaaaagagaa cgagaaagac       660
     aagaactgag aattttagtg ggaacaaatt tggtgcgcct cagtcagttg gaaggtgtaa       720
     atgtggaacg ttacaaacag attgttttga ctggcatatt ggagcaagtt gtaaactgta       780
     gggatgcttt ggctcaagaa tatctcatgg agtgtattat tcaggttttc cctgatgaat       840
     ttcacctcca gactttgaat ccttttcttc gggcctgtgc tgagttacac cagaatgtaa       900
     atgtgaagaa cataatcatt gctttaattg atagattagc tttatttgct caccgtgaag       960
     atggacctgg aatcccagcg gatattaaac tttttgatat attttcacag caggtggcta      1020
     cagtgataca gtctagacaa gacatgcctt cagaggatgt tgtatcttta caagtctctc      1080
     tgattaatct tgccatgaaa tgttaccctg atcgtgtgga ctatgttgat aaagttctag      1140
     aaacaacagt ggagatattc aataagctca accttgaaca tattgctacc agtagtgcag      1200
     tttcaaagga actcaccaga cttttgaaaa taccagttga cacttacaac aatattttaa      1260
     cagtcttgaa attaaaacat tttcacccac tctttgagta ctttgactac gagtccagaa      1320
     agagcatgag ttgttatgtg cttagtaatg ttctggatta taacacagaa attgtctttc      1380
     aagaccaggt ggattccata atgaatttgg tatccacgtt gattcaagat cagccagatc      1440
     aacctgtaga agaccctgat ccagaagatt ttgctgatga gcagagcctt gtgggccgct      1500
     tcattcatct gctgcgctct gaggaccctg accagcagta cttgattttg aacacagcac      1560
     gaaaacattt tggagctggt ggaaatcagc ggattcgctt cacactgcca cctttggtat      1620
     ttgcagctta ccagctggct tttcgatata aagagaattc taaagtggat gacaaatggg      1680
     aaaagaaatg ccagaagatt ttttcatttg cccaccagac tatcagtgct ttgatcaaag      1740
     cagagctggc agaattgccc ttaagacttt ttcttcaagg agcactagct gctggggaaa      1800
     ttggttttga aaatcatgag acagtcgcat atgaattcat gtcccaggca ttttctctgt      1860
     atgaagatga aatcagcgat tccaaagcac agctagctgc catcaccttg atcattggca      1920
     cttttgaaag gatgaagtgc ttcagtgaag agaatcatga acctctgagg actcagtgtg      1980
     cccttgctgc atccaaactt ctaaagaaac ctgatcaggg ccgagctgtg agcacctgtg      2040
     cacatctctt ctggtctggc agaaacacgg acaaaaatgg ggaggagctt cacggaggca      2100
     agagggtaat ggagtgccta aaaaaagctc taaaaatagc aaatcagtgc atggacccct      2160
     ctctacaagt gcagcttttt atagaaattc tgaacagata tatctatttt tatgaaaagg      2220
     aaaatgatgc ggtaacaatt caggttttaa accagcttat ccaaaagatt cgagaagacc      2280
     tcccgaatct tgaatccagt gaagaaacag agcagattaa caaacatttt cataacacac      2340
     tggagcattt gcgcttgcgg cgggaatcac cagaatccga ggggccaatt tatgaaggtc      2400
     tcatccttta aaaaggaaat agctcaccat actcctttcc atgtacatcc agtgagggtt      2460
     ttattacgct aggtttccct tccatagatt gtgcctttca gaaatgctga ggtaggtttc      2520
     ccatttctta cctgtgatgt gttttaccca gcacctccgg acactcacct tcaggacctt      2580
     aataaaatta ttcacttggt aagtgttcaa gtctttctga tcaccccaag tagcatgact      2640
     gatctgcaat ttaaaattcc tgtgatctgc aaaaaaaaaa aaaaaaaaaa aaaaaaaa        2698
//