ID BC010362; SV 1; linear; mRNA; STD; HUM; 2698 BP.
XX
AC BC010362;
XX
DT 13-JUL-2001 (Rel. 68, Created)
DT 15-OCT-2008 (Rel. 97, Last updated, Version 8)
XX
DE Homo sapiens vacuolar protein sorting 35 homolog (S. cerevisiae), mRNA
DE (cDNA clone MGC:13402 IMAGE:4249949), complete cds.
XX
KW MGC.
XX
OS Homo sapiens (human)
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC Homo.
XX
RN [1]
RP 1-2698
RX DOI; 10.1073/pnas.242603899.
RX PUBMED; 12477932.
RG Mammalian Gene Collection Program Team
RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., Klausner R.D.,
RA Collins F.S., Wagner L., Shenmen C.M., Schuler G.D., Altschul S.F.,
RA Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., Hopkins R.F., Jordan H.,
RA Moore T., Max S.I., Wang J., Hsieh F., Diatchenko L., Marusina K.,
RA Farmer A.A., Rubin G.M., Hong L., Stapleton M., Soares M.B., Bonaldo M.F.,
RA Casavant T.L., Scheetz T.E., Brownstein M.J., Usdin T.B., Toshiyuki S.,
RA Carninci P., Prange C., Raha S.S., Loquellano N.A., Peters G.J.,
RA Abramson R.D., Mullahy S.J., Bosak S.A., McEwan P.J., McKernan K.J.,
RA Malek J.A., Gunaratne P.H., Richards S., Worley K.C., Hale S., Garcia A.M.,
RA Gay L.J., Hulyk S.W., Villalon D.K., Muzny D.M., Sodergren E.J., Lu X.,
RA Gibbs R.A., Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S.,
RA Sanchez A., Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., Rodriguez A.C.,
RA Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S., Krzywinski M.I.,
RA Skalska U., Smailus D.E., Schnerch A., Schein J.E., Jones S.J., Marra M.A.;
RT "Generation and initial analysis of more than 15,000 full-length human and
RT mouse cDNA sequences";
RL Proc. Natl. Acad. Sci. U.S.A. 99(26):16899-16903(2002).
XX
RN [2]
RC NIH-MGC Project URL: http://mgc.nci.nih.gov
RP 1-2698
RG NIH MGC Project
RA ;
RT ;
RL Submitted (09-JUL-2001) to the INSDC.
RL National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda,
RL MD 20892-2590, USA
XX
DR MD5; ee17e11f4905d0b573eeb37e8dd2982d.
DR Ensembl-Gn; ENSG00000069329; homo_sapiens.
DR Ensembl-Tr; ENST00000299138; homo_sapiens.
XX
CC Contact: MGC help desk
CC Email: cgapbs-r@mail.nih.gov
CC Tissue Procurement: CLONTECH
CC cDNA Library Preparation: CLONTECH Laboratories, Inc.
CC cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
CC DNA Sequencing by: Institute for Systems Biology
CC http://www.systemsbiology.org
CC contact: amadan@systemsbiology.org
CC Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
CC Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
CC Clone distribution: MGC clone distribution information can be found
CC through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
CC Series: IRAL Plate: 19 Row: i Column: 6
CC This clone was selected for full length sequencing because it
CC passed the following selection criteria: matched mRNA gi: 41352714.
CC Differences found between this sequence and the human reference
CC genome (build 36) are described in misc_difference features below
CC and these differences were also compared to chimpanzee genome
CC (build 1).
XX
FH Key Location/Qualifiers
FH
FT source 1..2698
FT /organism="Homo sapiens"
FT /lab_host="DH10B"
FT /mol_type="mRNA"
FT /clone_lib="NIH_MGC_83"
FT /clone="MGC:13402 IMAGE:4249949"
FT /tissue_type="Prostate"
FT /note="Vector: pDNR-LIB"
FT /db_xref="taxon:9606"
FT gene 1..2698
FT /gene="VPS35"
FT /note="synonyms: DKFZp434E1211, DKFZp434P1672, MEM3"
FT misc_difference 1..3
FT /gene="VPS35"
FT /note="3 bases at the 5' end do not align to the human
FT genome."
FT CDS 21..2411
FT /codon_start=1
FT /gene="VPS35"
FT /product="vacuolar protein sorting 35 homolog (S.
FT cerevisiae)"
FT /db_xref="GOA:Q96QK1"
FT /db_xref="H-InvDB:HIT000034898.15"
FT /db_xref="HGNC:HGNC:13487"
FT /db_xref="InterPro:IPR005378"
FT /db_xref="InterPro:IPR016024"
FT /db_xref="InterPro:IPR042491"
FT /db_xref="PDB:2R17"
FT /db_xref="PDB:5F0J"
FT /db_xref="PDB:5F0K"
FT /db_xref="PDB:5F0L"
FT /db_xref="PDB:5F0M"
FT /db_xref="PDB:5F0P"
FT /db_xref="PDB:5OSH"
FT /db_xref="PDB:5OSI"
FT /db_xref="UniProtKB/Swiss-Prot:Q96QK1"
FT /protein_id="AAH10362.1"
FT /translation="MPTTQQSPQDEQEKLLDEAIQAVKVQSFQMKRCLDKNKLMDALKH
FT ASNMLGELRTSMLSPKSYYELYMAISDELHYLEVYLTDEFAKGRKVADLYELVQYAGNI
FT IPRLYLLITVGVVYVKSFPQSRKDILKDLVEMCRGVQHPLRGLFLRNYLLQCTRNILPD
FT EGEPTDEETTGDISDSMDFVLLNFAEMNKLWVRMQHQGHSRDREKRERERQELRILVGT
FT NLVRLSQLEGVNVERYKQIVLTGILEQVVNCRDALAQEYLMECIIQVFPDEFHLQTLNP
FT FLRACAELHQNVNVKNIIIALIDRLALFAHREDGPGIPADIKLFDIFSQQVATVIQSRQ
FT DMPSEDVVSLQVSLINLAMKCYPDRVDYVDKVLETTVEIFNKLNLEHIATSSAVSKELT
FT RLLKIPVDTYNNILTVLKLKHFHPLFEYFDYESRKSMSCYVLSNVLDYNTEIVFQDQVD
FT SIMNLVSTLIQDQPDQPVEDPDPEDFADEQSLVGRFIHLLRSEDPDQQYLILNTARKHF
FT GAGGNQRIRFTLPPLVFAAYQLAFRYKENSKVDDKWEKKCQKIFSFAHQTISALIKAEL
FT AELPLRLFLQGALAAGEIGFENHETVAYEFMSQAFSLYEDEISDSKAQLAAITLIIGTF
FT ERMKCFSEENHEPLRTQCALAASKLLKKPDQGRAVSTCAHLFWSGRNTDKNGEELHGGK
FT RVMECLKKALKIANQCMDPSLQVQLFIEILNRYIYFYEKENDAVTIQVLNQLIQKIRED
FT LPNLESSEETEQINKHFHNTLEHLRLRRESPESEGPIYEGLIL"
FT misc_difference 1378
FT /gene="VPS35"
FT /note="'T' in cDNA is 'C' in the human genome; amino acid
FT difference: 'F' in cDNA, 'S' in the human genome. The
FT chimpanzee genome agrees with the human genomic sequence
FT and not the cDNA."
FT misc_difference 1958
FT /gene="VPS35"
FT /note="'T' in cDNA is 'C' in the human genome; no amino
FT acid change. The chimpanzee genome agrees with the human
FT genomic sequence and not the cDNA."
FT misc_difference 2670..2698
FT /gene="VPS35"
FT /note="polyA tail: 29 bases do not align to the human
FT genome."
XX
SQ Sequence 2698 BP; 827 A; 535 C; 584 G; 752 T; 0 other;
gggggctctg gggagtcgcc atgcctacaa cacagcagtc ccctcaggat gagcaggaaa 60
agctcttgga tgaagccata caggctgtga aggtccagtc attccaaatg aagagatgcc 120
tggacaaaaa caagcttatg gatgctctaa aacatgcttc taatatgctt ggtgaactcc 180
ggacttctat gttatcacca aagagttact atgaacttta tatggccatt tctgatgaac 240
tgcactactt ggaggtctac ctgacagatg agtttgctaa aggaaggaaa gtggcagatc 300
tctacgaact tgtacagtat gctggaaaca ttatcccaag gctttacctt ttgatcacag 360
ttggagttgt atatgtcaag tcatttcctc agtccaggaa ggatattttg aaagatttgg 420
tagaaatgtg ccgtggtgtg caacatccct tgaggggtct gtttcttcga aattaccttc 480
ttcagtgtac cagaaatatc ttacctgatg aaggagagcc aacagatgaa gaaacaactg 540
gtgacatcag tgattccatg gattttgtac tgctcaactt tgcagaaatg aacaagctct 600
gggtgcgaat gcagcatcag ggacatagcc gagatagaga aaaaagagaa cgagaaagac 660
aagaactgag aattttagtg ggaacaaatt tggtgcgcct cagtcagttg gaaggtgtaa 720
atgtggaacg ttacaaacag attgttttga ctggcatatt ggagcaagtt gtaaactgta 780
gggatgcttt ggctcaagaa tatctcatgg agtgtattat tcaggttttc cctgatgaat 840
ttcacctcca gactttgaat ccttttcttc gggcctgtgc tgagttacac cagaatgtaa 900
atgtgaagaa cataatcatt gctttaattg atagattagc tttatttgct caccgtgaag 960
atggacctgg aatcccagcg gatattaaac tttttgatat attttcacag caggtggcta 1020
cagtgataca gtctagacaa gacatgcctt cagaggatgt tgtatcttta caagtctctc 1080
tgattaatct tgccatgaaa tgttaccctg atcgtgtgga ctatgttgat aaagttctag 1140
aaacaacagt ggagatattc aataagctca accttgaaca tattgctacc agtagtgcag 1200
tttcaaagga actcaccaga cttttgaaaa taccagttga cacttacaac aatattttaa 1260
cagtcttgaa attaaaacat tttcacccac tctttgagta ctttgactac gagtccagaa 1320
agagcatgag ttgttatgtg cttagtaatg ttctggatta taacacagaa attgtctttc 1380
aagaccaggt ggattccata atgaatttgg tatccacgtt gattcaagat cagccagatc 1440
aacctgtaga agaccctgat ccagaagatt ttgctgatga gcagagcctt gtgggccgct 1500
tcattcatct gctgcgctct gaggaccctg accagcagta cttgattttg aacacagcac 1560
gaaaacattt tggagctggt ggaaatcagc ggattcgctt cacactgcca cctttggtat 1620
ttgcagctta ccagctggct tttcgatata aagagaattc taaagtggat gacaaatggg 1680
aaaagaaatg ccagaagatt ttttcatttg cccaccagac tatcagtgct ttgatcaaag 1740
cagagctggc agaattgccc ttaagacttt ttcttcaagg agcactagct gctggggaaa 1800
ttggttttga aaatcatgag acagtcgcat atgaattcat gtcccaggca ttttctctgt 1860
atgaagatga aatcagcgat tccaaagcac agctagctgc catcaccttg atcattggca 1920
cttttgaaag gatgaagtgc ttcagtgaag agaatcatga acctctgagg actcagtgtg 1980
cccttgctgc atccaaactt ctaaagaaac ctgatcaggg ccgagctgtg agcacctgtg 2040
cacatctctt ctggtctggc agaaacacgg acaaaaatgg ggaggagctt cacggaggca 2100
agagggtaat ggagtgccta aaaaaagctc taaaaatagc aaatcagtgc atggacccct 2160
ctctacaagt gcagcttttt atagaaattc tgaacagata tatctatttt tatgaaaagg 2220
aaaatgatgc ggtaacaatt caggttttaa accagcttat ccaaaagatt cgagaagacc 2280
tcccgaatct tgaatccagt gaagaaacag agcagattaa caaacatttt cataacacac 2340
tggagcattt gcgcttgcgg cgggaatcac cagaatccga ggggccaatt tatgaaggtc 2400
tcatccttta aaaaggaaat agctcaccat actcctttcc atgtacatcc agtgagggtt 2460
ttattacgct aggtttccct tccatagatt gtgcctttca gaaatgctga ggtaggtttc 2520
ccatttctta cctgtgatgt gttttaccca gcacctccgg acactcacct tcaggacctt 2580
aataaaatta ttcacttggt aagtgttcaa gtctttctga tcaccccaag tagcatgact 2640
gatctgcaat ttaaaattcc tgtgatctgc aaaaaaaaaa aaaaaaaaaa aaaaaaaa 2698
//