ID GQ404846; SV 1; circular; genomic DNA; STD; VRL; 1740 BP. XX AC GQ404846; XX DT 15-MAR-2010 (Rel. 104, Created) DT 15-MAR-2010 (Rel. 104, Last updated, Version 1) XX DE Cyclovirus PK5222, complete genome. XX KW . XX OS Cyclovirus PK5222 OC Viruses; ssDNA viruses; Circoviridae; unclassified Circoviridae. XX RN [1] RP 1-1740 RX DOI; 10.1128/JVI.02109-09. RX PUBMED; 20007276. RA Li L., Kapoor A., Slikas B., Bamidele O.S., Wang C., Shaukat S., RA Masroor M.A., Wilson M.L., Ndjango J.B., Peeters M., Gross-Camp N.D., RA Muller M.N., Hahn B.H., Wolfe N.D., Triki H., Bartkus J., Zaidi S.Z., RA Delwart E.; RT "Multiple diverse circoviruses infect farm animals and are commonly found RT in human and chimpanzee feces"; RL J. Virol. 84(4):1674-1682(2010). XX RN [2] RP 1-1740 RA Li L., Kapoor A., Delwart E.; RT ; RL Submitted (23-JUL-2009) to the INSDC. RL Department of Laboratory Medicine, University of California, San Francisco, RL Blood Systems Research Institute, 270 Masonic Ave, San Francisco, CA 94118, RL USA XX FH Key Location/Qualifiers FH FT source 1..1740 FT /organism="Cyclovirus PK5222" FT /host="Homo sapiens" FT /isolate="PK5222" FT /mol_type="genomic DNA" FT /country="Pakistan" FT /isolation_source="stool" FT /collection_date="2006" FT /db_xref="taxon:742917" FT gene 23..862 FT /gene="rep" FT CDS 23..862 FT /codon_start=1 FT /gene="rep" FT /product="replication-association protein" FT /db_xref="GOA:D4N3P0" FT /db_xref="InterPro:IPR000605" FT /db_xref="InterPro:IPR003365" FT /db_xref="UniProtKB/TrEMBL:D4N3P0" FT /protein_id="ADD62455.1" FT /translation="MANRTVRRFCFTWNNYPDTAYEKCETFIQKFCKYGIVGEELAPTT FT GTPHLQGFCNLRKPTRFSTIKKHLDNSIHIEKANGSDEQNQAYCSKTGIFFEEGSPTKQ FT GQRNDLQLLVDDVIGHPDIKTKDIAAKYPSTFVRYFRGIKELQRVVHPVAERMFKTEVY FT YYWGPPGTGKSRRALEEAQATGNGSIYYKPRGLWWDGYEQQDSVIIDDFYGWIKYDELL FT KITDRYPYKVQIKGGFEEFTSKKIFITSNVDTDELYKFNGYNTAALERRLTVKEYMS" FT gene complement(859..1515) FT /gene="cap" FT CDS complement(859..1515) FT /codon_start=1 FT /gene="cap" FT /product="capsid protein" FT /db_xref="UniProtKB/TrEMBL:D4N3P1" FT /protein_id="ADD62456.1" FT /translation="MALRRYRRVRRPLRRYRRRRRKFFRRKSRVGKSDLLVKLTKIDTV FT NVDISKINVWSLSYLPTDFSEYNELKKNFEYCVFLKERVTIYPMQNIANNSTSQVPAYL FT MAPWHRGGPASATFNTYLTIDRAKIFRGTQVGSQTYVPSVLIEGLSDSGGGTSSPDMIQ FT FRPKVYISNSQTRAIRIYNGIIAFQGNGEMTGTARYNVKHDVWVVFKKQNTLASL" XX SQ Sequence 1740 BP; 536 A; 402 C; 357 G; 445 T; 0 other; atacccgtca cttcgttgtg aaatggcaaa ccgtacagtg cgccgattct gctttacatg 60 gaacaactat cctgacacag cgtacgaaaa gtgtgaaaca ttcatacaaa agttctgcaa 120 atatggaatc gtgggagaag aactcgctcc aacaacaggg acgcctcacc tacaagggtt 180 ttgtaactta cgtaaaccca cacgcttcag taccatcaag aagcatctcg ataactcaat 240 ccatattgag aaggcaaacg gatccgacga acagaaccaa gcatactgtt cgaagacagg 300 catatttttt gaagagggta gtcccactaa acaagggcaa cgcaacgatt tacaacttct 360 cgtggatgat gttatcggac accctgatat taaaacaaaa gatattgccg ccaaatatcc 420 ttctacattt gttagatact tcagaggaat taaagaatta caacgagtgg ttcatccagt 480 ggcagaaaga atgttcaaaa ctgaggtgta ctattattgg ggacctcctg gaactggcaa 540 aagtagaaga gccctagaag aagcccaggc gacaggaaac ggatcgatat actacaaacc 600 acgcggcctc tggtgggacg ggtacgaaca acaagattcc gttattatag acgacttcta 660 cggatggata aaatatgacg aactacttaa aattacagac cgctacccat acaaggtaca 720 aataaaagga ggattcgaag aatttacatc aaaaaaaatt tttattacat caaacgttga 780 tacagatgaa ctttacaaat ttaatggtta caacacagca gctttagaac gccgtttaac 840 cgtcaaggaa tatatgtctt aaagcgacgc caatgtattc tgtttcttaa atacaaccca 900 tacatcatgt ttaacattat atctcgctgt tccagtcatt tcaccattcc cttgaaaagc 960 tattattcca ttataaatcc ttatagctct tgtctgagaa ttagatatat atacttttgg 1020 cctaaactgt atcatatcgg gtgaggatgt accacctccg gaatctgata gtccctctat 1080 taatactgaa ggtacatatg tctgcgatcc aacttgagta cctctaaata ttttagctct 1140 atcgattgtc aaataagtat taaacgtagc tgatgctggt cctccgcgat gccatggggc 1200 catcaaatat gccggtactt gacttgtcga attattagca atattctgca tcggatatat 1260 agtaactctc tctttcaaaa aaacacaata ttcaaaattc tttttcaact cattgtactc 1320 cgaaaaatcc gtcgggagat atgacaaaga ccacacgtta attttagaaa tgtcaacatt 1380 aaccgtatcg attttggtaa gtttaaccaa gagatcagat tttccgacgc gacttttacg 1440 gcgaaaaaac ttgcgacggc gtcgacgata ccgccgcagc ggtcgccgaa cccgtctata 1500 acgcctcaag gccatgcctt tcactttgac gctcggtata aaataactcc acttcaggcg 1560 gctgcctgtt gccaagtgaa cgttgcgccc cccccctcgg ccggggagcc tgcccggccg 1620 gtcccccccc cgccggccgg gcggcccccg gcggggggag ggcgtgcgtg ttgaaacttg 1680 tttgggcggt tgagtcagac cagttacgaa gttacggtta cgaagtgacg gggtaatact 1740 //