ID   J02451; SV 1; circular; genomic DNA; STD; PHG; 6408 BP.
XX
AC   J02451; M10731; M10767; M21666-M21670; M25198; V00602;
XX
DT   01-OCT-1996 (Rel. 49, Created)
DT   08-NOV-2021 (Rel. 144, Last updated, Version 4)
XX
DE   Escherichia phage fd strain 478, complete genome.
XX
KW   .
XX
OS   Escherichia phage fd
OC   Viruses; Monodnaviria; Loebvirae; Hofneiviricota; Faserviricetes;
OC   Tubulavirales; Inoviridae; Inovirus.
XX
RN   [1]
RX   DOI; 10.1093/nar/2.11.2091.
RX   PUBMED; 1052531.
RA   Sugimoto K., Sugisaki H., Okamoto T., Takanami M.;
RT   "Studies on bacteriophage fd DNA. III. Nucleotide sequence preceding the
RT   RNA start-site on a promoter-containing fragment";
RL   Nucleic Acids Res. 2(11):2091-2100(1975).
XX
RN   [2]
RP   402-443
RX   DOI; 10.1073/pnas.72.2.737.
RX   PUBMED; 1054851.
RA   Schaller H., Gray C., Herrmann K.;
RT   "Nucleotide sequence of an RNA polymerase binding site from the DNA of
RT   bacteriophage fd";
RL   Proc. Natl. Acad. Sci. U.S.A. 72(2):737-741(1975).
XX
RN   [3]
RP   1196-1564
RX   PUBMED; 864706.
RA   Sugimoto K., Sugisaki H., Okamoto T., Takanami M.;
RT   "Studies on bacteriophage fd DNA. IV. The sequence of messenger RNA for the
RT   major coat protein gene";
RL   J. Mol. Biol. 111(4):487-507(1977).
XX
RN   [4]
RP   1-6408
RX   PUBMED; 745987.
RX   DOI; 10.1093/nar/5.12.4495.
RA   Beck E., Sommer R., Auerswald E.A., Kurz C., Zink B., Osterburg G.,
RA   Schaller H., Sugimoto K., Sugisaki H., Okamoto T., Takanami M.;
RT   "Nucleotide sequence of bacteriophage fd DNA";
RL   Nucleic Acids Res. 5(12):4495-4503(1978).
XX
RN   [5]
RP   5585-5771
RX   DOI; 10.1073/pnas.75.1.50.
RX   PUBMED; 272666.
RA   Gray C.P., Sommer R., Polke C., Beck E., Schaller H.;
RT   "Structure of the origin of DNA replication of bacteriophage fd";
RL   Proc. Natl. Acad. Sci. U.S.A. 75(1):50-53(1978).
XX
RN   [6]
RX   PUBMED; 2457024.
RA   Horabin J.I., Webster R.E.;
RT   "An amino acid sequence which directs membrane insertion causes loss of
RT   membrane potential";
RL   J Biol Chem 263(23):11575-11583(1988).
XX
DR   MD5; 3f765ff7eed0713be5edfe0b37bb2ac0.
DR   EuropePMC; PMC3187576; 21972239.
XX
CC   On Apr 28, 2004 this sequence version replaced V00602.1.
CC   [6] sites; internal signal sequence.
CC   [1] sites.
CC   Single-stranded circular DNA which codes for ten proteins
CC   replicative form is duplex. Filamentous.
CC   [3] missing data project.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..6408
FT                   /organism="Escherichia phage fd"
FT                   /strain="478"
FT                   /mol_type="genomic DNA"
FT                   /db_xref="taxon:2847073"
FT   CDS             join(6007..6408,1..831)
FT                   /codon_start=1
FT                   /transl_table=11
FT                   /product="II"
FT                   /note="rf replication; nicking"
FT                   /db_xref="GOA:P69545"
FT                   /db_xref="InterPro:IPR006516"
FT                   /db_xref="InterPro:IPR022686"
FT                   /db_xref="InterPro:IPR022688"
FT                   /db_xref="UniProtKB/Swiss-Prot:P69545"
FT                   /protein_id="AAA32303.1"
FT                   /translation="MIDMLVLRLPFIDSLVCSRLSGNDLIAFVDLSKIATLSGMNLSAR
FT                   TVEYHIDGDLTVSGLSHPFESLPTHYSGIAFKIYEGSKNFYPCVEIKASPAKVLQGHNV
FT                   FGTTDLALCSEALLLNFANSLPCLYDLLDVNATTISRIDATFSARAPNENIAKQVIDHL
FT                   RNVSNGQTKSTRSQNWESTVTWNETSRHRTLVAYLKHVELQHQIQQLSSKPSAKMTSYQ
FT                   KEQLKVLSNPDLLEFASGLVRFEARIETRYLKSFGLPLNLFDAIRFASDYNRQGKDLIF
FT                   DLWSFSFSELFKAFEGDSMNIYDDSAVLDAIQSKHFTITPSGKTSFAKASRYFGFYRRL
FT                   VNEGYDSVALTMPRNSFWRYVSALVECGIPKSQLMNLSTCNNVVPLVRFINVDFSSQRP
FT                   DWYNEPVLKIA"
FT   CDS             496..831
FT                   /codon_start=1
FT                   /transl_table=11
FT                   /product="X"
FT                   /db_xref="GOA:P69545"
FT                   /db_xref="InterPro:IPR006516"
FT                   /db_xref="InterPro:IPR022686"
FT                   /db_xref="InterPro:IPR022688"
FT                   /db_xref="UniProtKB/Swiss-Prot:P69545"
FT                   /protein_id="AAA32304.1"
FT                   /translation="MNIYDDSAVLDAIQSKHFTITPSGKTSFAKASRYFGFYRRLVNEG
FT                   YDSVALTMPRNSFWRYVSALVECGIPKSQLMNLSTCNNVVPLVRFINVDFSSQRPDWYN
FT                   EPVLKIA"
FT   CDS             843..1106
FT                   /codon_start=1
FT                   /transl_table=11
FT                   /product="V"
FT                   /note="single-stranded binding protein"
FT                   /db_xref="GOA:P69542"
FT                   /db_xref="InterPro:IPR003512"
FT                   /db_xref="InterPro:IPR012340"
FT                   /db_xref="PDB:2GN5"
FT                   /db_xref="UniProtKB/Swiss-Prot:P69542"
FT                   /protein_id="AAA32305.1"
FT                   /translation="MIKVEIKPSQAQFTTRSGVSRQGKPYSLNEQLCYVDLGNEYPVLV
FT
KITLDEGQPAYAPGLYTVHLSSFKVGQFGSLMIDRLRLVPAK"
FT   CDS             1108..1209
FT                   /codon_start=1
FT                   /transl_table=11
FT                   /product="VII"
FT                   /function="morphogenesis"
FT                   /db_xref="GOA:P69533"
FT                   /db_xref="InterPro:IPR031377"
FT                   /db_xref="UniProtKB/Swiss-Prot:P69533"
FT                   /protein_id="AAA32306.1"
FT                   /translation="MEQVADFDTIYQAMIQISVVLCFALGIIAGGQR"
FT   CDS             1206..1304
FT                   /codon_start=1
FT                   /transl_table=11
FT                   /product="IX"
FT                   /note="minor coat protein"
FT                   /db_xref="GOA:P69536"
FT                   /db_xref="UniProtKB/Swiss-Prot:P69536"
FT                   /protein_id="AAA32307.1"
FT                   /translation="MSVLVYSFASFVLGWCLRSGITYFTRLMETSS"
FT   CDS             1301..1522
FT                   /codon_start=1
FT                   /transl_table=11
FT                   /product="VIII"
FT                   /note="major coat protein"
FT                   /db_xref="GOA:P69539"
FT                   /db_xref="InterPro:IPR008020"
FT                   /db_xref="InterPro:IPR023390"
FT                   /db_xref="PDB:1FDM"
FT                   /db_xref="PDB:1IFD"
FT                   /db_xref="PDB:1IFI"
FT                   /db_xref="PDB:1IFJ"
FT                   /db_xref="PDB:1MZT"
FT                   /db_xref="PDB:2C0W"
FT                   /db_xref="PDB:2C0X"
FT                   /db_xref="PDB:2HI5"
FT                   /db_xref="UniProtKB/Swiss-Prot:P69539"
FT                   /protein_id="AAA32308.1"
FT                   /translation="MKKSLVLKASVAVATLVPMLSFAAEGDDPAKAAFDSLQASATEYI
FT                   GYAWAMVVVIVGATIGIKLFKKFTSKAS"
FT   CDS             1579..2853
FT                   /codon_start=1
FT                   /transl_table=11
FT                   /product="III"
FT                   /note="adsorption protein"
FT                   /db_xref="GOA:P03661"
FT                   /db_xref="InterPro:IPR008021"
FT                   /db_xref="InterPro:IPR013834"
FT                   /db_xref="InterPro:IPR036200"
FT                   /db_xref="PDB:1FGP"
FT                   /db_xref="PDB:2G3P"
FT                   /db_xref="PDB:3DGS"
FT                   /db_xref="PDB:3KNQ"
FT                   /db_xref="UniProtKB/Swiss-Prot:P03661"
FT                   /protein_id="AAA32309.1"
FT                   /translation="MKKLLFAIPLVVPFYSHSAETVESCLAKPHTENSFTNVWKDDKTL
FT                   DRYANYEGCLWNATGVVVCTGDETQCYGTWVPIGLAIPENEGGGSEGGGSEGGGSEGGG
FT                   TKPPEYGDTPIPGYTYINPLDGTYPPGTEQNPANPNPSLEESQPLNTFMFQNNRFRNRQ
FT                   GALTVYTGTVTQGTDPVKTYYQYTPVSSKAMYDAYWNGKFRDCAFHSGFNEDPFVCEYQ
FT                   GQSSDLPQPPVNAGGGSGGGSGGGSEGGGSEGGGSEGGGSEGGGSGGGSGSGDFDYEKM
FT                   ANANKGAMTENADENALQSDAKGKLDSVATDYGAAIDGFIGDVSGLANGNGATGDFAGS
FT                   NSQMAQVGDGDNSPLMNNFRQYLPSLPQSVECRPYVFGAGKPYEFSIDCDKINLFRGVF
FT                   AFLLYVATFMYVFSTFANILRNKES"
FT   CDS
2856..3194
FT                   /codon_start=1
FT                   /transl_table=11
FT                   /product="VI"
FT                   /function="morphogenesis"
FT                   /db_xref="GOA:P69530"
FT                   /db_xref="InterPro:IPR035210"
FT                   /db_xref="UniProtKB/Swiss-Prot:P69530"
FT                   /protein_id="AAA32310.1"
FT                   /translation="MPVLLGIPLLLRFLGFLLVTLFGYLLTFLKKGFGKIAIAISLFLA
FT                   LIIGLNSILVGYLSDISAQLPSDFVQGVQLILPSNALPCFYVILSVKAAIFIFDVKQKI
FT                   VSYLDWDK"
FT   CDS             3197..4243
FT                   /codon_start=1
FT                   /transl_table=11
FT                   /product="I"
FT                   /function="morphogenesis"
FT                   /db_xref="GOA:P03655"
FT                   /db_xref="InterPro:IPR008900"
FT                   /db_xref="UniProtKB/Swiss-Prot:P03655"
FT                   /protein_id="AAA32311.1"
FT                   /translation="MAVYFVTGKLGSGKTLVSVGKIQDKIVAGCKIATNLDLRLQNLPQ
FT                   VGRFAKTPRVLRIPDKPSISDLLAIGRGNDSYDENKNGLLVLDECGTWFNTRSWNDKER
FT                   QPIIDWFLHARKLGWDIIFLVQDLSIVDKQARSALAEHVVYCRRLDRITLPFVGTLYSL
FT                   VTGSKMPLPKLHVGVVKYGDSQLSPTVERWLYTGKNLYNAYDTKQAFSSNYDSGVYSYL
FT                   TPYLSHGRYFKPLNLGQKMKLTKIYLKKFSRVLCLAIGFASAFTYSYITQPKPEVKKVV
FT                   SQTYDFDKFTIDSSQRLNLSYRYVFKDSKGKLINSDDLQKQGYSITYIDLCTVSIKKGN
FT                   SNEIVKCN"
FT   misc_feature    3817..4081
FT                   /note="internal signal sequence for gene I product [6]"
FT   CDS             4221..5501
FT                   /codon_start=1
FT                   /transl_table=11
FT                   /product="IV"
FT                   /function="morphogenesis"
FT                   /db_xref="GOA:P03664"
FT                   /db_xref="InterPro:IPR001775"
FT                   /db_xref="InterPro:IPR004845"
FT                   /db_xref="InterPro:IPR004846"
FT                   /db_xref="InterPro:IPR038591"
FT                   /db_xref="UniProtKB/Swiss-Prot:P03664"
FT                   /protein_id="AAA32312.1"
FT                   /translation="MKLLNVINFVFLMFVSSSSFAQVIEMNNSPLRDFVTWYSKQTGES
FT                   VIVSPDVKGTVTVYSSDVKPENLRNFFISVLRANNFDMVGSIPSIIQKYNPNSQDYIDE
FT                   LPSSDIQEYDDNSAPSGGFFVPQNDNVTQTFKINNVRAKDLIRVVELFVKSNTSKSSNV
FT                   LSVDGSNLLVVSAPKDILDNLPQFLSTVDLPTDQILIEGLIFEVQQGDALDFSFAAGSQ
FT                   RGTVAGGVNTDRLTSVLSSAGGSFGIFNGDVLGLSVRALKTNSHSKILSVPRILTLSGQ
FT                   KGSISVGQNVPFITGRVTGESANVNNPFQTVERQNVGISMSVFPVAMAGGNIVLDITSK
FT                   ADSLSSSTQASDVITNQRSIATTVNLRDGQTLLLGGLTDYKNTSQDSGVPFLSKIPLIG
FT                   LLFSSRSDSNEESTLYVLVKATIVRAL"
FT   rep_origin      5782
FT                   /note="origin of viral strand synthesis"
XX
SQ   Sequence 6408 BP; 1578 A; 1295 C; 1325 G; 2210 T; 0 other;
aacgctacta ccattagtag aattgatgcc accttttcag ctcgcgcccc aaatgaaaat 60 atagctaaac aggttattga ccatttgcga aatgtatcta atggtcaaac taaatctact 120 cgttcgcaga attgggaatc aactgttaca tggaatgaaa cttccagaca ccgtacttta 180 gttgcatatt taaaacatgt tgaactacag caccagattc agcaattaag ctctaagcca 240 tccgcaaaaa tgacctctta tcaaaaggag caattaaagg tactgtctaa tcctgacctg 300 ttggaatttg cttccggtct ggttcgcttt gaggctcgaa ttgaaacgcg atatttgaag 360 tctttcgggc ttcctcttaa tctttttgat gcaattcgct ttgcttctga ctataataga 420 cagggtaaag acctgatttt tgatttatgg tcattctcgt tttctgaact gtttaaagca 480 tttgaggggg attcaatgaa tatttatgac gattccgcag tattggacgc tatccagtct 540 aaacatttta caattacccc ctctggcaaa acttcctttg caaaagcctc tcgctatttt 600 ggtttctatc gtcgtctggt taatgagggt tatgatagtg ttgctcttac catgcctcgt 660 aattcctttt ggcgttatgt atctgcatta gttgagtgtg gtattcctaa atctcaattg 720 atgaatcttt ccacctgtaa taatgttgtt ccgttagttc gttttattaa cgtagatttt 780 tcctcccaac gtcctgactg gtataatgag ccagttctta aaatcgcata aggtaattca 840 aaatgattaa agttgaaatt aaaccgtctc aagcgcaatt tactacccgt tctggtgttt 900 ctcgtcaggg caagccttat tcactgaatg agcagctttg ttacgttgat ttgggtaatg 960 aatatccggt gcttgtcaag attactctcg acgaaggtca gccagcgtat gcgcctggtc 1020 tgtacaccgt gcatctgtcc tcgttcaaag ttggtcagtt cggttctctt atgattgacc 1080 gtctgcgcct cgttccggct aagtaacatg gagcaggtcg cggatttcga cacaatttat 1140 caggcgatga tacaaatctc cgttgtactt tgtttcgcgc ttggtataat cgctgggggt 1200 caaagatgag tgttttagtg tattctttcg cctctttcgt tttaggttgg tgccttcgta 1260 gtggcattac gtattttacc cgtttaatgg aaacttcctc atgaaaaagt ctttagtcct 1320 caaagcctcc gtagccgttg ctaccctcgt tccgatgctg tctttcgctg ctgagggtga 1380 cgatcccgca aaagcggcct ttgactccct gcaagcctca gcgaccgaat atatcggtta 1440 tgcgtgggcg atggttgttg tcattgtcgg cgcaactatc ggtatcaagc tgtttaagaa 1500 attcacctcg aaagcaagct gataaaccga tacaattaaa ggctcctttt ggagcctttt 1560 tttttggaga ttttcaacgt gaaaaaatta ttattcgcaa ttcctttagt tgttcctttc 1620 tattctcact ccgctgaaac tgttgaaagt tgtttagcaa aacctcatac agaaaattca 1680 tttactaacg tctggaaaga 
cgacaaaact ttagatcgtt acgctaacta tgagggctgt 1740 ctgtggaatg ctacaggcgt tgtggtttgt actggtgacg aaactcagtg ttacggtaca 1800 tgggttccta ttgggcttgc tatccctgaa aatgagggtg gtggctctga gggtggcggt 1860 tctgagggtg gcggttctga gggtggcggt actaaacctc ctgagtacgg tgatacacct 1920 attccgggct atacttatat caaccctctc gacggcactt atccgcctgg tactgagcaa 1980 aaccccgcta atcctaatcc ttctcttgag gagtctcagc ctcttaatac tttcatgttt 2040 cagaataata ggttccgaaa taggcagggt gcattaactg tttatacggg cactgttact 2100 caaggcactg accccgttaa aacttattac cagtacactc ctgtatcatc aaaagccatg 2160 tatgacgctt actggaacgg taaattcaga gactgcgctt tccattctgg ctttaatgag 2220 gatccattcg tttgtgaata tcaaggccaa tcgtctgacc tgcctcaacc tcctgtcaat 2280 gctggcggcg gctctggtgg tggttctggt ggcggctctg agggtggcgg ctctgagggt 2340 ggcggttctg agggtggcgg ctctgagggt ggcggttccg gtggcggctc cggttccggt 2400 gattttgatt atgaaaaaat ggcaaacgct aataaggggg ctatgaccga aaatgccgat 2460 gaaaacgcgc tacagtctga cgctaaaggc aaacttgatt ctgtcgctac tgattacggt 2520 gctgctatcg atggtttcat tggtgacgtt tccggccttg ctaatggtaa tggtgctact 2580 ggtgattttg ctggctctaa ttcccaaatg gctcaagtcg gtgacggtga taattcacct 2640 ttaatgaata atttccgtca atatttacct tctttgcctc agtcggttga atgtcgccct 2700 tatgtctttg gcgctggtaa accatatgaa ttttctattg attgtgacaa aataaactta 2760 ttccgtggtg tctttgcgtt tcttttatat gttgccacct ttatgtatgt attttcgacg 2820 tttgctaaca tactgcgtaa taaggagtct taatcatgcc agttcttttg ggtattccgt 2880 tattattgcg tttcctcggt ttccttctgg taactttgtt cggctatctg cttactttcc 2940 ttaaaaaggg cttcggtaag atagctattg ctatttcatt gtttcttgct cttattattg 3000 ggcttaactc aattcttgtg ggttatctct ctgatattag cgcacaatta ccctctgatt 3060 ttgttcaggg cgttcagtta attctcccgt ctaatgcgct tccctgtttt tatgttattc 3120 tctctgtaaa ggctgctatt ttcatttttg acgttaaaca aaaaatcgtt tcttatttgg 3180 attgggataa ataaatatgg ctgtttattt tgtaactggc aaattaggct ctggaaagac 3240 gctcgttagc gttggtaaga ttcaggataa aattgtagct gggtgcaaaa tagcaactaa 3300 tcttgattta aggcttcaaa acctcccgca agtcgggagg ttcgctaaaa cgcctcgcgt 3360 tcttagaata ccggataagc cttctatttc 
tgatttgctt gctattggtc gtggtaatga 3420 ttcctacgac gaaaataaaa acggtttgct tgttcttgat gaatgcggta cttggtttaa 3480 tacccgttca tggaatgaca aggaaagaca gccgattatt gattggtttc ttcatgctcg 3540 taaattggga tgggatatta tttttcttgt tcaggattta tctattgttg ataaacaggc 3600 gcgttctgca ttagctgaac acgttgttta ttgtcgccgt ctggacagaa ttactttacc 3660 ctttgtcggc actttatatt ctcttgttac tggctcaaaa atgcctctgc ctaaattaca 3720 tgttggtgtt gttaaatatg gtgattctca attaagccct actgttgagc gttggcttta 3780 tactggtaag aatttatata acgcatatga cactaaacag gctttttcca gtaattatga 3840 ttcaggtgtt tattcatatt taacccctta tttatcacac ggtcggtatt tcaaaccatt 3900 aaatttaggt cagaagatga aattaactaa aatatatttg aaaaagtttt ctcgcgttct 3960 ttgtcttgcg ataggatttg catcagcatt tacatatagt tatataaccc aacctaagcc 4020 ggaggttaaa aaggtagtct ctcagaccta tgattttgat aaattcacta ttgactcttc 4080 tcagcgtctt aatctaagct atcgctatgt tttcaaggat tctaagggaa aattaattaa 4140 tagcgacgat ttacagaagc aaggttattc catcacatat attgatttat gtactgtttc 4200 aattaaaaaa ggtaattcaa atgaaattgt taaatgtaat taattttgtt ttcttgatgt 4260 ttgtttcatc atcttctttt gctcaagtaa ttgaaatgaa taattcgcct ctgcgcgatt 4320 tcgtgacttg gtattcaaag caaacaggtg aatctgttat tgtctcacct gatgttaaag 4380 gtacagtgac tgtatattcc tctgacgtta agcctgaaaa tttacgcaat ttctttatct 4440 ctgttttacg tgctaataat tttgatatgg ttggctcaat tccttccata attcagaaat 4500 ataacccaaa tagtcaggat tatattgatg aattgccatc atctgatatt caggaatatg 4560 atgataattc cgctccttct ggtggtttct ttgttccgca aaatgataat gttactcaaa 4620 catttaaaat taataacgtt cgcgcaaagg atttaataag ggttgtagaa ttgtttgtta 4680 aatctaatac atctaaatcc tcaaatgtat tatctgttga tggttctaac ttattagtag 4740 ttagcgcccc taaagatatt ttagataacc ttccgcaatt tctttctact gttgatttgc 4800 caactgacca gatattgatt gaaggattaa ttttcgaggt tcagcaaggt gatgctttag 4860 atttttcctt tgctgctggc tctcagcgcg gcactgttgc tggtggtgtt aatactgacc 4920 gtctaacctc tgttttatct tctgcgggtg gttcgttcgg tatttttaac ggcgatgttt 4980 tagggctatc agttcgcgca ttaaagacta atagccattc aaaaatattg tctgtgcctc 5040 gtattcttac gctttcaggt cagaagggtt ctatttctgt 
tggccagaat gtccctttta 5100 ttactggtcg tgtaactggt gaatctgcca atgtaaataa tccatttcag acggttgagc 5160 gtcaaaatgt tggtatttct atgagtgttt ttcccgttgc aatggctggc ggtaatattg 5220 ttttagatat aaccagtaag gccgatagtt tgagttcttc tactcaggca agtgatgtta 5280 ttactaatca aagaagtatt gcgacaacgg ttaatttgcg tgatggtcag actcttttgc 5340 tcggtggcct cactgattac aaaaacactt ctcaagattc tggtgtgccg ttcctgtcta 5400 aaatcccttt aatcggcctc ctgtttagct cccgttctga ttctaacgag gaaagcacgt 5460 tgtacgtgct cgtcaaagca accatagtac gcgccctgta gcggcgcatt aagcgcggcg 5520 ggtgtggtgg ttacgcgcag cgtgaccgct acacttgcca gcgccctagc gcccgctcct 5580 ttcgctttct tcccttcctt tctcgccacg ttctccggct ttccccgtca agctctaaat 5640 cgggggatcc ctttagggtt ccgatttagt gctttacggc acctcgacct ccaaaaactt 5700 gatttgggtg atggttcacg tagtgggcca tcgccctgat agacggtttt tcgccctttg 5760 acgttggagt ccacgttctt taatagtgga ctcttgttcc aaactggaac aacactcaca 5820 actaactcgg cctattcttt tgatttataa ggatttttgt cattttctgc ttactggtta 5880 aaaaataagc tgatttaaca aatatttaac gcgaaattta acaaaacatt aacgtttaca 5940 atttaaatat ttgcttatac aatcatcctg tttttggggc ttttctgatt atcaaccggg 6000 gtacatatga ttgacatgct agttttacga ttaccgttca tcgattctct tgtttgctcc 6060 agactttcag gtaatgacct gatagccttt gtagacctct caaaaatagc taccctctcc 6120 ggcatgaatt tatcagctag aacggttgaa tatcatattg acggtgattt gactgtctcc 6180 ggcctttctc acccgtttga atctttgcct actcattact ccggcattgc atttaaaata 6240 tatgagggtt ctaaaaattt ttatccctgc gttgaaatta aggcttcacc agcaaaagta 6300 ttacagggtc ataatgtttt tggtacaacc gatttagctt tatgctctga ggctttattg 6360 cttaattttg ctaactctct gccttgcttg tacgatttat tggatgtt 6408 //