Dbfetch
LOCUS XM_012208032 1524 bp mRNA linear INV 02-APR-2015 DEFINITION PREDICTED: Atta cephalotes proline-rich protein 4-like (LOC105626738), mRNA. ACCESSION XM_012208032 VERSION XM_012208032.1 DBLINK BioProject: PRJNA279976 KEYWORDS RefSeq. SOURCE Atta cephalotes ORGANISM Atta cephalotes Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota; Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; Formicoidea; Formicidae; Myrmicinae; Atta. COMMENT MODEL REFSEQ: This record is predicted by automated computational analysis. This record is derived from a genomic sequence (NW_012130070.1) annotated using gene prediction method: Gnomon. Also see: Documentation of NCBI's Annotation Process ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Status :: Full annotation Annotation Version :: Atta cephalotes Annotation Release 100 Annotation Pipeline :: NCBI eukaryotic genome annotation pipeline Annotation Software Version :: 6.2 Annotation Method :: Best-placed RefSeq; Gnomon Features Annotated :: Gene; mRNA; CDS; ncRNA ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..1524 /organism="Atta cephalotes" /mol_type="mRNA" /db_xref="taxon:12957" /chromosome="Unknown" /sex="male" gene 1..1524 /gene="LOC105626738" /note="Derived by automated computational analysis using gene prediction method: Gnomon. Supporting evidence includes similarity to: 383 long SRA reads, 40 Proteins" /db_xref="GeneID:105626738" CDS 1..1128 /gene="LOC105626738" /codon_start=1 /product="proline-rich protein 4-like" /protein_id="XP_012063422.1" /db_xref="GeneID:105626738" /translation="MRILVYVAVAALVVTTVLSEETAKIEKSVKSEKKQEKRGLLGLG YGYGYDGGYDIGSGGHIGLDVGGGYGGYGGSFGGGYIDGGHHEHIKTVTLVKNVPVPY PVEKHVPYPVEKPVPVPVKVAVPQPYPVEKHVPYPVKVLVKVPVHVPQPYPVEKKVPY PVHVPVDRPVPVKVFVPQPYPVEKHIHVPVKVAVPHPYPVEKPVPVPVKIPVHVPQPY PVEKLVPYPVKIPVDRPVPVPKPVPYPVHIEKHVPYPVEKPVPYPVKVPVDRPYPVHI EKHVPVHIKVPVPAPYPVEKPIPYAVEKPVPVAVKIPVDRPYPVPVEKPVPYAVEKPV PYPVKVPVAVPVHHEHHGHGGYYAGGYESGYGGGDLGYGHH" ORIGIN 1 atgagaatcc tggtctacgt ggcagttgct gccctcgtgg ttacgacagt actgtcggag 61 gagaccgcga aaatcgagaa atcggtgaaa tccgagaaga aacaagagaa gcgtggtctt 121 ctaggtcttg gttatggtta cggttatgat ggcggatacg acatcggatc tggtggacac 181 ataggccttg atgtaggcgg tggatatggt ggatacggag gaagctttgg cggaggatat 241 attgacggtg gtcatcatga gcacataaag actgtgaccc ttgtgaagaa tgtccccgtg 301 ccttatcctg tggagaagca tgtgccttat ccggttgaga agcccgtacc ggtccccgtc 361 aaagtggccg taccacaacc gtatccagtt gagaagcacg tgccttatcc agtcaaggtc 421 ctcgtgaagg tgcccgtaca tgtgccccag ccgtacccag tggagaagaa ggttccctac 481 ccggtgcacg taccagtcga ccgtcctgtc ccggtcaagg tcttcgtacc acagccttat 541 ccagttgaaa agcatattca tgttccggta aaagtagcag tgccgcatcc gtaccctgtc 601 gagaagccag ttccggtacc ggtgaagatt ccagttcacg taccccaacc ctacccagtt 661 gagaagctcg taccctatcc cgtcaaaatt ccagttgacc gtcccgtgcc tgtacccaag 721 cctgtaccct atcccgtaca cattgagaag cacgtgcctt acccagtcga gaagccagtt 781 ccctaccccg tgaaggttcc agttgacagg ccgtaccccg tgcatatcga gaagcacgtg 841 cccgtccaca ttaaagtacc ggtaccggcg ccgtatccag tagaaaagcc aataccttac 901 gcggtcgaaa aacctgtgcc agtagcagtc aagattccag tcgataggcc gtaccccgtt 961 cctgtcgaga agcctgtgcc ttacgccgtg gagaagcctg taccttaccc agttaaagta 1021 ccggtggctg tgcccgttca tcacgagcac catggacatg gcggctatta cgctggtgga 1081 tatgagagtg gatacggagg tggtgattta ggttacggtc atcactaata tcgttatctc 1141 ccggctggtt ttccattatc gtcgtgcaaa agatttcatt gcgacgatga cggatagccg 1201 tgaggattgt cttaggttac gaaggttacc gatccgaaac gttgccgtgc agtgcagcga 1261 tatctgccac tttgtacctg gatcggaatc ggggacgatc tctggtagac gatcgcgtac 1321 cggaccatcg taccttcgaa tccgatcgca acgaggacgc ggcacggcga cgtgtgccag 1381 ttaggaagaa cgaatgacca gtagggatgt actcgtcttt agcgatcttc gctcgagagc 1441 aacttgcttg tcttactttt gtaccgatat tagcaaagta tatgtatttt tttacggaaa 1501 cgaaataaat attaatgtaa agta //