Dbfetch
LOCUS       XM_012201057            1836 bp    mRNA    linear   INV 02-APR-2015
DEFINITION  PREDICTED: Atta cephalotes transmembrane protease serine 7-like
            (LOC105619539), mRNA.
ACCESSION   XM_012201057
VERSION     XM_012201057.1
DBLINK      BioProject: PRJNA279976
KEYWORDS    RefSeq; corrected model.
SOURCE      Atta cephalotes
  ORGANISM  Atta cephalotes
            Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
            Pterygota; Neoptera; Endopterygota; Hymenoptera; Apocrita;
            Aculeata; Formicoidea; Formicidae; Myrmicinae; Atta.
COMMENT     MODEL REFSEQ: This record is predicted by automated computational
            analysis. This record is derived from a genomic sequence
            (NW_012130086.1) annotated using gene prediction method: Gnomon.
            Also see:
                Documentation of NCBI's Annotation Process
            ##Genome-Annotation-Data-START##
            Annotation Provider         :: NCBI
            Annotation Status           :: Full annotation
            Annotation Version          :: Atta cephalotes Annotation Release
                                           100
            Annotation Pipeline         :: NCBI eukaryotic genome annotation
                                           pipeline
            Annotation Software Version :: 6.2
            Annotation Method           :: Best-placed RefSeq; Gnomon
            Features Annotated          :: Gene; mRNA; CDS; ncRNA
            ##Genome-Annotation-Data-END##
            ##RefSeq-Attributes-START##
            frameshifts :: corrected 1 indel
            ##RefSeq-Attributes-END##
PRIMARY     REFSEQ_SPAN         PRIMARY_IDENTIFIER PRIMARY_SPAN        COMP
            1-40                ADTU01014841.1     8252-8291
            41-257              ADTU01014841.1     8424-8640
            258-377             ADTU01014841.1     8708-8827
            378-706             ADTU01014842.1     884-1212
            707-975             ADTU01014842.1     1310-1578
            976-1029            ADTU01014842.1     2127-2180
            1030-1030           "N"                1-1
            1031-1429           ADTU01014842.1     2181-2579
            1430-1731           ADTU01014842.1     2874-3175
            1732-1836           ADTU01014842.1     3432-3536
FEATURES             Location/Qualifiers
     source          1..1836
                     /organism="Atta cephalotes"
                     /mol_type="mRNA"
                     /db_xref="taxon:12957"
                     /chromosome="Unknown"
                     /sex="male"
     gene            1..1836
                     /gene="LOC105619539"
                     /note="The sequence of the model RefSeq transcript was
                     modified relative to its source genomic sequence to
                     represent the inferred CDS: inserted 1 base in 1 codon;
                     Derived by automated computational analysis using gene
                     prediction method: Gnomon. Supporting evidence includes
                     similarity to: 25 Proteins"
                     /db_xref="GeneID:105619539"
     CDS             1..1836
                     /gene="LOC105619539"
                     /note="The sequence of the model RefSeq protein was
                     modified relative to its source genomic sequence to
                     represent the inferred CDS: inserted 1 base in 1 codon"
                     /codon_start=1
                     /product="LOW QUALITY PROTEIN: transmembrane protease
                     serine 7-like"
                     /protein_id="XP_012056447.1"
                     /db_xref="GeneID:105619539"
                     /translation="MFYCLSRVHTFRRGYKDCSDGSDENATFCREYTCPEYTFRCSYG
                     GCVHQEVVCDGVKDCVDGTDEDSSTCAAINCEDHECSQYECRDQEFTCENRRQCLPMT
                     KVCDGIRHCADASDESAKLCKAHECPENWFRCAYGGCIRPELKCNSQLNCHDWSDEDE
                     SLCGITLPEGACRLPAAKAGTHYNVSGCSRCRPGEVVSELTRLDYVCDVEGFLQGASE
                     IYCQNNRWFPNVPSCTFKNETKGITCPAPFEAHNAIRQCEAMWGPHKGWLSCDQALPV
                     STRVFLDCPAFYERHKGSSSVMCLHDGTWSQSLLSCTPVCGMRDSTGLTKLILIFLAA
                     ALIVKGWSVXKEETLPWQATLFSHEDGQWSFFCGGTLIAERVVLTAGHCVWKTAADTI
                     RVAFGILSSDLNQIGENAQVIDVESIKLQNAYQDYESNYGSDIALLILKKVVTINMVV
                     GPVCIPWNSDSILLEYQRNSGLGLVAGMGLTENNTFSSVLRVTTVKIISDDECRKNQN
                     RDFRKYLTYTSFCAGWANGTGVCNGDSGGGLVLQRPNSSVWEIHGVVSVSPRKLGTNV
                     CDPNFYAVFTKVSMYVNWIHKIIESMPTIGLPDFDQHPNRDNIVV"
ORIGIN
        1 atgttctact gtttatcaag agtgcatacc tttcgccgag gctacaaaga ttgttcagat
       61 gggagcgatg agaatgcgac attctgtcgc gaatatacat gtcctgagta tacattccgt
      121 tgttcgtacg gaggttgcgt tcatcaggaa gtggtttgcg acggtgtgaa ggattgcgtc
      181 gacggcaccg atgaagactc gagtacatgc gcggctatca attgtgagga tcacgaatgt
      241 tcgcaatacg aatgccgaga tcaagaattc acatgcgaga atagacggca atgtttaccg
      301 atgactaaag tatgcgatgg tattcgacac tgtgcagatg cgtctgatga aagtgcaaaa
      361 ttgtgcaaag cacatgaatg tcctgaaaat tggtttcgtt gtgcttatgg cggatgcatt
      421 cggccggagt tgaagtgtaa ttctcaactt aattgtcacg actggagcga cgaggatgag
      481 tcgttatgtg ggataacgtt gccagaaggt gcctgtcgat tgcccgccgc aaaagcgggg
      541 acccattaca acgtaagcgg atgctccaga tgtcgaccgg gagaagtcgt ttccgagctc
      601 actcgccttg actatgtctg tgacgtcgaa ggcttcttgc aaggtgccag tgaaatttac
      661 tgtcaaaata accgatggtt tcccaatgtc ccgagttgta ccttcaagaa tgagacgaaa
      721 ggaataacgt gtccagcacc gtttgaagcg cataatgcga ttagacaatg cgaggcgatg
      781 tggggtcctc acaaagggtg gttgtcttgt gaccaggccc tcccagtcag cacaagagtt
      841 tttcttgatt gtccagcatt ttatgagcgt cataaaggat cttccagtgt catgtgcctt
      901 cacgacggca cgtggagtca atcacttttg agttgtactc ctgtttgtgg gatgagagac
      961 tccacaggtt tgacaaaatt aatattaata ttcttagcgg cagcactaat tgtaaagggt
     1021 tggagcgttn gaaaagaaga gacactaccg tggcaagcta ctcttttttc gcatgaggat
     1081 ggtcagtgga gtttcttttg cggcggcaca ttgatagcag aacgcgttgt gttaactgca
     1141 ggtcattgcg tttggaagac tgctgcagat acgatacgtg ttgctttcgg gattctctcg
     1201 agtgacttaa atcaaatagg ggaaaacgcg caagtgatcg atgtggaaag catcaaattg
     1261 cagaatgcgt atcaagatta tgaaagtaat tatggctcgg atatcgcgtt gctgattttg
     1321 aaaaaagttg tgactattaa tatggttgtt ggaccagttt gcattccttg gaattctgat
     1381 tcgatattac tggaatatca acgaaatagt gggcttggac tagttgccgg catggggtta
     1441 acggagaaca acacatttag ttcagtcttg agggtaacta cagtaaaaat catttccgac
     1501 gatgagtgtc gtaaaaacca aaatagagat ttccgaaaat atttgactta cacttccttt
     1561 tgcgccggat gggcgaatgg tactggagta tgcaatggag atagcggtgg aggtttggtg
     1621 cttcagcgac cgaattccag tgtctgggaa attcacggag ttgtttctgt gagcccacga
     1681 aaactgggta ccaacgtatg tgatccgaat ttctatgctg ttttcacaaa ggtttctatg
     1741 tacgtcaatt ggattcataa aattattgag agcatgccta caatcggctt acctgatttt
     1801 gatcaacatc caaatagaga taacatcgtt gtttaa
//