Dbfetch
LOCUS XM_012204940 1524 bp mRNA linear INV 02-APR-2015
DEFINITION PREDICTED: Atta cephalotes cytochrome P450 4C1-like (LOC105623545),
mRNA.
ACCESSION XM_012204940
VERSION XM_012204940.1
DBLINK BioProject: PRJNA279976
KEYWORDS RefSeq; corrected model; includes ab initio.
SOURCE Atta cephalotes
ORGANISM Atta cephalotes
Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
Pterygota; Neoptera; Endopterygota; Hymenoptera; Apocrita;
Aculeata; Formicoidea; Formicidae; Myrmicinae; Atta.
COMMENT MODEL REFSEQ: This record is predicted by automated computational
analysis. This record is derived from a genomic sequence
(NW_012130066.1) annotated using gene prediction method: Gnomon.
Also see:
Documentation of NCBI's Annotation Process
##Genome-Annotation-Data-START##
Annotation Provider :: NCBI
Annotation Status :: Full annotation
Annotation Version :: Atta cephalotes Annotation Release
100
Annotation Pipeline :: NCBI eukaryotic genome annotation
pipeline
Annotation Software Version :: 6.2
Annotation Method :: Best-placed RefSeq; Gnomon
Features Annotated :: Gene; mRNA; CDS; ncRNA
##Genome-Annotation-Data-END##
##RefSeq-Attributes-START##
ab initio :: 3% of CDS bases
frameshifts :: corrected 2 indels
##RefSeq-Attributes-END##
PRIMARY REFSEQ_SPAN PRIMARY_IDENTIFIER PRIMARY_SPAN COMP
1-166 ADTU01002145.1 12535-12700
167-261 ADTU01002146.1 5-99
262-352 ADTU01002148.1 1732-1822
353-541 ADTU01002148.1 2187-2375
542-605 ADTU01002148.1 3141-3204
606-732 ADTU01002148.1 8073-8199
733-790 ADTU01002148.1 8268-8325
791-885 ADTU01002148.1 11674-11768
886-954 ADTU01002148.1 11835-11903
955-1132 ADTU01002150.1 215-392
1133-1309 ADTU01002150.1 4205-4381
1310-1368 ADTU01002150.1 4455-4513
1369-1392 ADTU01002150.1 7005-7028
1393-1437 ADTU01002150.1 7031-7075
1438-1519 ADTU01002150.1 7078-7159
1520-1524 ADTU01002150.1 7279-7283
FEATURES Location/Qualifiers
source 1..1524
/organism="Atta cephalotes"
/mol_type="mRNA"
/db_xref="taxon:12957"
/chromosome="Unknown"
/sex="male"
gene 1..1524
/gene="LOC105623545"
/note="The sequence of the model RefSeq transcript was
modified relative to its source genomic sequence to
represent the inferred CDS: deleted 4 bases in 2 codons;
Derived by automated computational analysis using gene
prediction method: Gnomon. Supporting evidence includes
similarity to: 232 long SRA reads, 29 Proteins"
/db_xref="GeneID:105623545"
CDS 1..1524
/gene="LOC105623545"
/note="The sequence of the model RefSeq protein was
modified relative to its source genomic sequence to
represent the inferred CDS: deleted 4 bases in 2 codons"
/codon_start=1
/product="LOW QUALITY PROTEIN: cytochrome P450 4C1-like"
/protein_id="XP_012060330.1"
/db_xref="GeneID:105623545"
/translation="MLFVIIALSLTCVAISFFVQNYRLFFAFLTMKPLLEPLSLPFIG
IFLFLYRYNSMVWTKFTETYSSPFRICLGPYLFIVVHELDQIKAVLKNRHSLDKSIVY
NCLKPVFDMGLLTASESTWTESRKIIAPAFGMMPMKEYFNIFVQESLILTEDLEKIAQ
SENEIECLDHLCRCTLKIACDTMMGVKMERNLIDEWWKISASQRQIWQYRFRNVLLVP
DIIFNFTSWRGKQQECLNSFHSIIEKIIQQRTNESSTMFTNNDTSHTRFYDILMHSFR
NGKFTQKEIMDNIFTMLGASSDTTATVVHFVIIMMANFPETQEKAYKELLEIYGTETP
KFAPVKYEDLQHMHYLDCIIKETLRLFPIVPMIGRKLTEDLKMGEFVLPKGADVIISL
MGMHRNEKYWPNPLMFNPDRFLQEKTNCVPYYYIPFSDGPRNCIGLKYAMMSMKVILA
TLIRTFCKKFRRNRQHRSYQRQGCVAHNHWQDGNLSIWRRPRISHYYCSDVNNMMTN"
ORIGIN
1 atgttgtttg ttattattgc attaagttta acatgtgttg caataagttt ttttgttcaa
61 aattatcgtt tgttttttgc ctttctaacg atgaaaccgt tacttgaacc actaagccta
121 ccatttatcg gcatttttct ttttttatac cgatataatt ccatggtttg gactaagttc
181 actgaaactt attcatctcc atttcgaata tgcctaggcc catatttatt catcgtagtg
241 catgaactag accaaataaa ggctgtttta aaaaaccgcc atagtttgga taaaagtatt
301 gtatataact gtctaaaacc agtatttgac atgggattac tgacagcttc tgaatctaca
361 tggactgaaa gtcgaaaaat aatagctcca gcatttggta tgatgccgat gaaagaatat
421 tttaacatat tcgttcaaga atctttaata cttactgaag atttagaaaa aattgcacaa
481 agtgaaaacg aaattgaatg tcttgaccat ctttgcagat gtaccttaaa aattgcatgt
541 gataccatga tgggtgtcaa gatggaaagg aatttaatag atgaatggtg gaaaataagt
601 gcaagtcaaa gacaaatttg gcaatataga tttcgaaatg tactcttggt tcctgatatt
661 atatttaatt tcacatcatg gagaggcaaa caacaagagt gtttaaattc ctttcattca
721 atcattgaaa agataataca gcaaaggaca aatgaatcaa gtacgatgtt cactaataat
781 gacacatctc atacacgatt ttatgacatt ttaatgcatt catttcgtaa cggaaaattc
841 acgcaaaagg aaattatgga caatatattt acaatgctag gagcgtcttc agacactacc
901 gccactgtgg tgcattttgt gattattatg atggcaaatt tcccagaaac acaagaaaaa
961 gcgtataaag aattattaga aatttatggc acggaaactc cgaagtttgc accagttaaa
1021 tacgaggatt tacaacatat gcattacttg gattgtatta ttaaggaaac tttaagactt
1081 ttccctattg tccctatgat tggacgaaaa ctgacagaag acttaaaaat gggagaattt
1141 gttttaccga aaggtgcaga tgttattata tcacttatgg gaatgcatcg gaatgaaaaa
1201 tattggccga atccattgat gtttaatccg gatagatttc ttcaagaaaa aacaaattgt
1261 gtaccatatt attacatacc ttttagtgac ggaccaagaa attgtatagg tttgaaatat
1321 gcaatgatgt ctatgaaagt tattttagca acgctgataa gaacgttctg taagaaattt
1381 agaagaaata ggcaacacag aagttatcag cgtcagggct gcgtggctca caatcactgg
1441 caagacggaa atctatcaat ttggaggagg cctcgaatct ctcattatta ttgctctgat
1501 gtcaacaata tgatgacaaa ttaa
//