Dbfetch

LOCUS       XM_012202666            1914 bp    mRNA    linear   INV 02-APR-2015
DEFINITION  PREDICTED: Atta cephalotes polypeptide
            N-acetylgalactosaminyltransferase 35A (LOC105621197), mRNA.
ACCESSION   XM_012202666
VERSION     XM_012202666.1
DBLINK      BioProject: PRJNA279976
KEYWORDS    RefSeq.
SOURCE      Atta cephalotes
  ORGANISM  Atta cephalotes
            Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
            Pterygota; Neoptera; Endopterygota; Hymenoptera; Apocrita;
            Aculeata; Formicoidea; Formicidae; Myrmicinae; Atta.
COMMENT     MODEL REFSEQ:  This record is predicted by automated computational
            analysis. This record is derived from a genomic sequence
            (NW_012130096.1) annotated using gene prediction method: Gnomon.
            Also see:
                Documentation of NCBI's Annotation Process
            
            ##Genome-Annotation-Data-START##
            Annotation Provider         :: NCBI
            Annotation Status           :: Full annotation
            Annotation Version          :: Atta cephalotes Annotation Release
                                           100
            Annotation Pipeline         :: NCBI eukaryotic genome annotation
                                           pipeline
            Annotation Software Version :: 6.2
            Annotation Method           :: Best-placed RefSeq; Gnomon
            Features Annotated          :: Gene; mRNA; CDS; ncRNA
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..1914
                     /organism="Atta cephalotes"
                     /mol_type="mRNA"
                     /db_xref="taxon:12957"
                     /chromosome="Unknown"
                     /sex="male"
     gene            1..1914
                     /gene="LOC105621197"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Gnomon. Supporting evidence
                     includes similarity to: 15 Proteins"
                     /db_xref="GeneID:105621197"
     CDS             1..1914
                     /gene="LOC105621197"
                     /codon_start=1
                     /product="polypeptide N-acetylgalactosaminyltransferase
                     35A"
                     /protein_id="XP_012058056.1"
                     /db_xref="GeneID:105621197"
                     /translation="MISTRYVSFLSGIIIASLTWAFSLYLYSRLSQNAITTSPTILIP
                     IVSKHGESIDKDIMLRDNVIIARNEKQILAGKEAYNLKSNKNFRKNDLILQQLQAVPV
                     KPAVTLEQGLDELGMVKNLEDQHKRDEGYKEYAFNILISDNLGVQRNIPDTRHKLCKM
                     QKYPVNLPNASIIICFYNEHYTTLLRSLHSVLEKTPTALLHEIILVNDYSDSDMLHEK
                     IEVYIRNNFDDRIRLFKTERREGLIRARVFGARKATGKVLIFLDSHIEVNEMWIEPLL
                     SRIAYSRNIVPMPVIDIINADTFQYTGSPLVRGGFNWGLHFKWDNLPIGTLNHDVDFV
                     KPIKSPTMAGGLFAIDREYFTKMGEYDIGMDIWGGENLEISFRIWMCGGSIELIPCSR
                     VGHVFRRRRPYGSDDPQDTMLKNSLRVAHVWMDEYKDYFLKNAKTIDYGDISERLALR
                     QKLKCKTFGWYLKVVYPELTLPDDTERRLKDKWAKLDQRPMQPWHSRKRNYTDQYQIR
                     LSNTALCIQSEKDIKTKGSKLILMPCLRIKSQMWYETDKNELVLGQMLCMEGAEKIPK
                     LGKCHEMGGSQDWRHKRINATPIYNMATGTCLGVMRDVRNTPLIMDLCTRSNASLITW
                     DLIRSKILAKEIR"
ORIGIN      
        1 atgatatcaa caagatatgt atcattccta tcagggataa taatagcttc tctaacatgg
       61 gcattcagcc tgtatcttta ttcaagactg tcacaaaatg ccatcaccac cagtcccaca
      121 atattaattc ctatcgtttc caaacatgga gaaagtattg acaaagatat tatgctgcgt
      181 gacaatgtta tcattgcacg caatgagaaa caaatattag ctggtaagga agcctataac
      241 ttaaagagca ataagaactt caggaaaaat gacctgatct tgcagcaact tcaggctgtt
      301 ccagtaaaac ctgcagtcac tttggagcaa ggtttagatg aattaggcat ggtgaaaaat
      361 ttggaggatc aacacaagag ggacgagggt tataaggagt atgcctttaa tattttgata
      421 tcagacaatc ttggcgtgca aagaaatata ccagatacga gacacaagct gtgcaagatg
      481 caaaaatatc ctgtcaattt gccaaatgct agtataatta tttgttttta caatgagcat
      541 tatacaaccc tattgagatc tttgcattct gtgcttgaaa aaacgccaac agcacttcta
      601 catgagatta tcttggtgaa tgattatagc gacagcgata tgttgcatga gaagatagaa
      661 gtatatatta gaaacaattt cgatgacaga atacgattat tcaagactga aagaagagaa
      721 ggtttgatca gagcacgggt gtttggagct agaaaagcaa ctggcaaagt tttgatcttt
      781 ttggatagtc acatagaagt gaatgaaatg tggattgagc cacttctttc aaggattgct
      841 tattcgagaa atatcgtacc gatgccagtg attgatatta taaatgcaga tacatttcag
      901 tataccggca gtccattagt gagaggtggt tttaattggg gtcttcactt taagtgggat
      961 aatttaccaa ttggaacatt aaatcatgac gttgattttg taaaacccat caaatcccca
     1021 actatggctg gtggattgtt tgctattgat agagaatatt ttacaaagat gggagaatat
     1081 gacatcggta tggacatttg gggtggagag aatcttgaaa tatcatttcg aatatggatg
     1141 tgtggtggta gtatcgagct tattccctgt tcacgagtgg gtcacgtatt tagaagacga
     1201 cggccgtatg gttcggatga tccacaagac accatgttaa aaaattcctt gcgagttgcg
     1261 catgtatgga tggacgaata taaagattac tttctgaaaa atgctaaaac aattgattac
     1321 ggcgatatct cagagcgatt agcgttgaga cagaaattaa aatgtaaaac attcggttgg
     1381 tatttaaagg tagtttatcc tgaattaaca ttgccagatg atacagaaag gaggctcaaa
     1441 gataaatggg ccaaacttga tcaaagacca atgcaacctt ggcattcgag gaagagaaat
     1501 tataccgatc aatatcaaat cagactttca aataccgcgc tttgtatcca gagtgaaaag
     1561 gatattaaaa ctaaaggttc caaacttatc ttgatgccgt gtttaagaat taaatcacag
     1621 atgtggtatg agaccgataa aaatgaactg gtacttggcc agatgctttg tatggaagga
     1681 gctgagaaaa ttccaaaact tggaaagtgt catgaaatgg gaggaagtca agattggcgt
     1741 cataagcgta ttaatgctac acctatctac aatatggcaa caggaacttg cttaggagtg
     1801 atgcgcgatg taagaaatac tccacttatc atggatttat gcacgagatc gaatgcttca
     1861 ttgataacat gggatctcat tcggtcaaag attttagcca aagaaatcag atga
//