ID   AL591175; SV 6; linear; genomic DNA; STD; VRT; 89232 BP.
AC   AL591175;
DT   04-MAY-2001 (Rel. 67, Created)
DT   13-JAN-2009 (Rel. 99, Last updated, Version 22)
DE   Zebrafish DNA sequence from clone BUSM1-37I06 in linkage group 8. Contains
DE   five genes for a novel protein similar to type I cytokeratin, enveloping
DE   layer, the gene for a novel zinc finger domain containing protein, the cyt1
DE   gene for type I cytokeratin, enveloping layer, the gene for a novel protein
DE   similar to vertebrate keratin family, the gene for a novel protein
DE   (zgc:85781) and four CpG islands.
KW   alpha chain; beta chain; cyt1; HTG; MHC class I; MHC class II;
KW   retrotransposon; SUSHI-ICHI; SUSHIIDR1.
OS   Danio rerio (zebrafish)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Cypriniformes;
OC   Cyprinidae; Danio.
RN   [1]
RP   1-89232
RA   Babbage A.;
RT   ;
RL   Submitted (12-JAN-2009) to the INSDC.
RL   Wellcome Trust Sanger Institute, Hinxton, Cambridgeshire, CB10 1SA, UK.
RL   E-mail enquiries: zfish-help@sanger.ac.uk Clone requests:
RL   http://www.sanger.ac.uk/Projects/D_rerio/faqs.shtml#dataeight
DR   MD5; 8b76180ed10a82d8c47988ef2422173a.
CC   -------------- Genome Center
CC   Center: Wellcome Trust Sanger Institute
CC   Center code: SC
CC   Web site: http://www.sanger.ac.uk
CC   Contact: zfish-help@sanger.ac.uk
CC   --------------
CC   This sequence was finished as follows unless otherwise noted: all regions
CC   were either double-stranded or sequenced with an alternate chemistry or
CC   covered by high quality data (i.e., phred quality >= 30); an attempt was
CC   made to resolve all sequencing problems, such as compressions and repeats;
CC   all regions were covered by at least one subclone; and the assembly was
CC   confirmed by restriction digest, except on the rare occasion of the clone
CC   being a YAC.
CC   The following abbreviations are used to associate primary accession
CC   numbers given in the feature table with their source databases:
CC   Information on the WORMPEP database can be found at
CC   http://www.sanger.ac.uk/Projects/C_elegans/wormpep
CC   Clone-derived Zebrafish pUC subclones occasionally display inconsistency
CC   over the length of mononucleotide A/T runs and conserved TA repeats.
CC   Where this is found the longest good quality representation will be
CC   submitted.
CC   Any regions longer than 1kb tagged as misc-feature "unsure" are part of
CC   a tandem repeat of more than 10kb in length where it has not been possible
CC   to anchor the base differences between repeat copies.  The region has been
CC   built up based on the repeat element to match the total size of repeat
CC   indicated by restriction digest, but repeat copies may not be in the
CC   correct order and the usual finishing criteria may not apply.
CC   IMPORTANT: This sequence is not the entire insert of clone BUSM1-37I06. It
CC   may be shorter because we sequence overlapping sections only once, except
CC   for a short overlap.
CC   During sequence assembly data is compared from overlapping clones. Where
CC   differences are found these are annotated as variations together with a
CC   note of the overlapping clone name. Note that the variation annotation may
CC   not be found in the sequence submission corresponding to the overlapping
CC   clone, as we submit sequences with only a small overlap.
CC   The true left end of clone BUSM1-37I06 is at 1 in this sequence.
CC   The true right end of clone BUSM1-37I06 is at 89232 in this sequence.
CC   BUSM1-37I06 is from a Zebrafish PAC library.
FH   Key             Location/Qualifiers
FT   source          1..89232
FT                   /organism="Danio rerio"
FT                   /chromosome="8"
FT                   /mol_type="genomic DNA"
FT                   /clone_lib="BUSM1"
FT                   /clone="BUSM1-37I06"
FT                   /db_xref="taxon:7955"
FT   misc_feature    1..89232
FT                   /note="annotated region of clone"
FT   misc_feature    1
FT                   /note="Clone_left_end: BUSM1-37I06"
FT   mRNA            join(<3055..3304,3628..3909,3990..>4111)
FT                   /locus_tag="dZ37I06.1-001"
FT                   /product="novel protein similar to MHC class II alpha
FT                   chain"
FT                   /note="match: cDNAs: Em:AJ251433.1"
FT   CDS             join(<3055..3304,3628..3909,3990..4111)
FT                   /codon_start=1
FT                   /locus_tag="dZ37I06.1-001"
FT                   /standard_name="OTTDARP00000001325"
FT                   /product="novel protein similar to MHC class II alpha
FT                   chain"
FT                   /note="match: proteins: Tr:Q31359 Tr:Q31362 Tr:Q31379
FT                   Tr:Q31625 Tr:Q9MX25 Tr:Q9MX26 Tr:Q9MX27"
FT                   /db_xref="GOA:Q8MGS1"
FT                   /db_xref="InterPro:IPR001003"
FT                   /db_xref="InterPro:IPR003006"
FT                   /db_xref="InterPro:IPR003597"
FT                   /db_xref="InterPro:IPR007110"
FT                   /db_xref="InterPro:IPR011162"
FT                   /db_xref="InterPro:IPR013783"
FT                   /db_xref="InterPro:IPR014745"
FT                   /db_xref="UniProtKB/TrEMBL:Q8MGS1"
FT                   /db_xref="ZFIN:ZDB-GENE-030616-440"
FT                   /protein_id="CAD32273.1"
FT   mobile_element  complement(<7144..>11399)
FT                   /mobile_element_type="transposon"
FT                   /locus_tag="SUSHIIDR1-001"
FT   mRNA            complement(join(13138..13387,13471..13610,13850..13938,
FT                   14124..14200))
FT                   /locus_tag="dZ37I06.3-001"
FT                   /product="putative novel transcript"
FT                   /note="match: ESTs: Em:AI544505 Em:AW279935 Em:BI350674
FT                   Em:BM025909 Em:BM026143 Em:BM036470 Em:BM036710
FT                   Em:BM103315"
FT   polyA_site      complement(13140)
FT   polyA_site      complement(13144)
FT   regulatory      complement(13157..13162)
FT                   /regulatory_class="polyA_signal_sequence"
FT   regulatory      complement(13167..13172)
FT                   /regulatory_class="polyA_signal_sequence"
FT   misc_feature    17164..17227
FT                   /note="Tandem repeat. Inconsistency in the number of copies
FT                   of the repeat element between subclones"
FT   mRNA            complement(join(<18801..18923,21795..22151,23408..23692,
FT                   26366..>26614))
FT                   /locus_tag="dZ37I06.5-001"
FT                   /product="novel protein similar to MHC class I heavy chain"
FT   CDS             complement(join(<18801..18923,21795..22151,23408..23692,
FT                   26366..>26614))
FT                   /codon_start=3
FT                   /locus_tag="dZ37I06.5-001"
FT                   /standard_name="OTTDARP00000001327"
FT                   /product="novel protein similar to MHC class I heavy chain"
FT                   /note="match: proteins: Tr:Q9BDB3 Tr:Q9GJC0 Tr:Q9GJK0
FT                   Tr:Q9GJK8 Tr:Q9GJL1 Tr:Q9GJL2 Tr:Q9TNN4 Tr:Q9TNW9"
FT                   /db_xref="GOA:Q8HWF8"
FT                   /db_xref="InterPro:IPR003597"
FT                   /db_xref="InterPro:IPR007110"
FT                   /db_xref="InterPro:IPR011161"
FT                   /db_xref="InterPro:IPR011162"
FT                   /db_xref="InterPro:IPR013783"
FT                   /db_xref="UniProtKB/TrEMBL:Q8HWF8"
FT                   /db_xref="ZFIN:ZDB-GENE-030616-443"
FT                   /protein_id="CAD56801.1"
FT   mRNA            complement(join(31152..31362,31437..>31697))
FT                   /locus_tag="dZ37I06.6-001"
FT                   /product="novel protein similar to MHC class II alpha
FT                   chain"
FT                   /note="match: cDNAs: Em:L19446.1 Em:L19450.1 Em:X95433.1"
FT                   /note="match: ESTs: Em:BI318681.1 Em:BI318699.1
FT                   Em:BI475666.1 Em:BI842059.1 Em:BI842063.1 Em:BI845880.1
FT                   Em:BI846070.1"
FT   CDS             complement(join(31274..31362,31437..>31697))
FT                   /codon_start=3
FT                   /locus_tag="dZ37I06.6-001"
FT                   /standard_name="OTTDARP00000001328"
FT                   /product="novel protein similar to MHC class II alpha
FT                   chain"
FT                   /note="match: proteins: Tr:Q9MX25 Tr:Q9MX27 Tr:Q9TQ81
FT                   Tr:Q9TQ82 Tr:Q9XRY8 Tr:Q9XRY9 Tr:Q9XRZ0 Tr:Q9XRZ1"
FT                   /db_xref="InterPro:IPR003006"
FT                   /db_xref="InterPro:IPR003597"
FT                   /db_xref="InterPro:IPR007110"
FT                   /db_xref="InterPro:IPR013783"
FT                   /db_xref="UniProtKB/TrEMBL:Q8HWF7"
FT                   /db_xref="ZFIN:ZDB-GENE-030616-444"
FT                   /protein_id="CAD56802.1"
FT   mRNA            join(<36699..36756,36978..37253,37365..37646,37725..37838,
FT                   39882..>40021)
FT                   /locus_tag="dZ37I06.4-001"
FT                   /product="novel protein similar to MHC class II beta chain"
FT                   /note="match: cDNAs: Em:AF296391.1 Em:AF296392.1
FT                   Em:AF296397.1 Em:U77597.1 Em:U77598.1 Em:Z49064.1
FT                   Em:Z49065.1"
FT                   /note="match: ESTs: Em:AU090795.1 Em:AW184436.1
FT                   Em:AW233318.1 Em:BE201938.1 Em:BM438192.1"
FT   CDS             join(36699..36756,36978..37253,37365..37646,37725..37838,
FT                   39882..40021)
FT                   /locus_tag="dZ37I06.4-001"
FT                   /standard_name="OTTDARP00000001326"
FT                   /product="novel protein similar to MHC class II beta chain"
FT                   /note="match: proteins: Tr:Q31377 Tr:Q31488 Tr:Q31490
FT                   Tr:Q31590 Tr:Q95HJ7 Tr:Q9GJI1 Tr:Q9GJI4 Tr:Q9GJI6 Tr:Q9GJI8
FT                   Tr:Q9GJI9 Tr:Q9GJJ0"
FT                   /db_xref="GOA:Q8MGS0"
FT                   /db_xref="InterPro:IPR000353"
FT                   /db_xref="InterPro:IPR003597"
FT                   /db_xref="InterPro:IPR007110"
FT                   /db_xref="InterPro:IPR011162"
FT                   /db_xref="InterPro:IPR013783"
FT                   /db_xref="InterPro:IPR014745"
FT                   /db_xref="UniProtKB/TrEMBL:Q8MGS0"
FT                   /db_xref="ZFIN:ZDB-GENE-030616-442"
FT                   /protein_id="CAD32274.3"
FT   mRNA            join(complement(AL591442.2:78794..>78851),
FT                   complement(AL591442.2:77707..77825),36978..37253,
FT                   54472..54753)
FT                   /locus_tag="dZ243A08.1-001"
FT                   /product="novel protein similar to major histocompatibility
FT                   complex class II DCB gene (mhc2dcb), MHC2DBC related
FT                   sequence (mhc2dbc-rs)"
FT                   /note="match: cDNAs: Em:U77597.1 Em:U77598.1 Em:X95431.1"
FT   mRNA            complement(join(<48782..48891,48976..>49257))
FT                   /locus_tag="dZ37I06.7-001"
FT                   /product="novel protein similar to MHC class II alpha
FT                   chain"
FT                   /note="match: cDNAs: Em:L19445.1 Em:L19446.1 Em:L19450.1"
FT                   /note="match: ESTs: Em:BF717925.1 Em:BI472418.1
FT                   Em:BI475666.1 Em:BI845880.1 Em:BI846070.1 Em:BI865180.1"
FT   CDS             complement(join(48782..48891,48976..>49257))
FT                   /codon_start=3
FT                   /locus_tag="dZ37I06.7-001"
FT                   /standard_name="OTTDARP00000001329"
FT                   /product="novel protein similar to MHC class II alpha
FT                   chain"
FT                   /note="match: proteins: Tr:Q95H91 Tr:Q95HJ5"
FT                   /db_xref="InterPro:IPR003006"
FT                   /db_xref="InterPro:IPR003597"
FT                   /db_xref="InterPro:IPR007110"
FT                   /db_xref="InterPro:IPR013783"
FT                   /db_xref="UniProtKB/TrEMBL:Q8HWF6"
FT                   /db_xref="ZFIN:ZDB-GENE-030616-445"
FT                   /protein_id="CAD56803.1"
FT   mRNA            join(<53806..53863,54085..54360,54472..54753,54832..>54945)
FT                   /locus_tag="dZ37I06.8-001"
FT                   /product="novel protein similar to MHC class II beta chain"
FT                   /note="match: cDNAs: Em:U77597.1 Em:U77598.1 Em:X93898.1
FT                   Em:Z49064.1"
FT                   /note="match: ESTs: Em:AW184436.1 Em:AW233318.1
FT                   Em:BE201938.1 Em:BG936494.1 Em:BQ261921.1"
FT   CDS             join(53806..53863,54085..54360,54472..54753,54832..>54945)
FT                   /locus_tag="dZ37I06.8-001"
FT                   /standard_name="OTTDARP00000001330"
FT                   /product="novel protein similar to MHC class II beta chain"
FT                   /note="match: proteins: Tr:Q31357 Tr:Q95HJ6 Tr:Q95HJ7"
FT                   /db_xref="GOA:Q8HWF5"
FT                   /db_xref="InterPro:IPR000353"
FT                   /db_xref="InterPro:IPR003597"
FT                   /db_xref="InterPro:IPR007110"
FT                   /db_xref="InterPro:IPR011162"
FT                   /db_xref="InterPro:IPR013783"
FT                   /db_xref="InterPro:IPR014745"
FT                   /db_xref="UniProtKB/TrEMBL:Q8HWF5"
FT                   /db_xref="ZFIN:ZDB-GENE-990415-143"
FT                   /protein_id="CAD56804.1"
FT   mRNA            complement(join(58412..58590,58670..>58949))
FT                   /locus_tag="dZ37I06.9-001"
FT                   /product="novel protein similar to MHC class I alpha chain"
FT                   /note="match: cDNAs: Em:AF103007.1 Em:AF212850.1
FT                   Em:L19446.1 Em:L19450.1"
FT                   /note="match: ESTs: Em:BI318681.1 Em:BI472418.1
FT                   Em:BI845880.1 Em:BI846070.1"
FT   CDS             complement(join(58469..58590,58670..>58949))
FT                   /codon_start=1
FT                   /locus_tag="dZ37I06.9-001"
FT                   /standard_name="OTTDARP00000001331"
FT                   /product="novel protein similar to MHC class I alpha chain"
FT                   /note="match: proteins: Tr:Q31362 Tr:Q31466 Tr:Q31468
FT                   Tr:Q31470 Tr:Q31471 Tr:Q95H91"
FT                   /db_xref="GOA:Q8HWF4"
FT                   /db_xref="InterPro:IPR003006"
FT                   /db_xref="InterPro:IPR003597"
FT                   /db_xref="InterPro:IPR007110"
FT                   /db_xref="InterPro:IPR013783"
FT                   /db_xref="UniProtKB/TrEMBL:Q8HWF4"
FT                   /db_xref="ZFIN:ZDB-GENE-030616-447"
FT                   /protein_id="CAD56805.1"
FT   mRNA            join(complement(58670..>58951),AL591442.2:74048..>74169)
FT                   /locus_tag="dZ243A08.2-001"
FT                   /product="novel protein similar to MHC class II alpha
FT                   chain"
FT                   /note="match: cDNAs: Em:AJ251432.1 Em:L19446.1 Em:L19450.1
FT                   Em:L35065.1 Em:L35067.1 Em:X95433.1"
FT   CDS             join(complement(58670..>58951),AL591442.2:74048..74169)
FT                   /codon_start=3
FT                   /locus_tag="dZ243A08.2-001"
FT                   /standard_name="OTTDARP00000001295"
FT                   /product="novel protein similar to MHC class II alpha
FT                   chain"
FT                   /note="match: proteins: Tr:Q31465 Tr:Q31467 Tr:Q9MX26
FT                   Tr:Q9TQ81"
FT                   /db_xref="GOA:A3KFP9"
FT                   /db_xref="InterPro:IPR003006"
FT                   /db_xref="InterPro:IPR003597"
FT                   /db_xref="InterPro:IPR007110"
FT                   /db_xref="InterPro:IPR013783"
FT                   /db_xref="UniProtKB/TrEMBL:A3KFP9"
FT                   /protein_id="CAM56284.1"
FT   misc_feature    75527..75567
FT                   /note="Tandem repeat. Inconsistency in the number of copies
FT                   of the repeat element between subclones"
FT   misc_feature    89232
FT                   /note="Clone_right_end: BUSM1-37I06"
