![]() |
IPD - KIR DatabaseUsing the IPD - KIR Sequence Alignment ToolThe sequence alignment form contains the following options: Select Locus - this option allows the user to choose which of the KIR genes to align. The locus is selected from the drop down menu. The box also includes a number of specialist choices like multiple alignments for all KIR genes. The selection of a locus automatically determines the type of sequences available to align. With regard to 2DLXa and 2DLXb, we presently do not know whether these alleles are from either the 2DL5A or 2DL5B genes, and are included at the present time for information only and will received official names in the future once this has been resolved. Select the feature to align - this option provides a list of alignments available for the loci selected. If an option is not listed in this list then it is either not possible or is currently unavailable. The alignment tool now allows the user to select "Nucleotide - CDS plus pseudoexon" as an option. This incorporates the pseudoexon sequence found in the 2DL1, 2DL2, 2DL3, 2DS1, 2DS2, 2DS3, 2DS4 and 2DS5 sequences into the CDS alignment. Enter any specific sequences required - allows the user to perform specific sequence alignments by either entering common nomenclature or by listing allele names. For example to align 2DL4*00101and 2DL4*00102. You could enter 001 or 0010 in the box as the common nomenclature, or you could enter 00101, 00102 in the box provided, separating each allele name with a comma. Wildcards (*) may be used in the allele name. Enter the reference sequence - the alignment tool allows the user to select an alternative reference sequence. This is optional, if not selected the tool uses the default sequence shown below. The alternatives are a user specified sequence or a consensus sequence. To use an alternative reference sequence simply enter the numerical code in full in the box provided. Please note incorrect codes will cause errors in the alignment, 001 is not a valid code for 2DL4*00101, the full numerical code must be entered. A consensus sequence based on those alleles in the alignment can be used by typing "consensus" into the reference box. The consensus sequence is not derived from all alleles at the locus selected but from those alleles selected for the alignment.
Select how you wish to view any mismatches - this option selects whether the to display the full sequence or to highlight the mismatches. The full sequence details every base pair for all sequences, highlighting mismatches represents only base pairs that differ between the sequence and the reference sequence used. Examples of both options are shown below. Show mismatches between sequences: KIR2DL1*001 CGCTCTTGGT Show all bases: KIR2DL1*001 CGCTCTTGGT Select how the alignment will be numbered - depending on the type of sequence selected different numbering styles can be selected. For nucleotide sequences the alignments can be displayed in blocks of 10 nucleotides or in the amino acid codons. Protein are always displayed in blocks of 10 amino acids. For both formats it may be necessary to increase the width of your browser to fully view the sequence. Full details of how sequenced are numbered is explained below. Do you want to omit alleles unsequenced for this region - due to the high number of alleles in some alignments, you can now omit those alleles that are not sequenced over the region of interest. This will reduce the time taken to perform the alignment and the space required to display the output. Where possible select only the sequences needed, this will reduce time and make the alignments easier to view. Sequence Alignment Display OptionsThe IPD - KIR Database produces the official KIR sequence alignments. The alignment files produced use the following nomenclature and numbering conventions. These conventions are based on the recommendations published for Human Gene Mutations. These were prepared by a nomenclature-working group looking at how to name and store sequences for human allelic variants. These recommendations can be found in Human Mutation 11:1-3, 1998 (1).
In the sequence alignments the following conventions are used.
Numbering of the Sequence AlignmentIn order to provide standardised sequences for any loci, the following numbering system has been established that accurately represents the sequence at both the nucleotide and protein level. We have looked at the HUGO Gene Nomenclature Committee (1) recommendations proposed for the numbering of genomic sequences, and use a similar model for the KIR sequences held in the IPD - KIR Database. Many of their proposals already match our current strategy. HUGO recommends that for all nomenclature systems a standard reference sequence should be used for each locus. In the case of KIR sequences a standard reference sequence is already established for each gene. The remaining recommendations for nucleotide sequences are as follows; Nucleotide Sequence Numbering
The following recommendations are used for describing mutations in nucleotide sequences;
Protein Sequence Numbering
Mutations in protein sequences follow a similar format;
References
Further InformationFor more information about the database, queries (including website) please contact IPD Support. Please see our licence for terms of use. ![]() |