The NullHap utility estimates haplotype frequencies using genotype data from unrelated individuals. It can be applied to loci with a codominant allele(s) and/or a silent (null) alleles (as often happens with KIR loci). The NullHap is based on maximum likelihood approach and uses an expanded Expectation-Maximalisation algorithm.
Usage
NullHap [options] input_file
Allowed options:
Details of input file format are given below. Two sections should be given : description of loci and an description of observations. The ';' (semicolon) starts a comment. It ends at the next newline.
Loci description section contains description of every locus. The keyword 'Locus' (or 'L') starts an locus description. Next, in parentheses three locus attributes should be defined:
Locus("A" 2 true) Locus("B" 2 false)
Observation descriptions section contains the genotypes and number of observations. The keyword 'Observation' (or 'O') starts an observation description, then two observation attributes should be defined:
Null allele should be included only if no other allele was found in a given locus (i.e. in a homozygote). So: Observation("genotype" number_of_individuals) Examples:
O("A1.B1." 3) O("A1.B1.B2." 5) O("A0.B1." 1)
The results are send to the standard output. It is a set of lines, each containing a haplotype description and its probability. When the results should be written to a file type:
NullHap input_file > output_filemain page