PedGenie Haplotype Example

The Haplotype example allows a user to examine the joint inheritance of two or more markers, using phased data. A haplotype frequency file is required for this analysis. We have also included quantitative data in this analysis. If you have problems opening a file, at the error page try using toolbar View, Source or click on Page, View Source to view the file.

Files required to run this example are :

PhasedPedigree.dat - Linkage pre-makeped pedigree file with a header linePhasedPedigree.hap - Loci haplotype frequency data fileTrait.dat - Trait / Covariate data filePedGenieHaplotype.rgen - .rgen Parameter File

Output file of this analysis is :

PedGenieHaplotype.report

To run this analysis:

java -jar Genie.jar PedGenie Haplotype.rgen

PhasedPedigree.datThe example pedigree file is composed of 16 three generation families. Each individual has information on two markers.

Trait.datThe quantitative test requires a quantitative trait file. It contains the trait and covariate information for each individual.

Haplotype.rgenThe set-up of Haplotype.rgen is similar to the set-up of the SingleLocus.rgen. Only differences between the two files will be described in this section.

  1. top=”HapFreqTopSim” Instead of dropping alleles from the pedigree founders as we illustrated with the SingleLocus.rgen example, in this example haplotypes are dropped.
  2. In the locus section, loci are defined, as is the distance (‘dist’) between the loci. The ‘dist’ is defined as the distance between the marker and the proceeding marker. If the ‘dist’ is listed as a value <=0.5, it is assumed to be a recombination fraction. If the ‘dist’ is listed as a value >0.5 it is assumed to a value in centiMorgans (cM). The default value for ‘dist’ is 0.5, assuming linkage equilibrium between markers.
  3. The [“top-sample”] option of ‘all’, ‘founder’, or ‘GeneCounter’ is for allele frequency estimation. For composite genotype and haplotypes analyses, the [“top-sample”] option is ignored. Haplotype frequencies from the ‘.hap’ file are used instead.
  4. In the first cctable, we compare haplotype 1-2 versus the most common haplotype 1-1. Under wt=0, we define all of the situations where a haplotype 1-1 is observed. We point out to users that within a defined block, each line listed is joined together by an ‘AND’ statement. Between blocks are ‘OR’ statements. Thus in the first block, the first line states that the first locus (SNP1) is a ‘1/1’ genotype AND the second line states that the second locus (SNP2) also is the ‘1/1’ genotype. In the second block, we require that the first locus (SNP1) be a ‘1/2’ genotype and the second locus is the ‘1/1’ genotype. We continue on, listing five situations where the 1-1 phased haplotype is observed. Under wt=1, we define all of the scenarios possible that include haplotype 1-2. Thus, in the first block, we require SNP1 to be the 1/1 genotype and SNP2 must have a ‘2’ at either the first position or the second position [(./2) | (2/.)]. We list five scenarios where haplotype 1-2 is present.
  5. In the second and third cctable analyses, we illustrate how haplotype 2-2 can be compared to haplotype 1-1 and how haplotype 2-1 can be compared to haplotype 1-1.
  6. In the fourth cctable analysis, we illustrate how haplotype 1-2 can be compared to all other haplotypes. We take advantage of the wild card (‘.’) option available in PedGenie. The wild card option allows a user to exclude all patterns previously counted. Note that we begin with wt=1 rather than wt=0. The lowest weight group (in this case ‘all other genotype combinations’) is the reference group. We define the 1-2 haplotype, using the wild card option. Next, we define wt=0, and list for locus 1 (SNP1) that any other genotype is permissible (./.), and for SNP2 that any other genotype is permissible (./.). Although the reference group is wt=0, the order listed in the output file will be reversed. We have added to our Model name ‘REV’ as a reminder that the order in the output file is reversed.
  7. In the fifth and sixth cctable analyses, we illustrate how the 2-2 and 2-1 haplotypes can be compared to ‘all other’ haplotypes, respectively.
  8. In the seventh and eighth cctable analyses, we illustrate how the 2-2 haplotype can be analyzed in a dominant and recessive mode of inheritance, respectively.
  9. Finally in the ninth cctable analysis, we show the set-up for the additive test for haplotype 2-2. Three columns are defined, but in reverse order. All statistics available will be run, including the TDT tests.

PhasedPedigree.hapThe PhasedPedigree.hap frequency file lists each haplotype frequency with its corresponding haplotype. The haplotype frequencies listed must sum to ‘1’. Haplotypes with ‘0’ frequency may be listed, but they will be ignored. Haplotypes are defined as each marker, corresponding to the same order listed in the locus list defined in the ‘*.rgen’ file, separated by a ‘-‘.

PedGenieHaplotype.report output file

We refer users to the SingleLocus.report output for detailed information regarding the PedGenieHaplotype.report output file. Only differences between the two files will be described here.

  1. We point out to the users that in Analysis5, the column weights were switched so that we take advantage of the wild card option in PedGenie. Column 2 for this analysis is wt=0 and column 1 is wt=1.
  2. For Analysis9, note that in addition to the contingency table and quantitative tables, a trio TDT table showing parental genotype data and case genotype data is shown, and a combined TDT table is shown. The TDT analyses require set-up of 3 columns of data to run.

Home   PedGenie Example