Wiki
Clone wikiTassel 5 Source / UserManual / WeightedMLM / WeightedMLM
Weighted MLM
Weighted MLM and MLM are very similar. They have the same results and interface except that weighted MLM can take a list of weights for the taxa.
Association analysis of replicated phenotypes using a mixed linear model can either use a one-stage or two-stage analysis. In a two-stage analysis, the phenotype data is evaluated first to produce a single estimate for each taxon (sample). Those estimates are then combined with genotypes to test associations. A one-stage analysis does both steps at once, fitting a model that has all of the individual observations. Because the MLM scales quadratically with the number of observations, a one-stage analysis is often not computationally feasible. If the phenotype data is balanced the one and two-stage methods produce the same result. Unbalanced data can result in biased effect estimates and loss of power for the two-stage model. The weighted MLM (Xue et al. 2017) weights each sample estimate by its error variance to produce results similar to the one-stage method but with the computational cost of the two-stage method.
The user interface and results for the weighted MLM are basically the same as the non-weighted version except that the weighted MLM can take a weight matrix as an additional argurment. If a weight matrix is not provided, then the analysis defaults to the non-weighted MLM. The weight matrix needs to be input as a phenotype file with two columns. The first column is taxa, and the second column is weight.
Using the TASSEL GUI: Do NOT join the weight matrix with the phenotype data. Instead select it as a third dataset when running the analysis. In other words, select the joint phenotype-genotype data, the kinship matrix, and the weight data then click Analysis/Association/WeightedMLM.
Example of a command line for running weighted MLM:
./run_pipeline.pl -Xmx4G -debug \ -fork1 -importGuess mdp_genotype.hmp.txt \ -FilterSiteBuilderPlugin -siteMinAlleleFreq 0.05 -endPlugin \ -fork2 -importGuess mdp_phenotype.txt \ -fork3 -importGuess kinship.txt \ -fork4 -importGuess weights.txt \ -combine5 -input1 -input2 -intersect \ -combine6 -input3 -input4 -input5 \ -WeightedMLMPlugin -endPlugin -export mlm_output
Reference: Shang Xue, Funda Ogut, Zachary Miller, Janu Verma, Peter J. Bradbury, James B. Holland (2017) Comparison of one-stage and two-stage genome-wide association studies. bioRxiv. doi: https://doi.org/10.1101/099291
Updated