ANGSD: Analysis of next generation Sequencing Data

Latest tar.gz version is (0.938/0.939 on github), see Change_log for changes, and download it here.

# Difference between revisions of "SYKmaf"

From angsd

(→ML estimator with unknown minor) |
|||

Line 18: | Line 18: | ||

<math> | <math> | ||

− | P(D|M,f) = \prod_i P(D_i|M,f) = | + | P(D|M,f) = \prod_i P(D_i|M,f) = \sum_m \sum_{A_1,A_2} P(D_i|G=A_1A_2)p(G=A_1A_2|m,M)p(m), |

</math> | </math> |

## Latest revision as of 19:50, 15 December 2014

## ML estimator with known minor

First infer the Major and Minor allele and then use BFGS (-doMaf 1) optimazation or the EM algorithm (-doMaf 2) to estimate the allele frequencies.

## ML estimator with unknown minor

First infer the Major allele and then use BFGS (-doMaf 4) optimazation or the EM algorithm (-doMaf 8) to estimate the allele frequencies. Here only the Major allele needs to be known and the uncertaincy of infering the minor allele is modelled.

Let denote the major an minor allele assuming adiallelic site, then the maximum likelihood estimate of this pair is found using the likelihood function