Skip NavigationSkip to Content

Statistical Resolution of Ambiguous HLA Typing Data

  1. Author:
    Listgarten, J.
    Brumme, Z.
    Kadie, C.
    Gao, X. J.
    Walker, B.
    Carrington, M.
    Goulder, P.
    Heckerman, D.
  2. Author Address

    Listgarten, Jennifer, Kadie, Carl, Heckerman, David] Microsoft Res, Redmond, WA USA. [Brumme, Zabrina, Walker, Bruce, Goulder, Philip] Harvard Univ, Massachusetts Gen Hosp, Sch Med, Partners AIDS Res Ctr, Boston, MA USA. [Gao Xiaojiang, Carrington, Mary] NCI, SAIC Frederick, Frederick, MD 21701 USA. [Walker, Bruce] Howard Hughes Med Inst, Frederick, MD USA. [Goulder, Philip] Univ Oxford, Dept Paediat, Oxford, England.
    1. Year: 2008
  1. Journal: Plos Computational Biology
    1. 4
    2. 2
  2. Type of Article: Article
  3. Article Number: e1000016
  1. Abstract:

    High-resolution HLA typing plays a central role in many areas of immunology, such as in identifying immunogenetic risk factors for disease, in studying how the genomes of pathogens evolve in response to immune selection pressures, and also in vaccine design, where identification of HLA-restricted epitopes may be used to guide the selection of vaccine immunogens. Perhaps one of the most immediate applications is in direct medical decisions concerning the matching of stem cell transplant donors to unrelated recipients. However, high-resolution HLA typing is frequently unavailable due to its high cost or the inability to re-type historical data. In this paper, we introduce and evaluate a method for statistical, in silico refinement of ambiguous and/or low-resolution HLA data. Our method, which requires an independent, high-resolution training data set drawn from the same population as the data to be refined, uses linkage disequilibrium in HLA haplotypes as well as four-digit allele frequency data to probabilistically refine HLA typings. Central to our approach is the use of haplotype inference. We introduce new methodology to this area, improving upon the Expectation-Maximization (EM)based approaches currently used within the HLA community. Our improvements are achieved by using a parsimonious parameterization for haplotype distributions and by smoothing the maximum likelihood (ML) solution. These improvements make it possible to scale the refinement to a larger number of alleles and loci in a more computationally efficient and stable manner. We also show how to augment our method in order to incorporate ethnicity information (as HLA allele distributions vary widely according to race/ethnicity as well as geographic area), and demonstrate the potential utility of this experimentally. A tool based on our approach is freely available for research purposes at http://microsoft.com/science.

    See More

External Sources

  1. DOI: 10.1371/journal.pcbi.1000016
  2. PMID: 18392148

Library Notes

  1. No notes added.
NCI at Frederick

You are leaving a government website.

This external link provides additional information that is consistent with the intended purpose of this site. The government cannot attest to the accuracy of a non-federal site.

Linking to a non-federal site does not constitute an endorsement by this institution or any of its employees of the sponsors or the information and products presented on the site. You will be subject to the destination site's privacy policy when you follow the link.

ContinueCancel