Skip NavigationSkip to Content

Characterization of mismatch and high-signal intensity probes associated with Affymetrix genechips

  1. Author:
    Wang, Y. H.
    Miao, Z. H.
    Pommier, Y.
    Kawasaki, E. S.
    Player, A.
  2. Author Address

    Ctr Canc Res, NCI, Mol Pharmacol Lab, Bethesda, MD 20892 USA. NCI, NIH, Microarray Core Facil, Bethesda, MD 20892 USA. SAIC Frederick Inc, NCI, Ft Detrick, MD 21702 USA.;Wang, YH, Chinese Acad Sci, Shanghai Inst Mat Med, Shanghai 201203, Peoples R China.;wangyong@mail.nih.gov
    1. Year: 2007
    2. Date: Aug
  1. Journal: Bioinformatics
    1. 23
    2. 16
    3. Pages: 2088-2095
  2. Type of Article: Article
  3. ISSN: 1367-4803
  1. Abstract:

    Motivation: For Affymetrix microarray platforms, gene expression is determined by computing the difference in signal intensities between perfect match ( PM) and mismatch ( MM) probesets. Although the use of PM is not controversial, MM probesets have been associated with variance and ultimately inaccurate gene expression calls. A principal focus of this study was to investigate the nature of the MM signal intensities and demonstrate its contribution to the experimental results. Results: While most MM intensities were likely associated with random noise, a subset of similar to 20% ( 99 485) of the MM probes displayed relatively high signal intensities to the corresponding PM probes (MM > PM) in a non-random fashion; 13 440 of these probes demonstrated exceptionally high 'outlier' intensities. About 15 938 PM probes also demonstrated exceptionally high outlier intensities consistently across all hybridizations. About 92% of the MM > PM probes had either a dThymidine (dT) or a dCytidine (dC) at the 13th position of the probe sequence. MM and PM probes displaying extremely high outlier intensities contained high dC rich nucleotides, and low dA contents at other nucleotides positions along the 25mer probe sequence. Differentially expressed genes generated using Genechip Operating System (GCOS) or modified PM-only methods were also examined. Of those candidate genes identified in the PM-only method, 157 of them were designated by GCOS as absent across all datasets and many others contained probes with MM > PM signal intensities. Our data suggests that MM intensity from PM signal can be a major source of error analysis, leading to fewer potentially biologically important candidate genes. Contact: wangyong@mail.nih.gov Supplementary information: Supplementary data are available at Bioinformatics online.

    See More

External Sources

  1. DOI: 10.1093/bioinformatics/btm306
  2. WOS: 000249818300008

Library Notes

  1. No notes added.
NCI at Frederick

You are leaving a government website.

This external link provides additional information that is consistent with the intended purpose of this site. The government cannot attest to the accuracy of a non-federal site.

Linking to a non-federal site does not constitute an endorsement by this institution or any of its employees of the sponsors or the information and products presented on the site. You will be subject to the destination site's privacy policy when you follow the link.

ContinueCancel