bioRxiv
New Results

Functional normalization of 450k methylation array data improves replication in large cancer studies

Jean-Philippe Fortin , Aurelie Labbe , Mathieu Lemire , Brent W. Zanke , Thomas J. Hudson , Elana J. Fertig , Celia M.T. Greenwood , Kasper D. Hansen

Abstract

We propose an extension to quantile normalization which removes unwanted technical variation using control probes. We adapt our algorithm, functional normalization, to the Illumina 450k methylation array and address the open problem of normalizing methylation data with global epigenetic changes, such as human cancers. Using datasets from The Cancer Genome Atlas and a large case-control study, we show that our algorithm outperforms all existing normalization methods with respect to replication of results between experiments, and yields robust results even in the presence of batch effects. Functional normalization can be applied to any microarray platform, provided suitable control probes are available.