Faculty Publications

Efficiently identifying genome-wide changes with next-generation sequencing data

Weichun Huang, National Institute of Environmental Health Sciences (NIEHS)
David M. Umbach, National Institute of Environmental Health Sciences (NIEHS)
Nicole Vincent Jordan, The University of North Carolina at Chapel Hill
Amy N. Abell, The University of North Carolina at Chapel HillFollow

Abstract

We propose a new and effective statistical framework for identifying genome-wide differential changes in epigenetic marks with ChIP-seq data or gene expression with mRNA-seq data, and we develop a new software tool EpiCenter that can efficiently perform data analysis. The key features of our framework are: (i) providing multiple normalization methods to achieve appropriate normalization under different scenarios, (ii) using a sequence of three statistical tests to eliminate background regions and to account for different sources of variation and (iii) allowing adjustment for multiple testing to control false discovery rate (FDR) or family-wise type I error. Our software EpiCenter can perform multiple analytic tasks including: (i) identifying genome-wide epigenetic changes or differentially expressed genes, (ii) finding transcription factor binding sites and (iii) converting multiple-sample sequencing data into a single read-count data matrix. By simulation, we show that our framework achieves a low FDR consistently over a broad range of read coverage and biological variation. Through two real examples, we demonstrate the effectiveness of our framework and the usages of our tool. In particular, we show that our novel and robust 'parsimony' normalization method is superior to the widely-used 'tagRatio' method. Our software EpiCenter is freely available to the public. © The Author(s) 2011. Published by Oxford University Press.

Publication Title

Nucleic Acids Research

Recommended Citation

Huang, W., Umbach, D., Vincent Jordan, N., & Abell, A. (2011). Efficiently identifying genome-wide changes with next-generation sequencing data. Nucleic Acids Research, 39 (19) https://doi.org/10.1093/nar/gkr592

Link to Full Text

COinS

Faculty Publications

Efficiently identifying genome-wide changes with next-generation sequencing data

Abstract

Publication Title

Recommended Citation

Search

Browse

Author Corner

Libraries

Faculty Publications

Efficiently identifying genome-wide changes with next-generation sequencing data

Authors

Abstract

Publication Title

Recommended Citation

Share

Search

Browse

Author Corner

Libraries