ABSTRACT

Analyzing high-dimensional gene expression and DNA methylation data with R is the first practical book that shows a ``pipeline" of analytical methods with concrete examples starting from raw gene expression and DNA methylation data at the genome scale. Methods on quality control, data pre-processing, data mining, and further assessments are presented in the book, and R programs based on simulated data and real data are included. Codes with example data are all reproducible.

Features:
• Provides a sequence of analytical tools for genome-scale gene expression data and DNA methylation data, starting from quality control and pre-processing of raw genome-scale data.
• Organized by a parallel presentation with explanation on statistical methods and corresponding R packages/functions in quality control, pre-processing, and data analyses (e.g., clustering and networks).
• Includes source codes with simulated and real data to reproduce the results. Readers are expected to gain the ability to independently analyze genome-scaled expression and methylation data and detect potential biomarkers.

This book is ideal for students majoring in statistics, biostatistics, and bioinformatics and researchers with an interest in high dimensional genetic and epigenetic studies.

chapter Chapter 1|4 pages

Introduction

chapter Chapter 2|12 pages

Genome-scale gene expression data

chapter Chapter 3|22 pages

Genome-scale epigenetic data

chapter Chapter 4|17 pages

Screening genome-scale genetic and epigenetic data

chapter Chapter 5|43 pages

Cluster Analysis in Data mining

chapter Chapter 8|20 pages

Network construction and analyses