Date Available
10-6-2016
Year of Publication
2016
Document Type
Doctoral Dissertation
Degree Name
Doctor of Philosophy (PhD)
College
Arts and Sciences
Department/School/Program
Statistics
Advisor
Dr. Arnold J. Stromberg
Co-Director of Graduate Studies
Dr. Chi Wang
Abstract
This paper intend to develop novel statistical methods to improve genomic data analysis, especially for differential analysis. We considered two different data type: NanoString nCounter data and somatic mutation data. For NanoString nCounter data, we develop a novel differential expression detection method. The method considers a generalized linear model of the negative binomial family to characterize count data and allows for multi-factor design. Data normalization is incorporated in the model framework through data normalization parameters, which are estimated from control genes embedded in the nCounter system. For somatic mutation data, we develop beta-binomial model-based approaches to identify highly or lowly mutated genes and to compare somatic mutations between patient groups. An empirical Bayes shrinkage approach is used to improve estimation of model parameters in all projects.
Digital Object Identifier (DOI)
https://doi.org/10.13023/ETD.2016.397
Recommended Citation
Wang, Hong, "IMPROVED MODELS FOR DIFFERENTIAL ANALYSIS FOR GENOMIC DATA" (2016). Theses and Dissertations--Statistics. 21.
https://uknowledge.uky.edu/statistics_etds/21