Faculty Publications

A Bayesian approach for inducing sparsity in generalized linear models with multi-category response

Behrouz Madahian, University of Memphis
Sujoy Roy, University of Memphis
Dale Bowman, University of Memphis
Lih Y. Deng, University of Memphis
Ramin Homayouni, University of Memphis

Abstract

Background: The dimension and complexity of high-throughput gene expression data create many challenges for downstream analysis. Several approaches exist to reduce the number of variables with respect to small sample sizes. In this study, we utilized the Generalized Double Pareto (GDP) prior to induce sparsity in a Bayesian Generalized Linear Model (GLM) setting. The approach was evaluated using a publicly available microarray dataset containing 99 samples corresponding to four different prostate cancer subtypes. Results: A hierarchical Sparse Bayesian GLM using GDP prior (SBGG) was developed to take into account the progressive nature of the response variable. We obtained an average overall classification accuracy between 82.5% and 94%, which was higher than Support Vector Machine, Random Forest or a Sparse Bayesian GLM using double exponential priors. Additionally, SBGG outperforms the other 3 methods in correctly identifying pre-metastatic stages of cancer progression, which can prove extremely valuable for therapeutic and diagnostic purposes. Importantly, using Geneset Cohesion Analysis Tool, we found that the top 100 genes produced by SBGG had an average functional cohesion p-value of 2.0E-4 compared to 0.007 to 0.131 produced by the other methods. Conclusions: Using GDP in a Bayesian GLM model applied to cancer progression data results in better subclass prediction. In particular, the method identifies pre-metastatic stages of prostate cancer with substantially better accuracy and produces more functionally relevant gene sets.

Publication Title

BMC Bioinformatics

Recommended Citation

Madahian, B., Roy, S., Bowman, D., Deng, L., & Homayouni, R. (2015). A Bayesian approach for inducing sparsity in generalized linear models with multi-category response. BMC Bioinformatics, 16 (13) https://doi.org/10.1186/1471-2105-16-S13-S13

Link to Full Text

COinS

Faculty Publications

A Bayesian approach for inducing sparsity in generalized linear models with multi-category response

Abstract

Publication Title

Recommended Citation

Search

Browse

Author Corner

Libraries

Faculty Publications

A Bayesian approach for inducing sparsity in generalized linear models with multi-category response

Authors

Abstract

Publication Title

Recommended Citation

Share

Search

Browse

Author Corner

Libraries