Computing Reviews
Massive parallelization of serial inference algorithms for a complex generalized linear model
Suchard M., Simpson S., Zorych I., Ryan P., Madigan D. ACM Transactions on Modeling and Computer Simulation 23(1): 1-17, 2013. Type: Article
Date Reviewed: Apr 12 2013

Computers with multicore central processing units (CPUs) and multiple graphics processing units (GPUs) are now widely available to speed up computationally intensive statistical prediction algorithms. Unfortunately, many of the existing statistical algorithms [1], designed to cope with the real-time scrutiny of record-keeping systems in areas such as healthcare, are still sequential and therefore computationally inefficient. How should efficient statistical algorithms be designed to uncover and exploit the current and historical trends in medical claims databases, so as to reliably identify the medical products associated with adverse events such as myocardial infarction or acute renal and liver failure?

The authors of this paper critique the limitations of existing statistical algorithms for coping with regulation and compliance issues in the healthcare industry. They recognize the need to exploit the parallelization capability of GPUs for fitting generalized linear models (GLMs), which requires maximizing computationally expensive log-likelihood functions. Readers who are unfamiliar with computational statistics should browse Kennedy and Gentle's introduction [1] to sequential algorithms for unconstrained optimization and nonlinear regression before exploring the insightful parallel algorithms used in this paper to fit GLMs with Bayesian priors or parameter regularization.

The authors present a sequential cyclic coordinate descent algorithm for fitting the familiar Bayesian self-controlled case series model. The algorithm's bottleneck, the repeated computation of one-dimensional (1D) gradients and Hessians, is the target of extensive parallelization. The authors cleverly show how to represent and manipulate sparse matrices and dense vectors in parallel to derive the gradients and Hessians, and apply the parallel algorithms to compute maximum a posteriori (MAP) estimates on large observational healthcare databases. Using GPUs to perform the sparse operations significantly speeds up MAP estimation compared to executing either sparse or dense computations on CPUs.
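
To make the computational pattern concrete, the following is a minimal sketch in Python/SciPy, not the authors' implementation, of cyclic coordinate descent for a simpler ridge-penalized Poisson GLM; the function name, the plain Poisson likelihood, and all parameters are illustrative stand-ins for the paper's Bayesian self-controlled case series model. It shows why each coordinate update reduces to sums over the nonzero entries of a single sparse column, which is the per-coordinate work the authors offload to the GPU.

```python
import numpy as np
from scipy import sparse

def ccd_poisson_ridge(X, y, prior_var=1.0, n_cycles=50, tol=1e-6):
    """Cyclic coordinate descent for a ridge-penalized Poisson GLM (an
    illustrative stand-in for the paper's self-controlled case series
    likelihood).

    Each coordinate update needs only a 1D gradient and Hessian, and both
    reduce to sums over the nonzero entries of one sparse column of the
    design matrix -- the per-coordinate work that lends itself to massive
    parallelization.
    """
    X = sparse.csc_matrix(X)          # column slices are cheap in CSC format
    n, p = X.shape
    beta = np.zeros(p)
    eta = np.zeros(n)                 # running linear predictor X @ beta

    for _ in range(n_cycles):
        max_step = 0.0
        for j in range(p):
            col = X.getcol(j)                     # sparse column j
            rows, vals = col.indices, col.data    # its nonzero pattern
            mu = np.exp(eta[rows])                # Poisson means where x_ij != 0
            # 1D gradient and Hessian of the penalized log-likelihood in beta_j
            grad = vals @ (y[rows] - mu) - beta[j] / prior_var
            hess = -(vals * vals) @ mu - 1.0 / prior_var
            # bounded Newton step keeps exp(eta) numerically stable
            step = np.clip(-grad / hess, -1.0, 1.0)
            beta[j] += step
            eta[rows] += step * vals              # keep the linear predictor in sync
            max_step = max(max_step, abs(step))
        if max_step < tol:
            break
    return beta
```

At the scale the paper targets, the sparse reductions inside the inner loop dominate the run time; replacing them with parallel GPU kernels rather than a serial loop is what produces the reported acceleration of MAP estimation.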

Exploiting parallel algorithms to fit complex GLMs to huge datasets offers new opportunities for associating adverse events with specific drugs while controlling for covariates such as patient demographics, coexisting diseases, and concomitant drugs. However, a full Bayesian analysis of the entire set of unknown parameters is missing from the proposed model. The authors clearly recognize the roles of cross-validation and bootstrapping in estimating the model's hyperparameters. But are accurate estimates of those hyperparameters really computationally infeasible, as the authors claim? I strongly encourage all computational statisticians to read this perceptive paper and weigh in on this question.
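
To illustrate the hyperparameter question raised above, here is a minimal sketch, assuming the hypothetical ccd_poisson_ridge routine from the earlier sketch, of choosing a single prior variance by k-fold cross-validation; the function name and the candidate grid are illustrative, not the authors' procedure.

```python
import numpy as np
from scipy import sparse

def cv_prior_variance(X, y, candidates, k=10, seed=0):
    """Pick the prior variance (the model's hyperparameter) by k-fold
    cross-validation, scoring each candidate by held-out Poisson
    log-likelihood (up to an additive constant).

    Relies on the illustrative ccd_poisson_ridge() sketched above; with one
    fit per fold and per candidate, the cost is k * len(candidates) full
    model fits -- the expense the authors regard as infeasible at scale.
    """
    X = sparse.csr_matrix(X)                       # row slicing for the folds
    folds = np.random.default_rng(seed).integers(0, k, size=X.shape[0])
    scores = []
    for v in candidates:
        total = 0.0
        for f in range(k):
            train, test = folds != f, folds == f
            beta = ccd_poisson_ridge(X[train], y[train], prior_var=v)
            eta = X[test] @ beta                   # held-out linear predictor
            total += float(np.sum(y[test] * eta - np.exp(eta)))
        scores.append(total)
    return candidates[int(np.argmax(scores))]
```

Whether such a grid search remains out of reach once each individual fit is GPU-accelerated is precisely the question posed above.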

Reviewer:  Amos Olagunju Review #: CR141131 (1307-0638)
1) Kennedy, W. J. Jr.; Gentle, J. E. Statistical computing. M. Dekker, New York, NY, 1980.
Statistical Computing (G.3 ... )
 
