Combining Information From Multiple Data Sources to Create Multivariable Risk Models: Illustration and Preliminary Assessment of a New Method

Samsa, Greg; Hu, Guizhou; Root, Martin

doi:https://doi.org/10.1155/JBB.2005.113

BioMed Research International

On this page

Abstract Copyright Related Articles

Special Issue

Data Mining in Genomics and Proteomics

View this Special Issue

Research article | Open Access

Volume 2005 | Article ID 524952 | https://doi.org/10.1155/JBB.2005.113

Combining Information From Multiple Data Sources to Create Multivariable Risk Models: Illustration and Preliminary Assessment of a New Method

Greg Samsa,^1,2Guizhou Hu,³and Martin Root³

Received05 Feb 2004

Revised29 Mar 2004

Accepted06 Apr 2004

Abstract

A common practice of metanalysis is combining the results of numerous studies on the effects of a risk factor on a disease outcome. If several of these composite relative risks are estimated from the medical literature for a specific disease, they cannot be combined in a multivariate risk model, as is often done in individual studies, because methods are not available to overcome the issues of risk factor colinearity and heterogeneity of the different cohorts. We propose a solution to these problems for general linear regression of continuous outcomes using a simple example of combining two independent variables from two sources in estimating a joint outcome. We demonstrate that when explicitly modifying the underlying data characteristics (correlation coefficients, standard deviations, and univariate betas) over a wide range, the predicted outcomes remain reasonable estimates of empirically derived outcomes (gold standard). This method shows the most promise in situations where the primary interest is in generating predicted values as when identifying a high-risk group of individuals. The resulting partial regression coefficients are less robust than the predicted values.

Copyright

Copyright © 2005 Hindawi Publishing Corporation. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

PDF Download Citation Order printed copies

Views

298

Downloads

1049

Citations