Abstract:
We present a method for performing statistical valid linear regressions on the union of distributed chemical databases that preserves confidentiality of those databases. The method employs secure multi-party computation to share local sufficient statistics necessary to compute least squares estimators of regression coefficients, error variances and other quantities of interest. We illustrate with an example containing four companies’ rather different databases.
Keywords:
Chemical database, distributed data, regression model, secure multi-party computation
Publication Date:
Wednesday, June 1, 2005File Attachment:

Report Number:
152