Publications
Estimation of propensity scores using generalized additive models." Statisics in Medicine 27 (2008): 3806-3816.
"Homeland Insecurity." In Terrorism Informatics, edited by Hsinchun Chen, Edna Reid, Joshua Sinai, Andrew Silke and Boaz Ganor, 197-218. Vol. 18. Integrated Series In Information Systems 18. Springer US, 2008.
"Low-fat dietary pattern and cancer incidence in the Women’s Health Initiative Dietary Modification Randomized Controlled Trial." Journal of National Cancer Institute 100 (2008): 284.
"Pooled ANOVA." Computational Statistics & Data Analysis 52 (2008): 5215.
"Sensitivity to noise variance in a social network dynamics model." Q. Applied Mathematics 66 (2008): 233-247.
"Social Networks." In Encyclopedia of Risk Assessment IV. Wiley, 2008.
"Computer Model Validation with Functional Output." Annals of Statistics 35, no. 5 (2007): 1874-190.
"Exploration of cluster structure-activity relationship analysis in efficient high-throughput screening." Journal of Chemical Information and Modeling 47 (2007): 1206-1214.
"Inferential, robust non-negative matrix factorization analysis of microarray data." Bioinformatics 23 (2007): 44-49.
"Secure computation with horizontally partitioned data using adaptive regression splines." Journal of Computational Statistics and Data Analysis 51 (2007): 5813-5820.
"Secure logistic regression with distributed databases." In Bulletin of International Statistics Institute., 2007.
"Statistics in metrology: International key comparisons and interlaboratory studies." Journal of Data Science 5 (2007): 393-412.
"Techniques for classifying executions of deployed software to support software engineering tasks." IEEE TRANSACTIONS ON SOFTWARE ENGINEERING 33 (2007): 287-304.
"Clustering Scotch Whiskies using Non-Negative Matrix Factorization." Q&SPES News 14 (2006): 11-13.
"Combinations of SDC methods for microdata protection." In Privacy in Statistical Databases: CENEX–SDC Project International Conference, PSD 2006 Rome, Italy, December 13–15, 2006 Proceedings, edited by J. Domingo–Ferrer and L. Franconi., 2006.
"Data quality: A statistical perspective." Statistical Methodology 3 (2006): 137-173.
"A framework for evaluating the utility of data altered to protect confidentiality." The American Statistician 60 (2006): 224-232.
"PharmID: Pharmacophore identification using Gibbs sampling." Journal of Chemical Information and Modeling 46 (2006): 1352-1359.
"Secure, privacy-preserving analysis of distributed databases." Technometrics 48 (2006): 133-143.
"Statistical analysis for multiple artifact problem in key comparisons with linear trends." Metrologia 43 (2006): 21-26.
"Statistical design of pools using optimal coverage and minimal collision." Technom 48 (2006): 133-143.
"Survey Costs: Workshop Report and White Paper. National Institute of Statistical Sciences, 2006.
Applying classification techniques to remotely-collected program execution data In Proc. ACM SIGSOFT Symposium Foundations of Software Engineering 2005. New York: ACM, 2005.
Data dissemination and disclosure limitation in a world without microdata: A risk-utility framework for remote access analysis servers." Statistical Science 20 (2005): 163-177.
"Data quality and data confidentiality for microdata: implications and strategies." In Bull. International Statistical Inst., 55th Session., 2005.
"Data Swapping as a Decision Problem." Journal of Official Statistics 21 (2005): 635-655.
"Default Priors for Gaussian Processes." Annals of Statistics 33 (2005): 556-582.
"Discussion of ‘The impact of technology on the scientific method' by S. Keller–McNulty, A. G.Wilson and G. Wilson." Chance 18 (2005): 1.
"Distributed performance testing using statistical modeling." In ICSE 2005 Workshop on Advances in Model-Based Software Testing (A-MOST)., 2005.
"National Institute of Statistical Sciences/Education Statistics Services Institute Task Force on Graduation, Completion and Dropout Indicators: Final Report. US Department of Education, Institute of Education Sciences, NCES, 2005.
PowerMV: A Software Environment for Molecular Viewing, Descriptor Generation, Data Analysis and Hit Evaluation." Journal of Chemical Information and Modeling 45 (2005): 515-522.
"Recursive partitioning as a tool for pharmcogenetic studies of complex diseases: II. Statistical considerations." Pharmacogenomics 6 (2005): 77-89.
"Sample size calculation for multiple testing in microarray data analysis." Biostatistics 6 (2005): 157-169.
"Secure analysis of distributed chemical databases without data integration." J. Computer-Aided Molecular Design 19 (2005): 739-747.
"Secure Regression on Distributed Databases." J. Computational and Graphical Statist 14 (2005): 263-279.
"Secure statistical analysis of distributed databases using partially trusted third parties. Manuscript in preparation." In In Statistical Methods in Counterterrorism: Game Theory, Modeling, Syndromic Surveillance, and Biometric Authentication, edited by D. Olwell, A. G.Wilson and G. Wilson. New York: Springer–Verlag, 2005.
"A statistical meteorologist looks at computational system models." In Proceedings of 2004 Workshop on Verification & Validation of Computer Models of High-consequence Engineering Systems., 2005.
"Title IX Data Collection: Technical Manual for Developing the User’s Guide. National Institute of Statistical Sciences, 2005.
Analysis of integrated data without data integration." Chance 17 (2004): 26-29.
"Calibration and Validation of Macroscopic, Deterministic Traffic Models. Vol. Masters. Raleigh: North Carolina State University, 2004.
Data confidentiality, data quality and data integration for federal databases." In Proc. dg.o 2004, National Conference on Digital Government Research, 91-92., 2004.
"Design of diversity and focused combinatorial libraries in drug discovery." Current Opinion in Drug Discovery & Development 7 (2004): 318-324.
"Disclosure Risk vs Data Utility: The R-U Confidentiality Map." Chance 17 (2004): 16-20.
"How Large Is the World Wide Web?" In Web Dynamics, 23-43. Springer Berlin Heidelberg, 2004.
"A Model for Relating Browsing Behavior to Site Design on the World Wide Web In Proceedings of JSM 2004. Alexandria, VA: American Statistical Association, 2004.
Privacy preserving regression modelling via distributed computation." In Proc. Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 677-682., 2004.
"Regression on distributed databases via secure multi-party computation." In Proc. dg.o 2004, National Conference on Digital Government Research, 405-406., 2004.
"Secure regression for vertically partitioned, partially overlapping data." In ASA Proceedings 2004., 2004.
"Traffic Simulation Failure Detection and Analysis. Vol. Ph.D. Raleigh: North Carolina State University, 2004.
Bayesian Stochastic Computation with application to Model Selection and Inverse Problems. Durham: Duke University, 2003.