Publications
Preserving confidentiality of high-dimensional tabular data: Statistical and computational issues." STATISTICS AND COMPUTING 8 (2003): 363-370.
"Bounding Entries in Multi-way Contingency Tables Given a Set of Marginal Totals." In Foundations of Statistical Inference, edited by Yoel Haitovsky, Yaacov Ritov and HansRudolf Lerche, 3-16. Contributions to Statistics. Physica-Verlag HD, 2003.
"Disclosure Risk vs Data Utility: The R-U Confidentiality Map." Chance 17 (2004): 16-20.
"Does code decay? Assessing the evidence from change management data." In In IEEE Transactions on Software Engineering, 1-12., 2001.
"A Web laboratory for software data analysis." World Wide Web 1 (1998): 55-60.
"Visualizing Software Changes." INTERACTIONS 17 (2002): 29-31.
"Combining Estimates from Multiple Surveys." Wiley StatsRef: Statistics Reference Online.
"Small Area Estimates for End-of-Season Agricultural Quantities In JSM Proceedings. Survey Research Methods Section. Alexandria, VA: American Statistical Association., 2017.
Calibration using Constrained Smoothing with Application to Mass Spectrometry Data." Biometrics 70 (2014): 398-408.
"Homeland Insecurity." In Terrorism Informatics, edited by Hsinchun Chen, Edna Reid, Joshua Sinai, Andrew Silke and Boaz Ganor, 197-218. Vol. 18. Integrated Series In Information Systems 18. Springer US, 2008.
"Inferential, robust non-negative matrix factorization analysis of microarray data." Bioinformatics 23 (2007): 44-49.
"Evaluation of unmixing methods for the separation of Quantum Dot sources." In Hyperspectral Image and Signal Processing: Evolution in Remote Sensing, 2009. WHISPERS ’09. First Workshop on, 1-4., 2009.
"Predicting ozone levels and trends with semiparametric modeling." Journal of Agricultural, Biological, and Environmental Statistics 1 (1996): 404-425.
"Data Swapping as a Decision Problem." Journal of Official Statistics 21 (2005): 635-655.
"Parameter space exploration of an ocean general circulation model using an isopycnal mixing parameterization." Journal of Marine Research 52 (1994): 773-796.
"Conditional Genotypic Probabilities for Microsatellite Loci." Genetics 155 (2000): 1973-1980.
"An Empirical Study of Regression Test Selection Techniques." In Proceedings of the 20th International Conference on Software Engineering, 188-197. ICSE ’98. Washington, DC, USA: IEEE Computer Society, 1998.
"Inferring change effort from configuration management databases." In Software Metrics Symposium, 1998. Metrics 1998. Proceedings. Fifth International, 267-273., 1998.
"A Model for Relating Browsing Behavior to Site Design on the World Wide Web In Proceedings of JSM 2004. Alexandria, VA: American Statistical Association, 2004.
Techniques for classifying executions of deployed software to support software engineering tasks." IEEE TRANSACTIONS ON SOFTWARE ENGINEERING 33 (2007): 287-304.
"Exploring blood spectra for signs of ovarian cancer." Chance 16 (2003): 19-23.
"A spatio-temporal absorbing state model for disease and syndromic surveillance." Statistics in Medicine 31 (2012): 2123-2136.
"Meteorologically-dependent trends in urban ozone." Environmetrics 10 (1999): 103-118.
"Synthesizing route travel time distributions from segment travel time distributions." Trans. Res. Rec. (2013): 71-81.
"PharmID: Pharmacophore identification using Gibbs sampling." Journal of Chemical Information and Modeling 46 (2006): 1352-1359.
"Influence of microstructure and fracture on the transport properties in cement-based materials." In Brittle Matrix Composites - International Symposium, 199-220. Vol. 5., 1997.
"ChemModLab: A web-based cheminromates modeling laboratory." Cheminformatics 11 (2012): 61-81.
"A Hybrid High-Order Markov Chain Model for Computer Intrusion Detection." 10 (2001): 277-295.
"Analysis of high-dimensional structure-activity screening datasets using the optimal bit string Tree." Technomet 55 (2013): 161-173.
"Privacy-preserving analysis of vertically partitioned data using secure matrix products." Journal of Official Statistics 25 (2009): 125-138.
"Computer intrusion: detecting masqueraders." Statistical Science 16 (2001): 1-17.
"Bayesian CAR models for syndromic surveillance on multiple data streams: Theory and practice." Information Fusion 13 (2012): 105-116.
"Data confidentiality, data quality and data integration for federal databases." In Proc. dg.o 2004, National Conference on Digital Government Research, 91-92., 2004.
"Secure computation with horizontally partitioned data using adaptive regression splines." Journal of Computational Statistics and Data Analysis 51 (2007): 5813-5820.
"A Bayesian spatio-temporal approach for real-time detection of disease outbreaks: A case study." BMC Medical Informatics and Decision Making 14 (2014).
"Data dissemination and disclosure limitation in a world without microdata: A risk-utility framework for remote access analysis servers." Statistical Science 20 (2005): 163-177.
"Confidentiality and Data Access in the Use of Big Data: Theory and Practical Approaches.", edited by J. Lane, V. Stodden, H. Nissenbaum and S. Bender. Cambridge University Press, 2014.
"Workshop Report: Workshop on Statistics and Information Technology. National Institute of Statiatical Sciences, 2001.
Secure statistical analysis of distributed databases using partially trusted third parties. Manuscript in preparation." In In Statistical Methods in Counterterrorism: Game Theory, Modeling, Syndromic Surveillance, and Biometric Authentication, edited by D. Olwell, A. G.Wilson and G. Wilson. New York: Springer–Verlag, 2005.
"Regression on distributed databases via secure multi-party computation." In Proc. dg.o 2004, National Conference on Digital Government Research, 405-406., 2004.
"Preserving data utility via BART." Journal of Statistical Planning Inf. 140 (2010): 2551-2561.
"National Institute of Statistical Sciences (US)." In Encyclopedia of Environmetrics. Wiley, Chichester, 2002.
"Secure logistic regression with distributed databases." In Bulletin of International Statistics Institute., 2007.
"Masking methods that preserve positivity constraints in microdata." J. Statist. Planning Inf. 141 (2010): 31-41.
"Good Statistical Practice.", edited by C. E. Minder and F. Friedl, 175?179. Austrian Statistical Society, 1998.
"Visual Scalability." Journal Comp. Graphical Statistics 11 (2002): 22-43.
"Predicting fault incidence using software change history." IEEE Transportation Software Engineering 26 (2000): 653?661.
"Table servers protect confidentiality in tabular data releases." Comm. ACM 46 (2003): 57-58.
"Global measures of data utility for microdata masked for disclosure limitation." Journal of Privacy and Confidentiality 1 (2009): 111-124.
"Analytical frameworks for data release: A statistical view." In Confidentiality and Data Access in the Use of Big Data: Theory and Practical Approaches. . New York City, NY: Cambridge University Press, 2014.
"