On Friday, May 3rd, at 10 a.m., Dr. Jan Hannig will give a talk at the Duke University School of Medicine titled "Data Integration Via Analysis of Subspaces (DIVAS)". Dr. Hannig is a board member of the National Institute of Statistical Sciences.
Abstract
A major challenge in the age of Big Data is the integration of disparate data types into a data analysis. That is tackled here in the context of data blocks measured on a common set of experimental subjects. This data structure motivates the simultaneous exploration of the joint and individual variation within each data block. This is done here in a way that scales well to large data sets (with blocks of wildly disparate size), using principal angle analysis, careful formulation of the underlying linear algebra, and differing outputs depending on the analytical goals. Ideas are illustrated using cancer and neuroimaging data sets.
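For readers unfamiliar with principal angle analysis, the following is a minimal sketch of how principal angles between two subspaces can be computed. It is not the DIVAS method itself. The function name `principal_angles`, the toy block sizes, and the simulated data are all assumptions made for illustration. Each toy block spans a subspace of the subject space, and the two blocks share one direction by construction.

```python
import numpy as np

def principal_angles(A, B):
    """Principal angles (radians, ascending) between the column spaces of A and B.

    Assumes A and B have the same number of rows and full column rank.
    """
    # Orthonormal bases for the two column spaces.
    Qa, _ = np.linalg.qr(A)
    Qb, _ = np.linalg.qr(B)
    # Singular values of Qa^T Qb are the cosines of the principal angles.
    cosines = np.linalg.svd(Qa.T @ Qb, compute_uv=False)
    # Clip to guard against round-off slightly outside [-1, 1].
    return np.arccos(np.clip(cosines, -1.0, 1.0))

# Toy data (hypothetical): 50 subjects, two blocks sharing one score direction.
rng = np.random.default_rng(0)
n_subjects = 50
shared = rng.standard_normal((n_subjects, 1))

block1 = np.hstack([shared, rng.standard_normal((n_subjects, 1))])  # 2 directions
block2 = np.hstack([shared, rng.standard_normal((n_subjects, 2))])  # 3 directions

angles = principal_angles(block1, block2)
print(np.degrees(angles))
```

In this toy setup the smallest angle is near zero, reflecting the shared (joint) direction, while the remaining angle is large, reflecting variation individual to each block.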