Analysis of high-dimensional structure-activity screening datasets using the optimal bit string Tree