Fig. 4

Compound diversity of the 1000 top-scoring compounds. The nearest-neighbor Tanimoto similarity was calculated for each of the 1000 compounds and plotted as a fitted histogram. This was done for each of the three predictive models: BaSH (green), HTSFP (orange), and ECFP4 (blue).
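
As an illustration of the quantity plotted here, the following is a minimal sketch of a nearest-neighbor Tanimoto calculation. It assumes each compound is represented as a set of on-bit indices from a binary structural fingerprint; the caption does not state which fingerprint or toolkit was used for the diversity analysis, so the function names and the toy data are hypothetical.

```python
from typing import List, Set


def tanimoto(a: Set[int], b: Set[int]) -> float:
    """Tanimoto similarity between two fingerprints given as sets of on-bit indices."""
    if not a and not b:
        return 0.0  # convention chosen for this sketch: two empty fingerprints score 0
    shared = len(a & b)
    return shared / (len(a) + len(b) - shared)


def nearest_neighbor_similarities(fps: List[Set[int]]) -> List[float]:
    """For each fingerprint, return its highest Tanimoto similarity to any other one."""
    result = []
    for i, fp in enumerate(fps):
        best = max(
            (tanimoto(fp, other) for j, other in enumerate(fps) if j != i),
            default=0.0,  # a single-compound list has no neighbor
        )
        result.append(best)
    return result


# Toy fingerprints (hypothetical data, not from the study).
toy_fps = [{1, 2, 3, 4}, {1, 2, 3, 5}, {7, 8, 9}]
nn_sims = nearest_neighbor_similarities(toy_fps)
```

Under these assumptions, a distribution of nearest-neighbor values shifted toward lower similarity would indicate a more structurally diverse set of top-scoring compounds. The all-pairs loop is quadratic in the number of compounds, which is manageable for a list of 1000.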