Fig. 3

Molecular features distribution of the data set using A Extended circular fingerprints (ECFP), B MACCS keys fingerprints, C RDKit fingerprints, D Physicochemical descriptors, and E SMILES tokens. Red arrows indicate the unique non-overlap island of chemicals in each group