From: A systematic review of deep learning chemical language models in recent era
 | Unbiased model | Target model | Samples | P-value |
---|---|---|---|---|
Training dataset size | 1,128,920 | 2507 | 17 |  < 0.0001 |
Validity | 98.05 | 95.5 | 10 | 0.1602 |
Uniqueness | 97.9 | 90.2 | 11 | 0.0144 |
Novelty | 91.6 | 96.0 | 8 | 0.8438 |