Table 3 Comparison of different positional embeddings (PEs) and position encoding methods during pretraining of the BERT model on SMILES data

From: Positional embeddings and zero-shot learning using BERT for molecular-property prediction

| PEs and position encoding | Training time (h) | Optimal learning rate | Accuracy |
|---|---|---|---|
| Absolute | 167 | 8.14e−5 | 0.9568 |
| Relative_key | 180 | 8.68e−6 | 0.9746 |
| Relative_key_query | 120 | 4.59e−6 | 0.9763 |
| Sinusoidal [52] | 105 | 1.67e−7 | 0.9755 |

Note: Bold value denotes the best performance achieved.
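
For readers unfamiliar with the sinusoidal row in the table, the following is a minimal sketch of a fixed sine/cosine position encoding. It assumes the widely used formulation in which even dimensions hold a sine and odd dimensions a cosine of the position scaled by a geometric series of wavelengths with base 10000; the exact variant cited as [52] is not shown in this table and may differ. The function name `sinusoidal_position_encoding` and its parameters are hypothetical and chosen for illustration only.

```python
import math


def sinusoidal_position_encoding(num_positions, dim):
    """Build a num_positions x dim table of fixed sinusoidal encodings.

    Assumed formulation (not taken from the table itself):
        PE[pos][2k]     = sin(pos / 10000 ** (2k / dim))
        PE[pos][2k + 1] = cos(pos / 10000 ** (2k / dim))
    """
    if dim % 2 != 0:
        raise ValueError("dim must be even so sine/cosine pairs line up")

    table = []
    for pos in range(num_positions):
        row = []
        # i walks over the even dimension indices 0, 2, 4, ...
        for i in range(0, dim, 2):
            angle = pos / (10000 ** (i / dim))
            row.append(math.sin(angle))  # even index
            row.append(math.cos(angle))  # odd index
        table.append(row)
    return table


# Example: encodings for 4 token positions in an 8-dimensional space.
encodings = sinusoidal_position_encoding(4, 8)
```

Because this encoding is computed rather than learned, it adds no trainable parameters, unlike the absolute and relative embedding variants listed above.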