Si-2021-08-04-all-icsd

../../_images/pareto157.png

The current structure dataset comprises 13264 structures generated from unique ICSD prototype structures composed of single elements with zero oxidation state. A more detailed procedure is found in Phys. Rev. B 99, 214108 (2019). The procedure to estimate interatomic potentials from the dataset is found in Phys. Rev. B 99, 214108 (2019) and Phys. Rev. B 102, 174104 (2020).

Improvement from **-dataset-10000-all-icsd

  • More robust for structures with a small interatomic distance

  • More robust for structures with a large interatomic distance

  • More complex potential models are included.

  • MLPs are estimated without using DFT stress tensors.

  • MLPs are estimated by using small regression weights for energetically unstable structures.

Predictions using Pareto optimal MLPs

../../_images/prediction-ecoh-volume157.png

The cohesive energy and volume are obtained by performing a local structure optimization from the DFT equilibrium structure. In addition, the DFT equilibrium structure is obtained by optimizing a prototype structure included in ICSD, and the prototype is used as the structure legend in the figure. Therefore, the structure type of the converged structure is sometimes different from that shown in the legend even if the potential energy surface predicted by MLP is almost the same as the true one.

The other properties predicted by each Pareto optimal MLP are available from column Predictions in the following table.

Si-2021-08-04-all-icsd shows large prediction errors. They should be carefully used. Such an MLP is often accurate for reasonable structures, but it is not accurate for unrealistic structures.

Pareto optimals

Name

Time [ms] (1core/36cores)

RMSE [meV/atom]/[eV/A]

Predictions

Files

pair-27

0.018 / 0.005

44.970 / 0.2028

pair-28

0.027 / 0.007

43.429 / 0.2251

pair-31

0.050 / 0.007

35.972 / 0.1910

pair-32

0.051 / 0.010

34.690 / 0.1864

pair-33

0.069 / 0.012

31.769 / 0.1880

pair-37

0.136 / 0.018

27.344 / 0.1714

pair-38

0.171 / 0.022

26.797 / 0.1684

gtinv-300

0.228 / 0.023

14.498 / 0.1059

gtinv-235

0.281 / 0.023

11.869 / 0.0893

gtinv-312

0.472 / 0.036

8.9804 / 0.0889

predictions

mlp.lammps input log

gtinv-190

0.523 / 0.038

8.5179 / 0.0802

predictions

mlp.lammps input log

gtinv-236

0.804 / 0.059

7.3617 / 0.0786

predictions

mlp.lammps input log

gtinv-172

1.152 / 0.075

6.6844 / 0.0764

predictions

mlp.lammps input log

gtinv-237

1.177 / 0.073

6.5564 / 0.0748

predictions

mlp.lammps input log

gtinv-313

1.590 / 0.102

5.0332 / 0.0755

predictions

mlp.lammps input log

gtinv-191

1.600 / 0.102

4.5847 / 0.0699

predictions

mlp.lammps input log

gtinv-256

1.780 / 0.109

4.2431 / 0.0675

predictions

mlp.lammps input log

gtinv-192

2.106 / 0.124

4.1695 / 0.0673

predictions

mlp.lammps input log

gtinv-257

2.164 / 0.129

3.9191 / 0.0651

predictions

mlp.lammps input log

gtinv-193

3.465 / 0.217

3.6334 / 0.0653

predictions

mlp.lammps input log

gtinv-258

3.556 / 0.220

3.3380 / 0.0631

predictions

mlp.lammps input log

gtinv-259

4.078 / 0.244

3.1236 / 0.0614

predictions

mlp.lammps input log

gtinv-287

6.946 / 0.371

2.9180 / 0.0596

predictions

mlp.lammps input log

gtinv-223

13.159 / 0.653

2.8965 / 0.0600

predictions

mlp.lammps input log

gtinv-288

13.824 / 0.659

2.6584 / 0.0581

predictions

mlp.lammps input log

gtinv-289

14.825 / 0.712

2.5353 / 0.0568

predictions

mlp.lammps input log

Column “Time” shows the time required to compute the energy and forces for 1 MD step and 1 atom, which is estimated from a simulation of 10 runs for a structure with 284 atoms using a workstation with Intel(R) Xeon(R) CPU E5-2695 v4 @ 2.10GHz. Note that the MLPs should be carefully used for extreme structures. The MLPs often return meaningless values for them.

  • All Pareto optimal MLPs are available here.