write_after_distillation_data_split

write_after_distillation_data_split#

autoplex.fitting.common.utils.write_after_distillation_data_split(distillation, force_max, split_ratio, vasp_ref_name='vasp_ref.extxyz', train_name='train.extxyz', test_name='test.extxyz', force_label='REF_forces', energy_label='REF_energy')[source]#

Write train.extxyz and test.extxyz after data distillation and split.

Reject structures with large force components and split dataset into training and test datasets.

Parameters:

distillation (bool) – For using data distillation.
force_max (float) – Maximally allowed force in the data set.
split_ratio (float) – Parameter to divide the training set and the test set. A value of 0.1 means that the ratio of the training set to the test set is 9:1
vasp_ref_name (str) – name of the VASP reference data file.
train_name (str) – name of the training data file.
test_name (str) – name of the test data file.
force_label (str) – label of the force entries.
energy_label (str) – label of the energy entries.

Return type:

None

write_after_distillation_data_split

Contents

write_after_distillation_data_split#