Replies: 1 comment 1 reply
-
@mm21d013 Hi, could you provide more detailed information about the training sets and test errors in the two cases you discussed? For example, in case 2, where you "only take P6₃/mmc, and C12/m1 structures", what structures are in the training set? How was the formation energy calculated? Is it from relaxation results or just static energy calculation? As training error is going down, I suppose static energy calculation is becoming more accurate, while relaxation might still be challenging if training structures do not cover sufficient space. From my own experience, if we have sufficient sampling of structures related to one local minimum structure, e.g., the relaxed structure of Si with P6₃/mmc space group, the MLIP should be able to get reasonable matching with DFT for that local minimum. But if the sampling is not good enough, it is normal for MLIP to have unreliable extrapolation results. Sufficient sampling usually requires inclusion of equation of state structures, strained structures, relaxation trajectory snapshots, molecular dynamics trajectory snapshots, etc. You can refer to our previous works on MLIP and see how training set is constructed to cover sufficiently large space of structures. (1, 2, 3) And I don't think that stable structures are compulsory to be included, though it is a common practice, as augmentation of training set with known stable phases improve the overall reliability of MLIP to the whole system of interest. I'm happy to learn more and see if this understanding is correct. |
Beta Was this translation helpful? Give feedback.
-
Please refer to the attached image

(i) Lets say if I have three structures of Silicon (the image is attached). If I take Fd3̅m1,P6₃/mmc, and C12/m1 space group structures, do some quantum level analysis on it (like some DFT calculations) and use the same as training data for potential development, I get good results.
(ii) If I only take P6₃/mmc, and C12/m1 structures , do DFT calculations and fit in as training data for potential development, the results aren't good.By "results aren't good", I mean...the formation energy predicted via the potential isn't matching with the formation energy predicted by my DFT calculations for those structures. Even though in all this process, the RMSE errors for energy is decreasing with increasing number of epochs...for both the training energy and validation energy
The reason I feel why this happens is because in the (i) case, I have taken the structures which consists of most stable ie...Fd3̅m space group structures alongwith the metastable or the unstable structure(since they have certain energies above the hull plot). However in the (ii) case, I have only taken the metastable or the unstable structure. The stable structure hasn't been part of the training data.
I just wanted to confirm if my inference correct or not ?
Note:I always predict on structures on which I am training in.
Beta Was this translation helpful? Give feedback.
All reactions