Density and Orbital Features in CiderPress

To predict the exchange-correlation energy \(E_{\text{xc}}\), we need to train a machine learning model \(e_{\text{xc}}(\mathbf{x})\) such that

\[E_{\text{xc}} = \int \text{d}^3\mathbf{r}\, e_\text{xc}(\mathbf{x}[n_1](\mathbf{r}))\]

where \(\mathbf{x}[n_1](\mathbf{r})\) is a position-dependent feature vector that is a functional of the density \(n(\mathbf{r})\) in “pure” Kohn-Sham DFT and a functional of the density matrix \(n_1(\mathbf{r}, \mathbf{r}')\) in the case of “generalized” Kohn-Sham DFT. Cider provides several types of feature that can be used as input to the ML model. These features can be divided into four categories:

  • Semilocal Features (SL): Same features as in conventional GGA/meta-GGA functionals (i.e. \(n\), \(\nabla n\), \(\tau\)). NOTE: All Cider models must include semilocal features. They need not be used explicitly in the model, but evaluating them is required to evalute baseline functionals and other quantities in the code.

  • Nonlocal Density Features (NLDF): These features are constructed by integrating the density over a real-space kernel function to characterize the shape of the density around a point \(\mathbf{r}\).

  • Nonlocal Orbital Features (NLOF): EXPERIMENTAL, NOT TESTED, NOT RECOMMENDED FOR USE. One coordinate of the density matrix is operated on by a fractional Laplacian.

  • Smooth Density Matrix Exchange (SDMX): One coordinate of the density matrix is convolved at different length scales. Then these convolutions are contracted to approximately characterize the shape of the density matrix around a point \(\mathbf{r}\).

The set of features to be used in a model is specified using the FeatureSettings object. To see the code API for setting up feature settings, see the Settings module section. To see mathematical descriptions and physical intuition for the different types of features, see the subsections below.