Embedded Cluster Expansion ========================== The embedded cluster expansion (eCE) method is a modification to the classical cluster expansion method. It uses machine-learning tools and dimensionality-reduction techniques to enable an efficient modeling of multi-component systems. Motivation ---------- Let's take a ternary system on a square lattice with species A, B, and C. An example of a configuration in this system is shown below. .. figure:: figures/configuration_example_ABC.png :height: 400 :align: center Illustrative arrangement of a ternary alloy on a square lattice with periodic boundaries. In order to construct a configurational cluster expansion of this system, let's use the occupational site basis functions. .. figure:: figures/sitebasis_example_ABC.png :width: 800 :align: center Occupational site basis functions for a ternary system composed of species A, B, and C. This basis corresponds to counting B atoms for the site basis function :math:`\varphi_1` and C atoms for the site basis function :math:`\varphi_2`. As a reminder of the documentation :doc:`cluster_expansion`, the site basis function :math:`\varphi_0` yields 1 for all species. Considering only nearest neighbor bonds, the cluster expansion reads as: .. math:: E(\sigma) &= J_0 + J_1 \langle \varphi_1 \rangle (\sigma) + J_2 \langle \varphi_2 \rangle (\sigma) + J_{11} \langle \varphi_1 \varphi_1 \rangle_{NN} (\sigma) + J_{12} \langle \varphi_1 \varphi_2 \rangle_{NN} (\sigma) + J_{22} \langle \varphi_2 \varphi_2 \rangle_{NN} (\sigma) \\ &= J_0 + J_1 x_B (\sigma) + J_2 x_C (\sigma) + J_{11} x_{BB} (\sigma) + J_{12} x_{BC} (\sigma) + J_{22} x_{CC} (\sigma) Here, :math:`x_{B}`, :math:`x_{C}`, :math:`x_{BB}`, :math:`x_{BC}`, and :math:`x_{CC}` represent the content of B atoms, C atoms, B-B bonds, B-C bonds, and C-C bonds, respectively. Now, let's take the situation where the species B and C are identical, e.g., they correspond isotopes atoms. There are at least two distinct ways to look at this situation: #. **Learn the similarity between the species through the ECI:** Swapping a B atom with a C atom does not change the energy. The interaction strength (i.e., the ECI) associated with a B atom (:math:`J_1`) or a C atom (:math:`J_2`) must be identical. Similarly, the interaction strength (i.e., the ECI) associated with a B-B bond (:math:`J_{11}`), a B-C atom (:math:`J_{12}`), or a C-C bond (:math:`J_{22}`) must be the same as well. The cluster expansion becomes: .. math:: E(\sigma) = J_0 + J_1 [x_B (\sigma) + x_C (\sigma)] + J_{11} [x_{BB} (\sigma) + x_{BC} (\sigma) + x_{CC} (\sigma)] #. **Learn the similarity between the species through the site basis functions:** Another way to tackle this situation is to learn the similarity prior to the parameterization of the ECI. The idea is to find a set of site basis functions that translates the similarity relation that the atoms B and C behaves the same. A set of site basis functions satisfying the similarity relation is shown below: .. figure:: figures/sitebasis_example_AB=C.png :width: 800 :align: center Modified occupational site basis functions for a ternary system composed of species A, B, and C, with the similarity relation that the species B and C are the same. This modified site basis function :math:`\tilde{\varphi}_1` now corresponds to counting the occupation of both B and C atoms. Notice that the number of site basis functions is reduced. Only 2 functions are necessary instead of the 3 in the case above. This reduced number of site basis functions automatically leads to a reduced number of fitting parameters (ECI). The modified cluster expansion reads as: .. math:: E(\sigma) = \tilde{J}_0 + \tilde{J}_1 \tilde{x}_{(B,C)} (\sigma) + \tilde{J}_{11} \tilde{x}_{(B,C)(B,C)} (\sigma) The second picture can be seen as projecting the 3 species in 3 dimensions onto 2 dimensions, or treating a ternary system as an pseudo-binary system. The fundamental idea behind the embedded cluster expansion is to treat a system with :math:`c` species as a system with only :math:`k` effective species, where :math:`k