Variational Autoencoder Toolkit

Translationally Invariant VAE

In this approach, the latent space is composed of the intrinsic latent variables along with two additional components representing the translation vector (txt_x, tyt_y). As a result, the dimensionality of the latent space becomes N+2N+2, where NN corresponds to the number of intrinsic latent variables. This framework allows us to disentangle and analyze both the intrinsic factors of variation and the translational symmetry within the data.

Source:tVAE
(a) The latent manifold generated from the tVAE demonstrates the reconstructed structure across the latent space. (b) The latent space distribution with KDE highlights clusters.

Figure 1:(a) The latent manifold generated from the tVAE demonstrates the reconstructed structure across the latent space. (b) The latent space distribution with KDE highlights clusters.

We observe well-defined clusters, which strongly suggest the presence of underlying physical principles governing the system.

Source:tVAE
Latent maps generated from the tVAE.

Figure 2:Latent maps generated from the tVAE.

The latent maps reveal fascinating insights. The translation vectors closely resemble the polarization components, aligning with the underlying physics of ferroelectrics, where the polarization arises from the shift of the central atom relative to the corner atoms. Simultaneously, the intrinsic latent variables z1z_1, and z2z_2 exhibit chemical contrast and mistilt effects, with minor leakage of polarization information into their representation.