Skip to content

lorrespz/nqs_hyperbolic_rnn

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

96 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Hyperbolic recurrent neural network as the first type of non-Euclidean neural quantum state ansatz

This is the GitHub repo for the work https://arxiv.org/abs/2505.22083, featured in Quantum Zeitgeist (May 29, 2025):

https://quantumzeitgeist.com/hyperbolic-neural-networks-enhance-quantum-many-body-problem-solutions/

  • In this work, we introduce hyperbolic GRU as the first type of non-Euclidean neural quantum state (NQS) for quantum many-body systems in condensed matter physics. We investigate the viability and performance of hyperbolic GRU NQS ansatz in 4 prototypical settings for many-body quantum physics: one-dimensional & two-dimensional transverse field Ising models, one-dimensional Heisenberg $J_1J_2$ and $J_1J_2J_3$ models. In each of these 4 settings, hyperbolic GRU is benchmarked against different variants of Euclidean RNNs (either RNN or GRU), as well as against the exact results provided by DMRG (Density Matrix Renormalization Group) method. The results of running the Variational Monte Carlo (VMC) experiments using hyperbolic GRU as well as Euclidean RNNs NQS ansatzes for each of the 4 settings are shown graphically below (Each dot with error bar in the graphs represents the mean energy with standard error of an NQS ansatz - The closer the dot is to the red horizontal line representing the exact ground state energy, the better the NQS is at approximating the ground state energy).

  • The main Python scripts are located in the utility directory, with other directories, corresponding to a Hamiltonian system under study, containing the Jupyter training notebooks (directory training) as well as the neural network trained weights (folder results). All neural networks were built with TF2 (Tensorflow 2), based on the TF1 (Tensorflow 1) codes from:

1D TFIM

  • In the 1D TFIM setting, with the number of spins N being 20, 40, 80 and 100, where the Hamiltonian comprises only the nearest neighbor interaction and a transverse magnetic field, with no hierarchical interaction structure, three out of four times, hyperbolic GRU does not outperform either Euclidean RNN or GRU.

2D TFIM

  • In the 2D TFIM setting, with the number of size of the 2D square lattice $(N_x, N_y)$ being (5, 5), (7, 7), (8, 8), (9, 9) corresponding to N = 25, 49, 64, 81 spins, where the Hamitonian comprises the horizontal and vertical nearest neighbor interaction in two dimensions and a transverse magnetic field, 2D Euclidean RNN always emerge as the best ansatz, followed by 1D hyperbolic GRU, which almost always outperforms 1D Euclidean GRU, except for a single instance in the training result for $(N_x,N_y)$ = (8,8). In this case, due to the artificial rearrangement or reshaping/folding of the 1D spin chain to mimic the 2D lattice, a hierarchy in the interaction structure, comprising the first and the Nth nearest neighbor interactions, is introduced when originally faraway spins (at site i and site $i + N$) in the 1D chains become immediate vertical neighbors. The fact that the hyperbolic GRU NQS ansatz outperformed the Euclidean GRU NQS ansatz for this setting is our first clue of the role played by the hyperbolic geometry of the neural network in determining its performance when there is a hierarchical structure in the Hamiltonian.

1D Heisenberg J1J2

  • In the 1D Heisenberg $J_1J_2$ setting with N = 50 spins, $J_1 = 1.0$ (fixed) and $J_2$ = 0.0, 0.2, 0.5, 0.8, hyperbolic GRU definitively outperforms Euclidean GRU three out of four times when the $J_1J_2$ Hamiltonian exhibits a hierarchical interaction structure when $J_2\neq 0$ which takes into account the second nearest neighbor interactions in addition to the first nearest neighbor interactions. This is our second clue of the role played by the hyperbolic geometry of the neural network.

1D Heisenberg J1J2J3

  • In the 1D Heisenberg $J_1J_2J_3$ setting with N = 30 spins, $J_1 = 1.0$ (fixed) and $(J_2, J_3) = (0.0, 0.5), (0.2, 0.2), (0.2,0.5), (0.5,0.2)$, hyperbolic GRU definitively outperforms Euclidean GRU four of four times. In this setting, the $J_1J_2J_3$ Hamiltonian exhibits even more pronounced hierarchical interaction structures comprising the first, second and third nearest neighbors. This is our third clue.

About

Github repo for the work https://arxiv.org/abs/2505.22083. Neural quantum states using Euclidean RNN/GRU and hyperbolic GRU variational ansatzes applied to 1D and 2D quantum many body systems (1D & 2D Transverse Ising models, 1D Heisenberg J1J2 and J1J2J3 models.)

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors