The autoregressive neural network architecture of the Boltzmann distribution of pairwise interacting spins systems

التفاصيل البيبلوغرافية
العنوان: The autoregressive neural network architecture of the Boltzmann distribution of pairwise interacting spins systems
المؤلفون: Biazzo, Indaco
سنة النشر: 2023
المجموعة: Computer Science
Condensed Matter
Statistics
مصطلحات موضوعية: Condensed Matter - Disordered Systems and Neural Networks, Condensed Matter - Statistical Mechanics, Computer Science - Machine Learning, Statistics - Machine Learning
الوصف: Generative Autoregressive Neural Networks (ARNNs) have recently demonstrated exceptional results in image and language generation tasks, contributing to the growing popularity of generative models in both scientific and commercial applications. This work presents an exact mapping of the Boltzmann distribution of binary pairwise interacting systems into autoregressive form. The resulting ARNN architecture has weights and biases of its first layer corresponding to the Hamiltonian's couplings and external fields, featuring widely used structures such as the residual connections and a recurrent architecture with clear physical meanings. Moreover, its architecture's explicit formulation enables the use of statistical physics techniques to derive new ARNNs for specific systems. As examples, new effective ARNN architectures are derived from two well-known mean-field systems, the Curie-Weiss and Sherrington-Kirkpatrick models, showing superior performance in approximating the Boltzmann distributions of the corresponding physics model compared to other commonly used architectures. The connection established between the physics of the system and the neural network architecture provides a means to derive new architectures for different interacting systems and interpret existing ones from a physical perspective.
Comment: 20 pages, 10 figure plus the Supplementary Information
نوع الوثيقة: Working Paper
DOI: 10.1038/s42005-023-01416-5
URL الوصول: http://arxiv.org/abs/2302.08347
رقم الأكسشن: edsarx.2302.08347
قاعدة البيانات: arXiv
الوصف
DOI:10.1038/s42005-023-01416-5