Mathematical Language Models: A Survey

التفاصيل البيبلوغرافية
العنوان: Mathematical Language Models: A Survey
المؤلفون: Liu, Wentao, Hu, Hanglei, Zhou, Jie, Ding, Yuyang, Li, Junsong, Zeng, Jiayi, He, Mengliang, Chen, Qin, Jiang, Bo, Zhou, Aimin, He, Liang
سنة النشر: 2023
المجموعة: Computer Science
مصطلحات موضوعية: Computer Science - Computation and Language
الوصف: In recent years, there has been remarkable progress in leveraging Language Models (LMs), encompassing Pre-trained Language Models (PLMs) and Large-scale Language Models (LLMs), within the domain of mathematics. This paper conducts a comprehensive survey of mathematical LMs, systematically categorizing pivotal research endeavors from two distinct perspectives: tasks and methodologies. The landscape reveals a large number of proposed mathematical LLMs, which are further delineated into instruction learning, tool-based methods, fundamental CoT techniques, and advanced CoT methodologies. In addition, our survey entails the compilation of over 60 mathematical datasets, including training datasets, benchmark datasets, and augmented datasets. Addressing the primary challenges and delineating future trajectories within the field of mathematical LMs, this survey is positioned as a valuable resource, poised to facilitate and inspire future innovation among researchers invested in advancing this domain.
Comment: arXiv admin note: text overlap with arXiv:1705.04146, arXiv:2304.10977, arXiv:2112.00114, arXiv:1905.13319, arXiv:2304.12244, arXiv:2206.01347, arXiv:2006.09265 by other authors
نوع الوثيقة: Working Paper
URL الوصول: http://arxiv.org/abs/2312.07622
رقم الأكسشن: edsarx.2312.07622
قاعدة البيانات: arXiv