SPARQL Generation with Entity Pre-trained GPT for KG Question Answering

التفاصيل البيبلوغرافية
العنوان: SPARQL Generation with Entity Pre-trained GPT for KG Question Answering
المؤلفون: Bustamante, Diego, Takeda, Hideaki
سنة النشر: 2024
المجموعة: Computer Science
مصطلحات موضوعية: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Databases, Computer Science - Information Retrieval, 68P20, 68T50, H.2.3, H.3.3, I.2.7
الوصف: Knowledge Graphs popularity has been rapidly growing in last years. All that knowledge is available for people to query it through the many online databases on the internet. Though, it would be a great achievement if non-programmer users could access whatever information they want to know. There has been a lot of effort oriented to solve this task using natural language processing tools and creativity encouragement by way of many challenges. Our approach focuses on assuming a correct entity linking on the natural language questions and training a GPT model to create SPARQL queries from them. We managed to isolate which property of the task can be the most difficult to solve at few or zero-shot and we proposed pre-training on all entities (under CWA) to improve the performance. We obtained a 62.703% accuracy of exact SPARQL matches on testing at 3-shots, a F1 of 0.809 on the entity linking challenge and a F1 of 0.009 on the question answering challenge.
Comment: 7 pages, 1 figure, 2 tables. For the implementation, see https://github.com/DiegoEmilio01/SPARQL-generation-with-entity-pre-trained-GPT-for-KG-Question-Answering
نوع الوثيقة: Working Paper
URL الوصول: http://arxiv.org/abs/2402.00969
رقم الأكسشن: edsarx.2402.00969
قاعدة البيانات: arXiv