X-VARS: Introducing Explainability in Football Refereeing with Multi-Modal Large Language Models

التفاصيل البيبلوغرافية
العنوان: X-VARS: Introducing Explainability in Football Refereeing with Multi-Modal Large Language Models
المؤلفون: Held, Jan, Itani, Hani, Cioppa, Anthony, Giancola, Silvio, Ghanem, Bernard, Van Droogenbroeck, Marc
المساهمون: Telim, Montefiore Institute - Montefiore Institute of Electrical Engineering and Computer Science - ULiège, BE
المصدر: International Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), CVsports, du 17 au 21 juin 2024
سنة النشر: 2024
مصطلحات موضوعية: Soccer, Football, Referee, Refereeing, Large language model, Video assistant referee, SoccerNet, VAR, Sports, Machine learning, Engineering, computing & technology, Electrical & electronics engineering, Ingénierie, informatique & technologie, Ingénierie électrique & électronique
الوصف: The rapid advancement of artificial intelligence has led to significant improvements in automated decision-making. However, the increased performance of models often comes at the cost of explainability and transparency of their decision-making processes. In this paper, we investigate the capabilities of large language models to explain decisions, using football refereeing as a testing ground, given its decision complexity and subjectivity. We introduce the EXplainable Video Assistant Referee System, X-VARS, a multi-modal large language model designed for understanding football videos from the point of view of a referee. X-VARS can perform a multitude of tasks, including video description, question answering, action recognition, and conducting meaningful conversations based on video content and in accordance with the Laws of the Game for football referees. We validate X-VARS on our novel dataset, SoccerNet-XFoul, which consists of more than 22k video- question-answer triplets annotated by over 70 experienced football referees. Our experiments and human study illustrate the impressive capabilities of X-VARS in interpreting complex football clips. Furthermore, we highlight the potential of X-VARS to reach human performance and support football referees in the future.
نوع الوثيقة: conference paper
http://purl.org/coar/resource_type/c_5794
conferenceObject
peer reviewed
اللغة: English
Relation: https://www.soccer-net.org/
URL الوصول: https://orbi.uliege.be/handle/2268/316230
حقوق: open access
http://purl.org/coar/access_right/c_abf2
info:eu-repo/semantics/openAccess
رقم الأكسشن: edsorb.316230
قاعدة البيانات: ORBi