Scalable Detection of Salient Entities in News Articles

التفاصيل البيبلوغرافية
العنوان: Scalable Detection of Salient Entities in News Articles
المؤلفون: Asgarieh, Eliyar, Thadani, Kapil, O'Hare, Neil
سنة النشر: 2024
المجموعة: Computer Science
مصطلحات موضوعية: Computer Science - Computation and Language
الوصف: News articles typically mention numerous entities, a large fraction of which are tangential to the story. Detecting the salience of entities in articles is thus important to applications such as news search, analysis and summarization. In this work, we explore new approaches for efficient and effective salient entity detection by fine-tuning pretrained transformer models with classification heads that use entity tags or contextualized entity representations directly. Experiments show that these straightforward techniques dramatically outperform prior work across datasets with varying sizes and salience definitions. We also study knowledge distillation techniques to effectively reduce the computational cost of these models without affecting their accuracy. Finally, we conduct extensive analyses and ablation experiments to characterize the behavior of the proposed models.
نوع الوثيقة: Working Paper
URL الوصول: http://arxiv.org/abs/2405.20461
رقم الأكسشن: edsarx.2405.20461
قاعدة البيانات: arXiv