دورية أكاديمية

Improved N-Best Extraction with an Evaluation on Language Data

التفاصيل البيبلوغرافية
العنوان: Improved N-Best Extraction with an Evaluation on Language Data
المؤلفون: Johanna Björklund, Frank Drewes, Anna Jonsson
المصدر: Computational Linguistics, Vol 48, Iss 1, Pp 119-153 (2022)
بيانات النشر: The MIT Press, 2022.
سنة النشر: 2022
المجموعة: LCC:Computational linguistics. Natural language processing
مصطلحات موضوعية: Computational linguistics. Natural language processing, P98-98.5
الوصف: AbstractWe show that a previously proposed algorithm for the N-best trees problem can be made more efficient by changing how it arranges and explores the search space. Given an integer N and a weighted tree automaton (wta) M over the tropical semiring, the algorithm computes N trees of minimal weight with respect to M. Compared with the original algorithm, the modifications increase the laziness of the evaluation strategy, which makes the new algorithm asymptotically more efficient than its predecessor. The algorithm is implemented in the software Betty, and compared to the state-of-the-art algorithm for extracting the N best runs, implemented in the software toolkit Tiburon. The data sets used in the experiments are wtas resulting from real-world natural language processing tasks, as well as artificially created wtas with varying degrees of nondeterminism. We find that Betty outperforms Tiburon on all tested data sets with respect to running time, while Tiburon seems to be the more memory-efficient choice.
نوع الوثيقة: article
وصف الملف: electronic resource
اللغة: English
تدمد: 0891-2017
1530-9312
Relation: https://direct.mit.edu/coli/article/48/1/119/108848/Improved-N-Best-Extraction-with-an-Evaluation-on; https://doaj.org/toc/0891-2017; https://doaj.org/toc/1530-9312
DOI: 10.1162/coli_a_00427
URL الوصول: https://doaj.org/article/04be039535e94327bfda4290c825083e
رقم الأكسشن: edsdoj.04be039535e94327bfda4290c825083e
قاعدة البيانات: Directory of Open Access Journals
الوصف
تدمد:08912017
15309312
DOI:10.1162/coli_a_00427