GPTopic: Dynamic and Interactive Topic Representations

التفاصيل البيبلوغرافية
العنوان: GPTopic: Dynamic and Interactive Topic Representations
المؤلفون: Reuter, Arik, Thielmann, Anton, Weisser, Christoph, Fischer, Sebastian, Säfken, Benjamin
سنة النشر: 2024
المجموعة: Computer Science
مصطلحات موضوعية: Computer Science - Computation and Language
الوصف: Topic modeling seems to be almost synonymous with generating lists of top words to represent topics within large text corpora. However, deducing a topic from such list of individual terms can require substantial expertise and experience, making topic modelling less accessible to people unfamiliar with the particularities and pitfalls of top-word interpretation. A topic representation limited to top-words might further fall short of offering a comprehensive and easily accessible characterization of the various aspects, facets and nuances a topic might have. To address these challenges, we introduce GPTopic, a software package that leverages Large Language Models (LLMs) to create dynamic, interactive topic representations. GPTopic provides an intuitive chat interface for users to explore, analyze, and refine topics interactively, making topic modeling more accessible and comprehensive. The corresponding code is available here: https://github.com/ArikReuter/TopicGPT.
نوع الوثيقة: Working Paper
URL الوصول: http://arxiv.org/abs/2403.03628
رقم الأكسشن: edsarx.2403.03628
قاعدة البيانات: arXiv