Variational Connectionist Temporal Classification for Order-Preserving Sequence Modeling

Bibliographic Details
Title: Variational Connectionist Temporal Classification for Order-Preserving Sequence Modeling
Authors: Nan, Zheng; Dang, Ting; Sethu, Vidhyasaharan; Ahmed, Beena
Publication Year: 2023
Collection: Computer Science
Subject Terms: Computer Science - Machine Learning
Description: Connectionist temporal classification (CTC) is commonly adopted for sequence modeling tasks like speech recognition, where it is necessary to preserve order between the input and target sequences. However, CTC is only applied to deterministic sequence models, where the latent space is discontinuous and sparse, which in turn makes them less capable of handling data variability when compared to variational models. In this paper, we integrate CTC with a variational model and derive loss functions that can be used to train more generalizable sequence models that preserve order. Specifically, we derive two versions of the novel variational CTC based on two reasonable assumptions, the first being that the variational latent variables at each time step are conditionally independent; and the second being that these latent variables are Markovian. We show that both loss functions allow direct optimization of the variational lower bound for the model log-likelihood, and present computationally tractable forms for implementing them.
Comment: 5 pages, 3 figures, conference
Document Type: Working Paper
Access URL: http://arxiv.org/abs/2309.11983
Accession Number: edsarx.2309.11983
Database: arXiv
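
Note: The description above builds on the standard (deterministic) CTC likelihood, which sums over every frame-level alignment that collapses to the target sequence while preserving order. The following is a minimal sketch of that standard forward recursion in plain Python. It is not the variational loss derived in the paper; the function names `logsumexp` and `ctc_log_likelihood`, the blank index 0, and the toy probabilities are assumptions made for illustration.

```python
import math

def logsumexp(values):
    """Numerically stable log(sum(exp(v))) that tolerates -inf entries."""
    m = max(values)
    if m == -math.inf:
        return -math.inf
    return m + math.log(sum(math.exp(v - m) for v in values))

def ctc_log_likelihood(log_probs, target, blank=0):
    """Standard CTC log-likelihood log p(target | input).

    log_probs[t][k] is the log-probability of symbol k at frame t
    (hypothetical per-frame model outputs); target is a list of
    non-blank symbol indices.
    """
    # Interleave blanks: [a, b] -> [blank, a, blank, b, blank]
    ext = [blank]
    for label in target:
        ext += [label, blank]
    S = len(ext)

    # alpha[s]: log-probability of all partial alignments ending in ext[s]
    alpha = [-math.inf] * S
    alpha[0] = log_probs[0][blank]
    if S > 1:
        alpha[1] = log_probs[0][ext[1]]

    for t in range(1, len(log_probs)):
        new_alpha = []
        for s in range(S):
            cands = [alpha[s]]                 # stay on the same symbol
            if s >= 1:
                cands.append(alpha[s - 1])     # advance by one position
            # Skipping a blank is allowed only between two different labels
            if s >= 2 and ext[s] != blank and ext[s] != ext[s - 2]:
                cands.append(alpha[s - 2])
            new_alpha.append(logsumexp(cands) + log_probs[t][ext[s]])
        alpha = new_alpha

    # Valid alignments end on the last label or on the trailing blank
    return logsumexp(alpha[-2:])

# Toy usage: 3 frames, vocabulary {0: blank, 1, 2}, target sequence [1, 2]
probs = [[0.5, 0.3, 0.2], [0.2, 0.5, 0.3], [0.1, 0.2, 0.7]]
log_probs = [[math.log(p) for p in row] for row in probs]
ll = ctc_log_likelihood(log_probs, [1, 2])
```

Working in log space keeps the recursion stable for long inputs. A variational extension such as the one the description outlines would replace the deterministic per-frame outputs with quantities that depend on sampled latent variables, under either of the two stated assumptions.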