Towards an Unequivocal Representation of Actions

التفاصيل البيبلوغرافية
العنوان: Towards an Unequivocal Representation of Actions
المؤلفون: Wray, Michael, Moltisanti, Davide, Damen, Dima
سنة النشر: 2018
المجموعة: Computer Science
مصطلحات موضوعية: Computer Science - Computer Vision and Pattern Recognition
الوصف: This work introduces verb-only representations for actions and interactions; the problem of describing similar motions (e.g. 'open door', 'open cupboard'), and distinguish differing ones (e.g. 'open door' vs 'open bottle') using verb-only labels. Current approaches for action recognition neglect legitimate semantic ambiguities and class overlaps between verbs (Fig. 1), relying on the objects to disambiguate interactions. We deviate from single-verb labels and introduce a mapping between observations and multiple verb labels - in order to create an Unequivocal Representation of Actions. The new representation benefits from increased vocabulary and a soft assignment to an enriched space of verb labels. We learn these representations as multi-output regression, using a two-stream fusion CNN. The proposed approach outperforms conventional single-verb labels (also known as majority voting) on three egocentric datasets for both recognition and retrieval.
نوع الوثيقة: Working Paper
URL الوصول: http://arxiv.org/abs/1805.04026
رقم الأكسشن: edsarx.1805.04026
قاعدة البيانات: arXiv