AABLSTM: A Novel Multi-task Based CNN-RNN Deep Model for Fashion Analysis

التفاصيل البيبلوغرافية
العنوان: AABLSTM: A Novel Multi-task Based CNN-RNN Deep Model for Fashion Analysis
المؤلفون: Xianlin Zhang, Mengling Shen, Xueming Li, Xiaojie Wang
المصدر: ACM Transactions on Multimedia Computing, Communications, and Applications. 19:1-18
بيانات النشر: Association for Computing Machinery (ACM), 2023.
سنة النشر: 2023
مصطلحات موضوعية: Computer Networks and Communications, Hardware and Architecture
الوصف: With the rapid growth of online commerce and fashion-related applications, visual clothing analysis and recognition has become a hotspot in computer vision. In this paper, we propose a novel AABLSTM network, which is based on deep CNN-RNN, to solve the visual fashion analysis of clothing category classification, attribute detection, and landmark localization. The designed fashion model is leveraged with the multi-task driven mechanism as follows: firstly, a bidirectional LSTM (Bi-LSTM) branch is proposed for efficiently mining the semantic association between related attributes so as to improve the precision of clothing category classification and attribute detection; then, an imitated hourglass sub-network of “down-up sampling” is constructed for boosting the accuracy of fashion landmark localization; and finally, a specially designed multi-loss function is constructed to better optimize the network training. Extensive experimental results on large-scale fashion datasets demonstrate the superior performance of our approach.
تدمد: 1551-6865
1551-6857
URL الوصول: https://explore.openaire.eu/search/publication?articleId=doi_________::5a1306156ff24c77c865addb02bc5395
https://doi.org/10.1145/3519029
رقم الأكسشن: edsair.doi...........5a1306156ff24c77c865addb02bc5395
قاعدة البيانات: OpenAIRE