Exploring uplift modeling with high class imbalance

التفاصيل البيبلوغرافية
العنوان: Exploring uplift modeling with high class imbalance
المؤلفون: Otto Nyberg, Arto Klami
المساهمون: Department of Computer Science, Helsinki Institute for Information Technology
سنة النشر: 2023
مصطلحات موضوعية: Computer Networks and Communications, Undersampling, Heterogeneous treatment effect, 113 Computer and information sciences, Uplift modeling, Computer Science Applications, Information Systems, High class imbalance
الوصف: Uplift modeling refers to individual level causal inference. Existing research on the topic ignores one prevalent and important aspect: high class imbalance. For instance in online environments uplift modeling is used to optimally target ads and discounts, but very few users ever end up clicking an ad or buying. One common approach to deal with imbalance in classification is by undersampling the dataset.In this work, we show how undersampling can be extended to uplift modeling.We propose three new undersampling methods for uplift modeling and one novel calibration method.We compare the proposed methods empirically and show when some methods have a tendency to break down.One key observation is that accounting for the imbalance is particularly important for uplift random forests, which explains the poor performance of the model in earlier works. Undersampling is also crucial for class-variable transformation based models.
اللغة: English
URL الوصول: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::d77f9d1b1e110c114055461bd6d1b705
http://hdl.handle.net/10138/355378
حقوق: OPEN
رقم الأكسشن: edsair.doi.dedup.....d77f9d1b1e110c114055461bd6d1b705
قاعدة البيانات: OpenAIRE