R-DDQN: Optimizing Algorithmic Trading Strategies Using a Reward Network in a Double DQN

التفاصيل البيبلوغرافية
العنوان:	R-DDQN: Optimizing Algorithmic Trading Strategies Using a Reward Network in a Double DQN
المؤلفون:	Chujin Zhou, Yuling Huang, Kai Cui, Xiaoping Lu
المصدر:	Mathematics, Vol 12, Iss 11, p 1621 (2024)
بيانات النشر:	MDPI AG, 2024.
سنة النشر:	2024
المجموعة:	LCC:Mathematics
مصطلحات موضوعية:	reinforcement learning, algorithmic trading, reward network, deep learning, Mathematics, QA1-939
الوصف:	Algorithmic trading is playing an increasingly important role in the financial market, achieving more efficient trading strategies by replacing human decision-making. Among numerous trading algorithms, deep reinforcement learning is gradually replacing traditional high-frequency trading strategies and has become a mainstream research direction in the field of algorithmic trading. This paper introduces a novel approach that leverages reinforcement learning with human feedback (RLHF) within the double DQN algorithm. Traditional reward functions in algorithmic trading heavily rely on expert knowledge, posing challenges in their design and implementation. To tackle this, the reward-driven double DQN (R-DDQN) algorithm is proposed, integrating human feedback via a reward function network trained on expert demonstrations. Additionally, a classification-based training method is employed for optimizing the reward function network. The experiments, conducted on datasets including HSI, IXIC, SP500, GOOGL, MSFT, and INTC, show that the proposed method outperforms all baselines across six datasets and achieves a maximum cumulative return of 1502% within 24 months.
نوع الوثيقة:	article
وصف الملف:	electronic resource
اللغة:	English
تدمد:	2227-7390
Relation:	https://www.mdpi.com/2227-7390/12/11/1621; https://doaj.org/toc/2227-7390
DOI:	10.3390/math12111621
URL الوصول:	https://doaj.org/article/c4727a1fa70f49b7a9b432b4d2ac3e1f
رقم الأكسشن:	edsdoj.4727a1fa70f49b7a9b432b4d2ac3e1f
قاعدة البيانات:	Directory of Open Access Journals

Find this article in full text from ProQuest

Full Text Finder

الوصف
تدمد:	22277390
DOI:	10.3390/math12111621