[1] “Combining Reinforcement Learning and Human Feedback for AI System Optimization,” IJSR, vol. 2, no. 1, pp. 31–39, Mar. 2024, Accessed: Jul. 23, 2025. [Online]. Available: https://ijsupport.com/index.php/ijsrs/article/view/12