[1]
“Combining Reinforcement Learning and Human Feedback for AI System Optimization”, IJSR, vol. 2, no. 1, pp. 31–39, Mar. 2024, Accessed: Feb. 02, 2026. [Online]. Available: https://ijsupport.com/index.php/ijsrs/article/view/12