A Reinforcement Learning-Based Framework for Multi-Criteria Decision-Making

Authors

  • Laxminarayan Sahoo Department of Computer and Information Science, Raiganj University, Raiganj-733134, India. https://orcid.org/0000-0001-7464-451X Author
  • Sumanta Lal Ghosh Department of Computer and Information Science, Raiganj University, Raiganj-733134, India. https://orcid.org/0009-0000-3281-2132 Author

DOI:

https://doi.org/10.59543/ydm6xr12

Keywords:

Multi-Criteria decision-Making, Reinforcement Learning, Learning-to-Rank, Preference Aggregation

Abstract

Multi-Criteria Decision-Making (MCDM) methods have been applied to various ranking problems in finance, engineering, and management; however, many traditional methods rely on data normalization and fixed weights assigned to the criteria. Often, such data normalization has led to distortion of ordinal information and inconsistent results. In this study, we propose an RL-based framework for solving MCDM problems when the decision data are ranked. The ranking task was then reformulated as a learning-to-rank optimization problem, where the preference scores are assigned to alternatives and learned by maximizing agreement with a benchmark ranking. The proposed approach was first illustrated with a simple example and then applied to the real-world evaluation of banks using the CAMELS criteria. The results showed that the RL-based approach achieved greater consistency with benchmark rankings than normalization-based MCDM methods, thereby extending the applicability of rank-preserving approaches.

Downloads

Published

2026-03-07

How to Cite

Sahoo, L., & Ghosh, S. L. (2026). A Reinforcement Learning-Based Framework for Multi-Criteria Decision-Making. Intelligent Systems Research and Applications Journal, 2, 129-145. https://doi.org/10.59543/ydm6xr12

Issue

Section

Articles