Abstract: In this article, we present a model-free output feedback (OPFB) Q-learning algorithm to find the optimal Nash equilibrium strategy for the decentralized control problem (DCP) of nonzero-sum ...
Genome assembly remains an unsolved problem, and de novo strategies (i.e., those run without a reference) are relevant but computationally complex tasks in genomics. Although de novo assemblers have ...
Abstract: Successive Over-Relaxation Q-learning (SOR-QL) has been proposed recently as an alternative to the widely popular Q-learning algorithm as it is seen to provide better performance where ...
Finding the shortest path in a network is a classical problem, and a variety of search strategies have been proposed to solve it. In this paper, we review traditional approaches for finding shortest ...