
Mathematics and Decision

  • Invited speakers
  • Program
  • Book of Abstracts
  • Registration
  • Mini-Symposiums
  • Abstract submission
  • Organizers
  • Scientific committee
  • Local organizers
  • Fees
  • Housing
  • The venue
  • Flyer
  • Participants
  • Mathematics & Decision 2023

December 6, 2023 by alkhwarizmi

Reda Ouhemma

Finite-time Convergence for Decentralized Payoff-based Two-Player Zero-Sum Markov Games under weak reachability assumptions

We consider decentralized learning for two-player zero-sum Markov games, where players have limited feedback in the form of their own payoff information without knowledge of each other's actions. Convergence to a Nash equilibrium in this setting was proven under strong reachability assumptions, namely that the induced Markov chain of any stationary policy pair is irreducible and aperiodic. It remained an open problem whether, under weaker assumptions, it would be possible to achieve an approximate Nash equilibrium efficiently.

In this work, we answer this question positively and present a value-iteration-based algorithm with a Tsallis-entropy smoothing that can learn an approximate Nash equilibrium in polynomial time. Our method requires only the existence of a policy pair that induces an irreducible and aperiodic Markov chain, a considerably weaker assumption compared to previous literature. Our analysis utilizes Lyapunov drift inequalities and novel properties of Tsallis entropy that we believe to be of independent interest.
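For readers unfamiliar with the two technical notions named in the abstract, the following is a minimal illustrative sketch, not the authors' algorithm. It assumes the standard definition of Tsallis entropy, S_q(p) = (1 - sum_i p_i^q) / (q - 1), and uses the classical fact that a finite Markov chain is irreducible and aperiodic exactly when some power of its transition matrix has all entries positive (Wielandt's bound says power (n - 1)^2 + 1 suffices for n states). The function names and the default q = 0.5 are hypothetical choices made for this example only.

```python
import math


def tsallis_entropy(p, q=0.5):
    """Tsallis entropy of a probability vector p (standard definition).

    S_q(p) = (1 - sum_i p_i^q) / (q - 1); the limit q -> 1 is Shannon entropy.
    The default q is an arbitrary choice for illustration.
    """
    if abs(q - 1.0) < 1e-12:
        return -sum(x * math.log(x) for x in p if x > 0)
    return (1.0 - sum(x ** q for x in p if x > 0)) / (q - 1.0)


def is_irreducible_aperiodic(P):
    """Return True if the chain with transition matrix P is irreducible and aperiodic.

    Works on the zero/nonzero pattern of P and looks for a power of P whose
    entries are all positive, up to (slightly past) Wielandt's bound.
    """
    n = len(P)
    A = [[P[i][j] > 0 for j in range(n)] for i in range(n)]  # support of P
    M = [row[:] for row in A]                                # support of P^1
    for _ in range((n - 1) ** 2 + 1):
        if all(all(row) for row in M):
            return True
        # Boolean matrix product: support of the next power of P.
        M = [[any(M[i][k] and A[k][j] for k in range(n)) for j in range(n)]
             for i in range(n)]
    return all(all(row) for row in M)


# A deterministic two-state flip chain is irreducible but periodic.
flip = [[0.0, 1.0], [1.0, 0.0]]
# Adding a self-loop at state 0 breaks the periodicity.
lazy = [[0.5, 0.5], [1.0, 0.0]]
```

Here `is_irreducible_aperiodic(flip)` is false while `is_irreducible_aperiodic(lazy)` is true. In the abstract's terms, the earlier assumption asks for this property under every stationary policy pair, whereas the weaker one asks only that at least one policy pair induces such a chain.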
