Announcement_11
Policy gradient can teach you how to bluff… provably! Check out my NeurIPS ’25 paper with Gabriele Farina, which settles the theoretical problem of convergence of REINFORCE-style algorithms in imperfect-information games: Policy Gradient Methods Converge Globally in Imperfect-Information Extensive-Form Games.