Mathematical programming for influence diagrams - École des Ponts ParisTech Accéder directement au contenu
Pré-Publication, Document De Travail Année : 2019

Mathematical programming for influence diagrams

Résumé

Influence Diagrams (ID) are a flexible tool to represent discrete stochastic optimization problems, including Markov Decision Process (MDP) and Partially Observable MDP as standard examples. More precisely, given random variables considered as vertices of an acyclic digraph, a probabilistic graphical model defines a joint distribution via the conditional distributions of vertices given their parents. In ID, the random variables are represented by a probabilistic graphical model whose vertices are partitioned into three types : chance, decision and utility vertices. The user chooses the distribution of the decision vertices conditionally to their parents in order to maximize the expected utility. Leveraging the notion of rooted junction tree, we present a mixed integer linear formulation for solving an ID, as well as valid inequalities, which lead to a computationally efficient algorithm. We also show that the linear relaxation yields an optimal integer solution for instances that can be solved by the "single policy update", the default algorithm for addressing IDs.

Dates et versions

hal-02043198 , version 1 (20-02-2019)

Identifiants

Citer

Axel Parmentier, Victor Cohen, Vincent Leclère, Guillaume Obozinski, Joseph Salmon. Mathematical programming for influence diagrams. 2019. ⟨hal-02043198⟩
169 Consultations
0 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More