Dynamic Routing and Wavelength Assignment Using First Policy Iteration

Hyytiä, E.; Virtamo, J.

E. Hyytiä and J. Virtamo, Dynamic Routing and Wavelength Assignment Using First Policy Iteration, in the Fifth IEEE Symposium on Computers and Communications, ISCC'2000, pp. 146-151, 2000, Antibes, Juan les Pins, France.

Abstract: With standard assumptions the routing and wavelength assignment problem (RWA) can be viewed as a Markov Decision Process (MDP). The problem, however, defies an exact solution because of the huge size of the state space. Only heuristic algorithms have been presented up till now. In this paper we propose an approach where, starting from a given heuristic algorithm, one obtains a better algorithm by the first policy iteration. In order to estimate the relative costs of states, we make a simulation on the fly studying, at each decision epoch, the consequences of all the alternatives actions. Being computationally intensive, this method can be used in real time only for systems with slow dynamics. Off-line it can be used to assess how close the heuristic algorithms come to the optimal policy. Numerical examples are given about the policy improvement.

Links:

(C) 2000 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

BibTeX entry:

@inproceedings{hyytia-iscc-2000,
  author = {Esa Hyyti{\"a} and Jorma Virtamo},
  title = {{Dynamic Routing and Wavelength Assignment Using First Policy Iteration}},
  booktitle = {the Fifth {IEEE} Symposium on Computers and Communications, {ISCC'2000}},
  address = {Antibes, Juan les Pins, France},
  editors = {Samir Tohm{\'e} and Mehmet Ulema},
  year = {2000},
  month = {Jul.},
  pages = {146--151},
  doiopt = {10.1109/ISCC.2000.860631}
}