"Many applications require efficient methods for automated decision making, such as control systems, crisis response, finance, logistics, network security, robotics and traffic management. These problems involve sequential learning and decision making under uncertainty in an unknown environment. As we have incomplete information about the state and dynamics of the environment, the outcome of any ...