package obandit

  1. Overview
  2. Docs

The WrapRange01 functor is a convenience aliasing of WrapRange with an initial "standard" range of 0,1.

Parameters

module P : BanditParam
module B (Pb : BanditParam) : Bandit

Signature

val getAction : float -> int

A Mutable bandit.

The getAction function mutates the bandit one step further in the bandit game. The argument is the reward for the last action and the result is the next action. Rewards are floats in 0,1 and actions are integers in 0,n-1. The first reward is discarded. In order to use rewards larger than 1, please use the WrapDoubling functor.

OCaml

Innovation. Community. Security.