Researchers propose methods that theoretically guarantee minimal loss for worst case scenarios with minimal prior information for heavy-tailed reward distributions The exploration algorithms for ...
This paper considers the construction of an upper confidence bound on the range of a set of p (⩾ 2) location parameters following a sequential range test. Although details are given only for the ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results
Feedback