Let be a sequence of i.i.d. Gaussian random variables with mean and unit variance defined on the probability space . Let be a random variable and let .
Without loss of generality assume . Recall that if then for we have
Since 's are assumed to be independent standard normal, we have . Analogous results can be obtained for subgaussian random variables using Hoeffding's inequality instead. We also remark that tighter bounds in (3) are possible, which require more sophisticated tools.
By independence we can write
For define
Then is a function of and hence is -measurable. By the definition of we have on the subset where is defined. It then remains to check that is well-defined, namely almost surely.
By the law of iterated logarithm, we have for almost every ,
Thus for every and there exists such that
which implies
which is greater than for sufficiently large. This means that the set in the definition of is a nonempty subset of , so almost surely.
Since partitions the sample space, we have
[1] Lattimore, Tor, and Csaba Szepesvári. Bandit algorithms. Cambridge University Press, 2020.