Try   HackMD

Concentration for sum of a random length sequence

Exercise 7.1 from Bandit Algorithms [1]

Let

(Xt)t be a sequence of i.i.d. Gaussian random variables with mean
μ
and unit variance defined on the probability space
(Ω,F,P)
. Let
T:ΩN={1,2,3,}
be a random variable and let
μ^:=1Tt=1TXt
.

  1. Show that if
    T
    and
    (Xt)t
    are independent, the for any
    δ(0,1)
    ,
    P(μ^μ2logδ1T)δ
  2. Now drop the assumption that
    T
    and
    (Xt)t
    are independent. Let
    Ft:=σ(X1,,Xt)
    . For each
    δ(0,1)
    find
    T
    such that
    {T=t}Ft
    for all
    t=1,2,
    and
    P(μ^μ2logδ1T)=1
  3. Show that in general we have
    P(μ^μ2log(T(T+1)/δ)T)δ

Without loss of generality assume

μ=0. Recall that if
YN(μ,σ2)
then for
y0
we have
P(Yy)exp(y22σ2).
Since
Xi
's are assumed to be independent standard normal, we have
μ^N(μ,1/T)
. Analogous results can be obtained for subgaussian random variables using Hoeffding's inequality instead. We also remark that tighter bounds in (3) are possible, which require more sophisticated tools.

(1)

By independence we can write

P(μ^2logδ1T)=t=1P(μ^2logδ1T|T=t)P(T=t)=t=1P(1ti=1tXt2logδ1t)P(T=t)t=1exp(2tlog1δ21t)P(T=t)=δ.

(2)

For

ωΩ define
T(ω):=inf{n1:1nt=1nXt(ω)2logδ1n}.
Then
I{T=t}
is a function of
X1,,Xt
and hence is
Ft
-measurable. By the definition of
T
we have
μ^2(logδ1)/T
on the subset
Ω
where
T
is defined. It then remains to check that
T
is well-defined, namely
T<
almost surely.

By the law of iterated logarithm, we have for almost every

ωΩ,
lim supnt=1nXt(ω)2nloglogn=1.

Thus for every
ϵ(0,1)
and
m1
there exists
nm
such that
t=1nXt(ω)2nloglogn1ϵ,
which implies
1nt=1nXt(ω)(2logδ1)/n(1ϵ)loglognlogδ1,
which is greater than
1
for
m
sufficiently large. This means that the set in the definition of
T
is a nonempty subset of
N
, so
T<
almost surely.

(3)

Since

{T=t}t1 partitions the sample space, we have
P(μ^2log(T(T+1)/δ)T)=t=1P(μ^2log(T(T+1)/δ)T and T=t)t=1P(1ti=1tXi2log(t(t+1)/δ)t)t=1exp(2log(t(t+1)/δ)t21t)=t=1δt(t+1)=δ.

References

[1] Lattimore, Tor, and Csaba Szepesvári. Bandit algorithms. Cambridge University Press, 2020.