# Another question (related to binomial distributions) ## Problem Let $s,e,k\in \mathbb{N}^*$ and $p\in[0,1]$. We have the following experiment. $k$ times we're doing the following: * we're drawing $s$ times a random biased coin with probability $p$ to get heads * let $n$ be the number of heads * the output of the experiment is $ne$ The output of the whole experiment is the sum of outputs of the $k$ sub-experiments. Question: what is the probability that the output is smaller than $\frac{2}{3}kesp$, that is, two thirds of the expected output? ## Formalization Let $X$ the random variable for the whole experiment, and $Y_i$ the random variable for the $i$th inner experiment. We have $E[X] = \sum_{i=1}^k E[Y_i] = k \cdot e\cdot sp$. The probability that the output is $n$ is $$P[X=n]=\sum_{n_1+\dots+n_{k}=n}\prod_{i=1}^kP[Y_i=n_i]$$ And we have: * $P[Y_i=ne] = \binom{s}{n}p^n(1-p)^{s-n}$, * $P[Y_i=m]=0$ for $m$ not a multiple of $e$. We need to compute $$\sum_{n=0}^{u}P[X=n]$$ where $u := \lfloor\frac{2}{3}kesp\rfloor$. But how to compute this numerically (exactly or approximatively), or get a reasonably good upper bound? Values for $s$ and $k$ would be in the thousands (say $2^{13}$), and $e$ would be rather smaller, say $2^i$ with $0\leq i \leq 6$, and $p$ also small (say $p=10^{-5}$). ## Answer The answer from [stackexchange](https://math.stackexchange.com/questions/4598522/how-to-approach-nested-experiments-involving-binomial-distributions/): > Assuming sub-experiments are performed independently, the result $X$ of the complete experiment is $e$ times the number of heads in $s·k$ independent coin flips, so $X/e$ is $b(sk, p)$ (i.e. binomially) distributed. Then $$ \mathbb{P}[X \leq \frac{2}{3}kesp] = \mathbb{P}[X/e \leq \frac{2}{3}ksp] = \mathbb{P}[b(sk,p) \leq \frac{2}{3}ksp]$$ > Since you are interested in large values of $s·k$ and relatively small $p$, this probability can be estimated well by the [Poisson approximation](http://www.stat.yale.edu/~pollard/Courses/241.fall97/Poisson.pdf) of the binomial distribution. If $n$ is sufficiently large, you can also use the [Normal approximation](https://en.wikipedia.org/wiki/Binomial_distribution#Normal_approximation) of the binomial distribution, and this would give an estimate in terms of the CDF $Φ$ of $\mathcal{N}(0,1)$ in a relatively closed form. A says: > My distribution would be approximated by $N(skp,skp(1-p))$, and then the desired probability would be approximately $P(Z<\frac{1}{3}\sqrt{\frac{skp}{(1-p)}})$, where $Z$ is the standard normal distribution.