Let be the output length in bits of a hash function , and mutually distinct message be randomly selected to test collision .
Estimate the probability of "at least one collision" to be . Show that when is large, the required number of messages is approximately
Assume is fixed. The probability of no collision, , is
The Taylor series of at suggests that . The probability can be expressed approximately as
or
Solving for yields the resulting approximation.
For Birthday problem, and . Compute . Inserting them into above result yields
In general, when there are choices, we anticipate 50% chance of presence of collisions if the number of random tests is above