21 - Group Testing

  • Collection: Easy/fast/cheap
  • Testing: Difficult/slow/expensive
  • Imagine a population of people
    P1,...,Pn
    and
    k
    people are infected.
  • We can combine samples from the infected population and test them for the virus but is slow
  • We also now that we have enough samples for each person and that we can combine multiple people's samples into a single test
  • In these multi-person test's if any of the samples have the virus, the whole thing will come up positive

Ex:
k=1

Let's say

n=1000000 and
k=1

  • Make
    1000
    groups of
    1000
    people each
  • Test each group (1000 tests)
  • Exactly one group's test should come up positive call it
    G
  • Individually test each sample in
    G
    (1000 tests)
  • Exactly one of those should come up positive: total 2000 tests

Single pooling

Back to original scenario

  • Partition
    P1,...,Pn
    into
    ns
    groups
    G1,G2,...,Gns
    each of size
    s
    • Test each group
      Gins
      tests
    • If
      Gi
      tests positive the individuals in
      GiS
      tests
  • P1,...,Pn
    is a random permutation of
    n
    people
    • Image Not Showing Possible Reasons
      • The image file may be corrupted
      • The server hosting the image is unavailable
      • The image path is incorrect
      • The image format is not supported
      Learn More →
    • S= the set of permutations of n people
    • Pr(ω)=1|S|=1n!
      Probability of getting and single sample space
  • Let
    X
    be the number of tests performed, what is
    E(X)
    ?
    • We know that
      X
      must be greater than or equal to
      ns
      because we do
      ns
      tests
    • X=ns+?
  • For each
    i{1,...,n}
    , define an indicator random variable
    Xi
    • Xi=1
      if
      Pi
      's sample is tested individually
    • Xi=0
      otherwise
  • X=ns+i=0nXi

We need to distinguish between those are infected and those who aren't.

I={i:Pi is infected}
N={1,...,N}I
(Not infected)
|I|=k

|N|=nk

Now we can change how we thinking about the sum

  • X=ns+i=0nXi+i=0nXi
    • The total number of tests is equal to the initial number of tests plus the tests on all the people are infected in the second tests and the tests on people who aren't
    • But people who are infected will always be tested so:
  • X=ns+i=0n1+i=0nXi
  • Need to find out expected number of people who will not be infected but be in a group that has an infected person
  • When is
    Xi=1
    ?
  • Define
    Ai= no one in Pi's group is infected
  • Ai¯= someone in Pi's group is infected=Xi=1
  • E(Xi)=Pr(Xi=1)=1Pr(Ai)
  • What is
    Pr(Ai)
    ?
  • Pr(Ai)=|Ai||S|=n!
  • Procedure:
    • Image Not Showing Possible Reasons
      • The image file may be corrupted
      • The server hosting the image is unavailable
      • The image path is incorrect
      • The image format is not supported
      Learn More →
  • Pr(Ai)=n(nsk)k!(nk1)!n!
  • Image Not Showing Possible Reasons
    • The image file may be corrupted
    • The server hosting the image is unavailable
    • The image path is incorrect
    • The image format is not supported
    Learn More →
  • Image Not Showing Possible Reasons
    • The image file may be corrupted
    • The server hosting the image is unavailable
    • The image path is incorrect
    • The image format is not supported
    Learn More →
  • Image Not Showing Possible Reasons
    • The image file may be corrupted
    • The server hosting the image is unavailable
    • The image path is incorrect
    • The image format is not supported
    Learn More →
  • E(X)=E(ns+i=1nXi)=ns+i=In1+i=NE(Xi)
  • =ns+k+(nk)(1(nsk)(n1k))
  • Expected number of tests we have to perform.

Multiple pooling

Image Not Showing Possible Reasons
  • The image file may be corrupted
  • The server hosting the image is unavailable
  • The image path is incorrect
  • The image format is not supported
Learn More →

  • For each group $G_{i,j}, test group
    Gi,j
    (cns)
    tests
  • For each person
    x
    :
    • If everyone one of
      x
      's groups (all
      c
      of them) tests positive, test
      x

Image Not Showing Possible Reasons
  • The image file may be corrupted
  • The server hosting the image is unavailable
  • The image path is incorrect
  • The image format is not supported
Learn More →

Image Not Showing Possible Reasons
  • The image file may be corrupted
  • The server hosting the image is unavailable
  • The image path is incorrect
  • The image format is not supported
Learn More →

Image Not Showing Possible Reasons
  • The image file may be corrupted
  • The server hosting the image is unavailable
  • The image path is incorrect
  • The image format is not supported
Learn More →

tags: COMP2804 Probability