Pooled Testing for Viruses: How many tests can it save?

Pooled Testing for Viruses: How many tests can it save?

By Max Candocia


November 26, 2020

In order to help curb COVID-19, mass testing is an important step that should be implemented. It is far easier said, than done, however. Many sites and facilities are already reaching capacity, such as those around Fayetteville, AK.

While number of personnel is one limiting factor in certain locations, the number of available tests is currently a significant problem, with manufacturing limitations being a major bottleneck.

Pooled Testing

Pooled testing, the use of a single test to test multiple individuals at once, is an attractive option to overcome certain limitations with testing. World Back to Work recently announced

“By using our Pooled Testing, we can now help employers, schools, and other large groups get rapid COVID testing with 95% sensitivity rates," says Rob Wright, Co-Founder & Chief Executive Officer for World Back to Work. "But an even bigger benefit is that we can provide this testing at a fraction of the price; employers don't have to sacrifice reliability for cost. Our Pooled Testing option can be as much as 75% less than traditional PCR testing, making it far more accessible. With 24-hour lab turnaround times, our Pooled Testing helps prevent continued spread that can occur with tests requiring longer processing times for results.”

Pooled testing roughly works in this manner:

  1. Collect samples from a large number of individuals and combine them.
  2. Test the combined sample. If the result is negative, then all individuals are considered negative.
  3. If it tests positive, then each individual must be tested again individually, so you end up having one extra test.

Theoretical Efficiency of Pooled Testing

The efficiency of pooled testing is limited by the percent of the population that is infected. If a large portion of the population is infected, pooled tests can only reliably include a few individuals per test. Theoretically, if only 0.01% of the population was infected, you could have 100 individuals sampled per test, save for practical limitations (e.g., is the sample too diluted?).

plot of chunk test_heatmap

In the above graph, the brown sections have the highest efficiency, and the blue sections the lowest. The black line is the ideal number of individuals per pool for a given population infection rate, and the red line is the point at which individual tests are just as efficient as pooled testing for a given size.

As you can see, around 32% there is no theoretical gain in testing, and for smaller infection rates, the efficiency of the tests increases quite a bit. An alternate way of looking at this is the efficiency of the tests over the population size:

plot of chunk linear_efficiency_chart

With an infection rate of 0.01%, it only takes one test to test 50 individuals on average, with a pool size of about 100, but for 0.1%, that number goes down to 15, for 1%, 5, and at 10% it is about 2.

Leon Mutesa, a geneticist at the University of Rwanda in Kigali, and another co-author who is part of the government task force, says that he has been able to identify one positive sample in a pool of 100 in the lab.
Sigrun Smola, a molecular virologist at Saarland University Medical Center in Homburg, Germany, who has been testing samples in groups of up to 20, doesn’t recommend grouping more than 30 samples in one test, to ensure sufficient accuracy. Larger groups will make it harder to detect the virus, and increase the chances of missing positives, she says.

Additional Testing Methods

There are different methods that sampling can be done. The above was the simplest way, which only requires 2 rounds of testing at most, and was used earlier this year in China. For the purposes of public health, an ideal pooling schema should be limited to as few rounds as possible to increase the speed at which results are returned.

These methods are generally more complicated, and from what I gathered, have drawbacks when multiple samples per pool are positive. They do increase efficiency, either through fewer tests, fewer rounds, or an efficiency increase. One other drawback of many of these is that they require a minimum number of samples to be taken for the scheme to work most efficiently.


Pooled testing can be a great way to boost efficiency of mass testing, but it is severely limited by the rate of infection for a given population. For populations with low infection rates, it is a good option if available, but for populations with high infection rates, it is not a viable option.


The above figures were made with the assumption that a pooled sample would have the same false negative rate as an individual sample, and that the specific testing method would only do one round of pooled testing, as opposed to successive rounds (e.g., splitting a population of 100 into two samples of 50 if a positive is found). In theory, successive sampling would increase efficiency, but because testing is time-sensitive, I made the assumption that only one round of pooled testing is used, and that all individuals would be tested if the pool came back positive.

Additionally, if individuals in a pool are from a similar geographic region, (e.g., a neighborhood), then this method becomes more efficient, since more than one positive is likely to be detected, due to the localized nature of infection outbreaks.

Equipment used to sample individuals (e.g., cotton swabs) are not considered "test equipment" in the above. It is assumed that sampling equipment is not the limiting factor in the ability to test on a mass scale.

I am not a medical professional/researcher. My expertise is in statistics and data science, so there may be additional considerations and limitations I am not aware of. If you have any feedback, please email me at [myfirstname][mylastname]@[gmail dot com].

Notes on Math Behind Graph

The statistics in the graph are the expected value of the number of tests needed for a given population, given an infection rate and a number of individual samples per pool.


  1. Mak, Aron. Why There are Suddenly not Enough Covid Tests. The Slate Group. 2020-11-24. https://www.msn.com/en-us/health/medical/why-there-are-suddenly-not-enough-covid-tests/ar-BB1bjZmf
  2. Mallapaty, Smriti. The mathematical strategy that could transform Covid testing. 2020-07-10. nature.com. https://www.nature.com/articles/d41586-020-02053-6
  3. Pooled Sample Testing and Screening Testing for COVID-19. Food and Drug Administration. https://www.fda.gov/medical-devices/coronavirus-covid-19-and-medical-devices/pooled-sample-testing-and-screening-testing-covid-19
  4. Wilson, Megan. COVID-19 test facilities reach capacity as testing increases. Nextar, Inc. 2020-11-18. https://www.nwahomepage.com/news/covid-19-test-facilities-reach-capacity-as-testing-increases/
  5. World Back to Work. World Back to Work's COVID 19 RT-PCR POOLED TESTING Option Helps Reduce Cost and Provides Rapid Results. 2020-11-19. https://www.prnewswire.com/news-releases/world-back-to-works-covid-19-rt-pcr-pooled-testing-option-helps-reduce-cost-and-provides-rapid-results-301177411.html

Source Code

You can find the source code for this article on https://github.com/mcandocia/pooled-testing.


Recommended Articles

How Would the US Vote for a Candy?

What would it look like if people across the US voted for a candy? Explore different results using different voting methods and different types of representation, such as a national vote versus the Electoral College.

Converting fields of lists into wide and tall formats in R

If you have ever downloaded survey data, or any other kind of data, that has a field that is itself comma-separated, you may have found it annoying/difficult to reshape the data into a more useful form.