Free AI web copilot to create summaries, insights and extended knowledge, download it at here

Abstract

test result to change an initial belief into a final belief. Next, we look at diagnostic tests with different levels of reliability, and see how their results might affect the beliefs of doctors.</p><h1 id="5776">The Application of Diagnostic Tests in Medicine</h1><p id="cc86" type="7">You followed the doctor’s advice and got a diagnostic test. The doctor now receives a positive or negative result for coronavirus and now needs to make a decision:</p><p id="78c6" type="7">1. Prescribe any necessary medication and let you go.</p><p id="d8c0" type="7">2. Order you to be quarantined.</p><h2 id="cf57">Decision Making in Medicine — Reducing Uncertainty And Taking Acceptable Risks</h2><p id="20f2"><b>To decide when to treat, the doctor needs to have confidence in a diagnosis.</b></p><p id="b288">In general medical practice, a doctor can make two mistakes:</p><ol><li>You are not infected, but the doctor decides that you are, and gives treatment.</li><li>You are infected, but the doctor decides that you aren’t, and does not give you the proper treatment / lets you go.</li></ol><p id="9b22">In order to avoid these mistakes, the doctor orders a test. They had a prior belief, and the test is used as information to reduce uncertainty. Once the doctor has enough certainty, the doctor can make a recommendation.</p><p id="e024"><b>How much uncertainty can be reduced by a test?</b></p><p id="e06b">We first look at a reliable test. Consider a test that has sensitivity and specificity at 98%. We show what the prior and posterior probabilities of infection are, given a positive or negative result.</p><figure id="4611"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*Bd4OverqkSfGzgd9ZfM6Og.png"><figcaption></figcaption></figure><ul><li>If you think the probability of someone being infected is 80%, then a positive result takes it to near 100% (near certain infection), while a negative result reduces the probability to under 10% (unlikely infection).</li><li>Similarly, if you think that the probability of someone being infected is only 20%, then a positive result increases it to over 90% (likely infection), while a negative result takes it to near 0% (infection is nearly impossible).</li><li>Any initial probability which is greater than 10% increases to a higher probability (>80%) after a positive result.</li><li>Any initial probability which is less than 90% decreases to a lower probability(<20%) after a negative result.</li><li>Since the test is able to change a large range of initial probabilities into final probabilities near 0% and 100%, it resolves the uncertainty of whether a patient is infected or not. Hence it is a reliable diagnostic tool.</li></ul><p id="0b51"><b>The Ideal test</b></p><figure id="b7c8"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*LWzStJMvj_VVdXQh8nEUNA.png"><figcaption></figcaption></figure><ul><li>The orange curve (bottom, for negative result) is very close to the x-axis, which would mean that a negative result virtually guarantees a zero probability of infection.</li><li>The blue curve (top, for positive result) is straight up the y-axis and right, which would mean that a positive result virtually guarantees a certain probability of infection.</li></ul><p id="feb6"><b>Diagnostic tests are not perfect — bad luck exists, decisions are always risks.</b></p><p id="02b9">If the doctor has a reliable test, then a prior belief of 50% can be reduced to less than 5%, or some other very low figure. Unfortunately, these tests are not perfect. There is still a non-zero probability of actually being infected, so the doctor can make a mistake out of sheer bad luck.</p><p id="c141"><b>How much risk is tolerable?</b></p><p id="5447">Because the doctor ultimately has to make a decision, even under uncertainty, the doctor has to decide what is an acceptable risk.</p><p id="fd06">Examples of where risk is unacceptable:</p><ul><li>Near-certainty is required before starting a harsh treatment with irreversible side effects.</li><li>A very low probability of having a lethal infectious disease is required before the patient is released.</li></ul><p id="93ff">Examples of where risk is more acceptable:</p><ul><li>Less certainty is needed if the condition is mild and not life-threatening, and the treatment does not have permanent effects.</li><li>Less certainty is needed if the patient is in immediate mortal danger and requires medical intervention as soon as possible (the potential harm of treatment is less than the harm of no treatment).</li></ul><h2 id="b0bb">Decision Making Without Reliable Diagnostic Tests</h2><p id="6cf1">(From here, we switch focus from general medicine, back to the coronavirus outbreak.)</p><p id="c12d">We’ve established that a doctor needs to resolve uncertainty using a diagnostic test before they can have the confidence to take an acceptable risk.</p><p id="8369">However, what happens if the diagnostic test is not reliable?</p><p id="b4ff">We try to use realistic values. <a href="epmonthly.com/article/accurate-rapid-flu-tests/">Many flu tests have a specificity of around 90–95%.</a> Keeping in line with this, I assume a 90% specificity (feel free to correct me if you have better information).</p><p id="beb9">I assume a 40% sensitivity. This turns out to be close to the reported sensitivity for COVID-19 coronavirus tests.</p><figure id="89a8"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*E0zX1n44R_d7hNBNXd4L8w.png"><figcaption></figcaption></figure><p id="eda5">Suppose that a patient lives in an area which recently had an outbreak, and is now presenting to the medical clinic with symptoms of coronavirus. If the doctor deems the probability of a COVID-19 infection to be 50% (likely infection), and orders this unreliable test for the patient:</p><ul><li>A positive result increases the probability to 80% (likely infection)</li><li>A negative result decreases the probability to a value to 40% (possible infection).</li><li>40% probability of infection is still too high. Given that the virus is contagious and lethal, releasing the patient is an unacceptable risk.</li></ul><p id="c275">With the reliable test:</p><ul><li>A 50% probability of infection can be decreased to 2% with a negative result.</li><li>A COVID-19 infection can potentially be ruled out (if 2% is acceptable risk).</li></ul><p id="7a77">Using a single unreliable test, a patient‘s probability of infection cannot be reduced to an acceptable level, regardless of a positive or negative result.</p><p id="0c00"><b>The unreliable test cannot rule out infection.</b></p><figure id="342f"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*tpfEPIfEnAlSSCog3J6fjQ.gif"><figcaption>\What happens as a test gets less and less sensitive? Negative results are less meaningful — the orange line (negative result) quickly approaches the straight green line (no test performed).</figcaption></figure><h2 id="3f2c">No decision making (for epidemics) without reliable diagnostic tests</h2><p id="c3f1">Now imagine a scenario where people are quarantined until the authorities are confident that they are not infected. If the test is unreliable, then it cannot prove that there is no infection. In other words, the test has no consequences on decision making — the patient will remain in quarantine either way.</p><p id="b9d9"><b>With contagion potential, unreliable tests have no influence on decision making.</b></p><h2 id="aa19">Mistakes happen when test reliability is not understood</h2><p id="40b7"><b>What if a doctor does not know the reliability of a test?</b></p><p id="8f9d">A doctor would understand that COVID-19 is highly contagious and lethal, and thus, their level of acceptable risk would be very low.</p><p id="8966">Suppose they mistook an unreliable test for a reliable test. Their initial belief of 50% probability would drop to an actual 40% probability of infection, while in their mind, it would be 2% because they assumed that the test was reliable. If they went on to release the patient, this would potentially have disastrous consequences.</p><p id="cc03">This is why the tools used in the clinic are backed by research. Doctors need certainty about what their tools are and aren’t capable of.</p><h2 id="069c">Checking multiple times with multiple tests.</h2><p id="8d27">If a doctor knows about the unreliability of a test and trusts their own judgement, then a single negative test result will not convince them. The doctor will dismiss the test result as an error.</p><p id="1ae7"><b>How many negative test results does it take to convince the doctor?</b></p><p id="bb49">Assume that if the probability is reduced to less than 5%, then the doctor will be sufficiently convinced that there is no infection.</p><p id="e951">After a clinical examination, a doctor believes that a patient has 50% probability of having COVID-19. He knows that the diagnostic test is unreliable, with 40% sensitivity and 90% specificity, and so he orders a series of diagnostic tests.</p><figure id="a73a"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*rFPpFgoO_T6iwFXPBu05og.png"><figcaption></figcaption></figure><p id="bae4">Every diagnostic test returns a negative result. The initial probability is 50%, and each test decreases the probability.</p><p id="9907">50% → 40% →30.8% → 22.9% → 16.5% → 11.6% → 8.0% → 5.5% → 3.7%</p><p id="d6be">In the graph, you can see that the green point has 5.5% prior probability and ends up with 3.7% posterior probability, which is past the grey threshold of acceptable risk.</p><p id="6009">It takes 8 consecutive negative results to convince a doctor who starts with a probability of 50%. Note that if it takes X tests starting from probability Y, then it will take at least X tests for any probability greater than Y. Any patient with an initial probability of infection greater than 8% will require at least 2 tests to rule out an infection.</p><p id="d929">Is this a feasible solution?</p><h1 id="5a99">The Role of Unreliable Tests in the COVID-19 Outbreak</h1><h2 id="bcd5">The COVID-19 Diagnostic Tests Are Unreliable — Low Sensitivity</h2><p id="2b4f">For the current outbreak of coronavirus, there have been reports of negative results for people who are actually infected — a false negative.</p><blockquote id="59ee"><p>On the patients being tested with nucleic acid tests (NATs), used to diagnose COVID-19.</p></blockquote><blockquote id="d7e4"><p>“Not all of those infected by it return positive NATs”.</p></blockquote><blockquote id="f70e"><p>“Even patients who definitely have the disease only come back positive 30 per cent to 50 per cent of the time.”</p></blockquote><blockquote id="13c1"><p>“Testing throat swabs (from potentially infected people) also returns a lot of false negatives.” — Professor Wang Chen, director of the Chinese Academy of Medical Sciences (<a href="https://www.caixinglobal.com/2020-02-08/key-diagnostic-test-might-be-missing-many-coronavirus-cases-101513176.html">source: Caixin Media</a>, a Feb 8 article.)</p></blockquote><p id="5f99">The experts themselves are saying that the tests are not sensitive enough — the probability of a positive result for an infected patient is too low.</p><p id="4023">The tests being described by these experts are similar to the unreliable tests that I have been using as an example.</p><p id="1505"><b>How is the containment of COVID-19 progressing?</b></p><h2 id="9af0">Releasing or not accepting patients with a negative result</h2><blockquote id="564e"><p>“Some patients whose CT scans clearly showed signs of viral infection but whose NATs tested negative were “released” back into their communities due to a shortage of hospital berths.” — anonymous doctor in the imaging department of a major Wuhan hospital.</p></blockquote><blockquote id="62e

Options

d"><p>(Source: <a href="https://www.caixinglobal.com/2020-02-08/key-diagnostic-test-might-be-missing-many-coronavirus-cases-101513176.html">Caixin Media</a>)</p></blockquote><blockquote id="4dc5"><p>“Though her doctor was almost certain she was infected with the virus, a throat-swab test she had taken came back negative, which meant the facility wouldn’t take her.”</p></blockquote><blockquote id="61d4"><p>(Source: <a href="https://www.wsj.com/articles/at-outbreaks-center-wuhan-residents-question-accuracy-of-virus-tests-11581454218">Wall Street Journal</a>)</p></blockquote><p id="9d9d">While these events are driven more by a lack of resources, it is sad to see patients being rejected from the hospital because of low-sensitivity tests.</p><h2 id="f47f">Ordering multiple tests, not releasing patients with a negative result</h2><blockquote id="dc45"><p>Wuhan’s top Communist Party official, Ma Guoqiang, cast doubt on the swab test on Monday, urging those with negative results to try again to be sure. “One needs to wait one day later, to get another negative, in order to rule out [the virus],” he said.</p></blockquote><blockquote id="a756"><p>(Source: <a href="https://www.wsj.com/articles/at-outbreaks-center-wuhan-residents-question-accuracy-of-virus-tests-11581454218">Wall Street Journal</a>)</p></blockquote><blockquote id="7d46"><p>“Her clinical features are very typical of the coronavirus infection. We didn’t discharge her, because we still have enough beds and can’t run the risk of letting her infect someone else. And on the fifth try, the result finally came back positive.” — Bin Song, director of the radiology department at Huaxi Hospital in Chengdu.</p></blockquote><blockquote id="2f96"><p>(Source: <a href="https://www.aljazeera.com/news/2020/02/china-coronavirus-toll-exceeds-1300-hubei-reports-242-deaths-200213003453493.html">Interview with Al Jazeera</a>)</p></blockquote><p id="7489">While multiple tests can provide confidence if a positive result is eventually reached, it has its costs, and the patients should have been quarantined and treated either way (since disconfirmation is nearly impossible).</p><h2 id="ca17">Do tests even add any valuable information for Wuhan patients?</h2><p id="1335">For a 40% sensitive, 90% specific test, we just saw that it takes 8 consecutive negative results to reduce a 50% probability of infection to less than 5% (where very low probabilities are required to reduce the risk of contagion). We also saw that it takes 2 consecutive negative results to reduce 8% to less than 5%. What is the probability of a person in Wuhan being infected, if they:</p><ul><li>Have already shown signs in a CT scan? (surely >50% probability?)</li><li>Have a cough/fever? (surely >8% probability?)</li><li>Are related to someone who has been infected?</li><li>Have entered any hospital?</li></ul><p id="bfe8">It seems like testing for the purposes of ruling out infection is not a feasible solution in an active outbreak zone. There are simply not enough resources to test everyone multiple times.</p><p id="9e4d">This may partially explain why…</p><h2 id="59c7">… Authorities have already implemented large-scale containment measures — they assume that everyone is, or will be infected.</h2><p id="ec7d"><b>Mandatory quarantine.</b></p> <figure id="18d9"> <div> <div> <img class="ratio" src="http://placehold.it/16x9"> <iframe class="" src="https://cdn.embedly.com/widgets/media.html?type=text%2Fhtml&key=a19fcc184b9711e1b4764040d3dc5c07&schema=twitter&url=https%3A//twitter.com/globaltimesnews/status/1228521356532146176&image=https%3A//i.embed.ly/1/image%3Furl%3Dhttps%253A%252F%252Fpbs.twimg.com%252Fmedia%252FEQyVoJNUUAE4zmI.jpg%253Alarge%26key%3Da19fcc184b9711e1b4764040d3dc5c07" allowfullscreen="" frameborder="0" height="281" width="500"> </div> </div> </figure></iframe></div></div></figure><p id="ab72"><b>Wartime controls</b></p><p id="9c83">Zhangwan district in the city of Shiyan, Hubei, implements wartime controls. Residents are not allowed to enter or leave buildings, with the exception of medical workers and a few other workers with critical roles.</p><div id="8155" class="link-block"> <a href="http://www.chinadaily.com.cn/a/202002/13/WS5e44ba07a310128217277470.html"> <div> <div> <h2>Hubei district launches wartime control order to better fight virus</h2> <div><h3>Zhangwan district in Shiyan, Hubei province, launched the nation's first wartime control order for better controlling…</h3></div> <div><p>www.chinadaily.com.cn</p></div> </div> <div> <div style="background-image: url(https://miro.readmedium.com/v2/resize:fit:320/0*UOsX64a9WykLRVmw)"></div> </div> </div> </a> </div><p id="899d"><b>Of course, unreliable diagnostic testing isn’t the only reason for all of these events, nor is it even the most important reason. There are plenty of reasons why many of these decisions were made.</b></p><p id="a33e">In terms of official policy and guidelines…</p><h2 id="8a18">The NHC changes diagnostic criteria to no longer require lab tests</h2><p id="c532">Given that low sensitivity tests cannot rule out infections and infectious people are dangerous to the public, the following news makes sense.</p><blockquote id="3cc7"><p><a href="http://www.nbd.com.cn/articles/2020-02-05/1405679.html">The National Health Commision changed the diagnostic criteria to add clinically confirmed cases as a category </a>— cases which do not require a positive rest result, but only suspected cases with the imaging features of pneumonia.</p></blockquote><h2 id="674d">The CDC does not even bother to test evacuees from Wuhan</h2><p id="3109">The Centers for Disease Control and Prevention in the US did not even bother to give diagnostic tests to the evacuees who escaped Wuhan. We don’t know if they are using the same test which is used in Wuhan, but we know that the test is not sensitive.</p><blockquote id="bae3"><p>“The test for the coronavirus often results in a false negative if given too early, when the person is infected but not yet showing symptoms.”</p></blockquote><blockquote id="7725"><p>“A false negative could provide a false sense of security” — Dr. Christopher Braden, CDC official.</p></blockquote><blockquote id="8d74"><p>(Source: <a href="https://edition.cnn.com/2020/02/13/health/coronavirus-quarantine-evacuee-petition-cdc/index.html">CNN</a>)</p></blockquote><p id="0a8c">The CDC knows that negative results do not add any useful information, and positive results will not have any effect on decision making. Evacuees will be kept in quarantine regardless of results, and sent to the hospital if symptoms develop.</p><h1 id="362f">Has the sensitivity problem been solved?</h1><p id="1ff7"><b>Feb 7: a Xinhua article claims that Wuhan is able to test 4,000–5,000 samples per day.</b></p><div id="2d44" class="link-block"> <a href="http://www.xinhuanet.com/english/2020-02/07/c_138763081.htm"> <div> <div> <h2>Hubei still short of at least 2,250 medical personnel</h2> <div><h3>WUHAN, Feb. 7 (Xinhua) — Central China’s Hubei Province is still in short of at least 2,250 medical personnel in face…</h3></div> <div><p>www.xinhuanet.com</p></div> </div> <div> <div style="background-image: url(https://miro.readmedium.com/v2/resize:fit:320/0*iMp8xrE4hLOl9nMq)"></div> </div> </div> </a> </div><p id="21f2"><b>Feb 9: A New York Times report claims that Hubei has the capability of performing 6,000 tests per day.</b></p><blockquote id="a785"><p>“The local health department says the labs can run 6,000 tests a day.”</p></blockquote><blockquote id="596a"><p>Source: <a href="https://www.nytimes.com/2020/02/09/world/asia/china-coronavirus-tests.html">New York Times</a>, Feb 9</p></blockquote><figure id="d7ce"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*22vJyaecoAtI0tVXfWNWxA.png"><figcaption>Number of new confirmed cases of COVID-19 for each day. Source: Wikipedia.</figcaption></figure><p id="0d4a">We have information on:</p><ul><li>Potential number of diagnostic tests</li><li>Reported number of confirmed cases</li></ul><p id="664a">We want to consider:</p><ul><li>What is the true number of infections (assuming confirmed cases require positive results)?</li><li>What is the true number of diagnostic tests?</li><li>What is the sensitivity of the diagnostic tests (has it been improved)?</li></ul><p id="6551">We assume that they reserve the tests for the most obvious/severe cases — cases which have gone through many rounds of assessment in the clinic, radiology, etc. This justifies my next assumption that a very large proportion of the tested people are going to be infected (>90%). This assumption remains while the number of suspected cases/clinical confirmations is still significant.</p><h2 id="3645">Wuhan</h2><p id="4ac6">Around the time of Feb 7, there are around 1,200–2,000 positive results per day.</p><p id="70cb">Suppose that they can do 4,000 tests on 4,000 people, and the tests have 40% sensitivity, 90% specificity.</p><p id="9666">(1) Suppose all 4,000 are infected. Then on average, 1,600 people would test positive, which is exactly in line with the number of confirmed figures.</p><p id="8c05">(2) Now suppose only 90% of tested people are infected — 3,600 are infected and 400 are not.</p><ul><li>Of 3,600 infected, 1,440 will test positive.</li><li>Of the 400 not infected, 40 will test positive.</li><li>In total, 1,480 will test positive. On the lower end, but in line with confirmed figures.</li><li>An alternative is that there are more than 4000 tests per day.</li></ul><p id="7fcd">(3) For 5000 tests a day and 90% infection, 4,500 people are infected.</p><ul><li>Of 4,500 infected, 1,800 test positive.</li><li>Of 500 not infected, 50 test positive.</li><li>In total, 1,850 people test positive. Another realistic figure.</li></ul><p id="255f">Now suppose that the tests are sensitive: 90% sensitivity and specificity.</p><p id="0597">(4) Suppose 40% of 4000 tested people are infected — 1,600 infected, 2,400 not infected.</p><ul><li>Of 1,600 infected: 1440 test positive.</li><li>Of 2,400 not infected: 240 test positive.</li><li>In total, 1680 test positive, which is in line with confirmed figures.</li></ul><p id="c994">Given that the article about test sensitivity was published Feb 8, the most realistic scenario is 90–100% infection rate among the tested, 30–50% sensitivity and 4000–5000 tests per day.</p><p id="34f4">This seems to confirm that test sensitivity was too low at the time of February 7 — which justifies the authorities’ decision to add clinical confirmations.</p><h2 id="9062">Hubei</h2><p id="2582">The data for Hubei around/before Feb 9 shows 2,000–3,000 new confirmed cases per day.</p><p id="04d1">Suppose that they do 6,000 tests per day, with 40% sensitivity, 90% specificity.</p><p id="6cb6">(1) Suppose that 90% of people are infected — 5,400 infections.</p><ul><li>Of the 5,400 infected, 2,160 test positive.</li><li>Of the 600 not infected, 60 test positive.</li><li>In total, 2,220 test positive.</li></ul><p id="ceef">While this is in the range, it is on the lower end, so there may be more infections, more testing capabilities than reported, or possibly better test sensitivity (if other parts of Hubei use different tests from Wuhan).</p><h2 id="5dc4">Either way, it is clear that low sensitivity diagnostic tests are a major problem in this COVID-19 outbreak.</h2></article></body>

The 2020 Novel Coronavirus Outbreak | Thoughts on Probability and Statistics

Bayes’ Rule, Unreliable Diagnostic Testing, And Containing COVID-19

How false-negatives in diagnostic testing are leading to the release of infected people, motivating extreme containment measures. The COVID-19 outbreak, explained with Bayes’ Rule.

If you are reading this after 2020, please keep in mind that this post was written during the early stages of the COVID-19 pandemic, and hence, may not reflect a reality beyond this time.

We are currently in February 2020. Over the past month, a deadly virus has been spreading throughout China and the world, sending the infected to the ICU and trapping others in their homes. As authorities try to manage this crisis, they face the challenging issue of containment — sending the infected to quarantine, while allowing the non-infected to go free.

The Problem With Epidemics That Plagues The Authorities

Here is the scenario. You have a cough and a fever. There is a chance that you have caught COVID-19 — the virus spreading throughout the world. You don’t know what this chance is, and you don’t want to take chances, so you seek advice from your doctor.

The issue from the authorities perspective is different. You want treatment, but the authorities need to contain the spread of the virus. From their point of view, there are 4 main outcomes of your visit to the doctor.

If you are infected and diagnosed with coronavirus, they will quarantine you for the public benefit.
If you are not infected but you are diagnosed with coronavirus, they will wrongly quarantine you, causing you inconvenience. The public will suffer no major harm, but the authorities will have to expend a small amount of resources.
If you are infected and not diagnosed with coronavirus, they will wrongly release you, causing you to spread the virus. This puts the public in grave danger of an outbreak.
If you are not infected and not diagnosed with coronavirus, they will rightly release you and save some resources.

The two mistakes that the authorities can make is scenario 2 and 3. Scenario 2 is a minor inconvenience (if not done too often), but scenario 3 is the major issue which can cascade into a larger outbreak, even if only done once. If an outbreak occurs, they will have to do contact tracing for possibly hundreds of people, given the contagiousness and lethality of coronavirus. This will be extremely costly for them, so their primary interest is in minimizing the probability of the third scenario.

Thus, the authorities need to make an accurate diagnosis, so that they can avoid releasing the infected and quarantining the non-infected. To achieve this, the authorities first make an initial assessment of all suspected infections, whether they are patients at the medical clinic, or travellers from places with active outbreaks.

Initial Assessment of a Patient

There are clues which will hint at a COVID-19 infection.

Location: there will be different probabilities of infection for people living in different places. People near the epicenter are more likely to be infected. People living near popular tourist destinations are more likely to be infected than people from isolated places like rural Alaska.
Travel history: travellers from places near the epicenter or other outbreak locations are more likely to have the infection (hence, why travellers are screened).
Social contacts: people who have close contact with the infected/those at risk of infection are more likely to be infected themselves.
Symptoms: people who have symptoms characteristic of coronavirus, such as fever, cough, and shortness of breath are more likely to be infected.
Imaging: people who show imaging features of coronavirus on an x-ray or CT scan are more likely to be infected.

Not all of these clues will be immediately available. Authorities at the airport can only screen people for temperature and identify travel history if needed. A doctor will know about the symptoms disclosed by the patient, but it is on the doctor to take the initiative to find and link the clues together. And an asymptomatic case may not have any clues.

You go to the doctor, and the doctor asks you a series of questions. You tell the doctor about your cough and fever. The doctor is suspicious and builds their own belief of whether you are infected or not. Their belief is strong enough to justify a coronavirus test, so they order you to take this test.

The Probability and Statistics Behind Diagnostic Testing

After gathering enough clues, a doctor may suspect coronavirus. To confirm this suspicion, the doctor will order a diagnostic test.

Misconceptions about what the diagnostic test result means

A diagnostic test is performed by collecting samples from your body (eg. mucus in the back of the nose) and looking for presence of the virus in those samples. It seems simple enough, and people have a lot of faith in science. This may lead people into making this first mistake.

Incorrect interpretation: a positive result means a patient has novel coronavirus, while a negative result means that a patient does not.

This is not true, because the test is not always reliable. There are many reasons why a test may give a misleading result:

A patient in the very early stages of an infection may not excrete a detectable amount of virus.
The virus itself may only exist deeper inside the body, hence being inaccessible by a swab test.
There may have been accidental contamination of the sample.

In general, the people who make the testing kits will specify the reliability of the test.

Suppose that a company now markets their test as “90% accurate”. This can lead to another common mistake.

Incorrect interpretation: for a 90% accurate test, a positive result means 90% chance of being infected, and a negative result means 90% chance of not being infected.

This interpretation is also not true, but is actually surprisingly common in the medical community— the great psychologist Gerd Gigerenzer shows how doctors misinterpret the results of mammogram results article.

Bayesian probability explains what the diagnostic test really means

The correct way to evaluate a diagnostic test requires thinking in terms of Bayesian probability. To put it simply, Bayesian probability involves having a prior probability and then using new information to update it. In terms of diagnostic testing, the prior probability is the doctor’s belief about whether the patient is infected or not. The test result is used as information, and this changes the doctor’s belief.

We can formulate the reliability question in terms of math equations.

Prior probability

The initial probability of infection, which is based on the doctor’s judgement.

Posterior probability: how reliable is a positive result?

The probability of infection is conditioned on a positive result. The positive result is information.

Posterior probability: how reliable is a negative result?

The probability of (no) infection is conditioned on a negative result. The negative result is information.

What we need to do now is to connect the doctor’s initial belief with the final belief. To do this, we use Bayes’ rule, which can easily be derived using basic facts about conditional probabilities.

The denominator is expressed like this.

We now have two new unknown probabilities.

The probability of a positive result given that a patient is infected. This number should be high — infected patients should be getting positive test results.
The probability of a positive result given that a patient is not infected . This number should be low — non-infected patients should be getting negative test results.

These two probabilities are actually measures of a test’s reliability, and they can be expressed in terms of two quantities: sensitivity and specificity.

Sensitivity

Probability of a positive result given infection.
The test is “sensitive” to the presence of coronavirus. If the coronavirus is present, the test will detect it.
Ideally, close to 100%.

Specificity

Probability of a negative result given no infection.
The test is “specific” to coronavirus. If there is no coronavirus infection, the test will not detect anything, and returns negative.
Ideally, close to 100%.

I should remind you here that I can express probabilities in terms of complements. Infection and no infection are mutually exclusive. Similarly, a positive and a negative result are also mutually exclusive. This means that our calculations will use these two equations, where A and B are events such as “the patient is infected”, or “the test result is positive”.

The probability of A not occurring is one minus the probability of A occurring.

Given that B has occurred, the probability of A not occurring is still one minus the probability of A occurring.

The probability of infection given a test result

We now have all the tools we need to interpret a test result

Prior probability — initial belief
Posterior probability — final belief
Bayes’ rule — connects initial and final belief
Sensitivity and specificity — allows us to do computations using Bayes’ rule

Combining these, we get

We now know how to incorporate the information from a test result to change an initial belief into a final belief. Next, we look at diagnostic tests with different levels of reliability, and see how their results might affect the beliefs of doctors.

The Application of Diagnostic Tests in Medicine

You followed the doctor’s advice and got a diagnostic test. The doctor now receives a positive or negative result for coronavirus and now needs to make a decision:

1. Prescribe any necessary medication and let you go.

2. Order you to be quarantined.

Decision Making in Medicine — Reducing Uncertainty And Taking Acceptable Risks

To decide when to treat, the doctor needs to have confidence in a diagnosis.

In general medical practice, a doctor can make two mistakes:

You are not infected, but the doctor decides that you are, and gives treatment.
You are infected, but the doctor decides that you aren’t, and does not give you the proper treatment / lets you go.

In order to avoid these mistakes, the doctor orders a test. They had a prior belief, and the test is used as information to reduce uncertainty. Once the doctor has enough certainty, the doctor can make a recommendation.

How much uncertainty can be reduced by a test?

We first look at a reliable test. Consider a test that has sensitivity and specificity at 98%. We show what the prior and posterior probabilities of infection are, given a positive or negative result.

If you think the probability of someone being infected is 80%, then a positive result takes it to near 100% (near certain infection), while a negative result reduces the probability to under 10% (unlikely infection).
Similarly, if you think that the probability of someone being infected is only 20%, then a positive result increases it to over 90% (likely infection), while a negative result takes it to near 0% (infection is nearly impossible).
Any initial probability which is greater than 10% increases to a higher probability (>80%) after a positive result.
Any initial probability which is less than 90% decreases to a lower probability(<20%) after a negative result.
Since the test is able to change a large range of initial probabilities into final probabilities near 0% and 100%, it resolves the uncertainty of whether a patient is infected or not. Hence it is a reliable diagnostic tool.

The Ideal test

The orange curve (bottom, for negative result) is very close to the x-axis, which would mean that a negative result virtually guarantees a zero probability of infection.
The blue curve (top, for positive result) is straight up the y-axis and right, which would mean that a positive result virtually guarantees a certain probability of infection.

Diagnostic tests are not perfect — bad luck exists, decisions are always risks.

If the doctor has a reliable test, then a prior belief of 50% can be reduced to less than 5%, or some other very low figure. Unfortunately, these tests are not perfect. There is still a non-zero probability of actually being infected, so the doctor can make a mistake out of sheer bad luck.

How much risk is tolerable?

Because the doctor ultimately has to make a decision, even under uncertainty, the doctor has to decide what is an acceptable risk.

Examples of where risk is unacceptable:

Near-certainty is required before starting a harsh treatment with irreversible side effects.
A very low probability of having a lethal infectious disease is required before the patient is released.

Examples of where risk is more acceptable:

Less certainty is needed if the condition is mild and not life-threatening, and the treatment does not have permanent effects.
Less certainty is needed if the patient is in immediate mortal danger and requires medical intervention as soon as possible (the potential harm of treatment is less than the harm of no treatment).

Decision Making Without Reliable Diagnostic Tests

(From here, we switch focus from general medicine, back to the coronavirus outbreak.)

We’ve established that a doctor needs to resolve uncertainty using a diagnostic test before they can have the confidence to take an acceptable risk.

However, what happens if the diagnostic test is not reliable?

We try to use realistic values. Many flu tests have a specificity of around 90–95%. Keeping in line with this, I assume a 90% specificity (feel free to correct me if you have better information).

I assume a 40% sensitivity. This turns out to be close to the reported sensitivity for COVID-19 coronavirus tests.

Suppose that a patient lives in an area which recently had an outbreak, and is now presenting to the medical clinic with symptoms of coronavirus. If the doctor deems the probability of a COVID-19 infection to be 50% (likely infection), and orders this unreliable test for the patient:

A positive result increases the probability to 80% (likely infection)
A negative result decreases the probability to a value to 40% (possible infection).
40% probability of infection is still too high. Given that the virus is contagious and lethal, releasing the patient is an unacceptable risk.

With the reliable test:

A 50% probability of infection can be decreased to 2% with a negative result.
A COVID-19 infection can potentially be ruled out (if 2% is acceptable risk).

Using a single unreliable test, a patient‘s probability of infection cannot be reduced to an acceptable level, regardless of a positive or negative result.

The unreliable test cannot rule out infection.

\What happens as a test gets less and less sensitive? Negative results are less meaningful — the orange line (negative result) quickly approaches the straight green line (no test performed).

No decision making (for epidemics) without reliable diagnostic tests

Now imagine a scenario where people are quarantined until the authorities are confident that they are not infected. If the test is unreliable, then it cannot prove that there is no infection. In other words, the test has no consequences on decision making — the patient will remain in quarantine either way.

With contagion potential, unreliable tests have no influence on decision making.

Mistakes happen when test reliability is not understood

What if a doctor does not know the reliability of a test?

A doctor would understand that COVID-19 is highly contagious and lethal, and thus, their level of acceptable risk would be very low.

Suppose they mistook an unreliable test for a reliable test. Their initial belief of 50% probability would drop to an actual 40% probability of infection, while in their mind, it would be 2% because they assumed that the test was reliable. If they went on to release the patient, this would potentially have disastrous consequences.

This is why the tools used in the clinic are backed by research. Doctors need certainty about what their tools are and aren’t capable of.

Checking multiple times with multiple tests.

If a doctor knows about the unreliability of a test and trusts their own judgement, then a single negative test result will not convince them. The doctor will dismiss the test result as an error.

How many negative test results does it take to convince the doctor?

Assume that if the probability is reduced to less than 5%, then the doctor will be sufficiently convinced that there is no infection.

After a clinical examination, a doctor believes that a patient has 50% probability of having COVID-19. He knows that the diagnostic test is unreliable, with 40% sensitivity and 90% specificity, and so he orders a series of diagnostic tests.

Every diagnostic test returns a negative result. The initial probability is 50%, and each test decreases the probability.

50% → 40% →30.8% → 22.9% → 16.5% → 11.6% → 8.0% → 5.5% → 3.7%

In the graph, you can see that the green point has 5.5% prior probability and ends up with 3.7% posterior probability, which is past the grey threshold of acceptable risk.

It takes 8 consecutive negative results to convince a doctor who starts with a probability of 50%. Note that if it takes X tests starting from probability Y, then it will take at least X tests for any probability greater than Y. Any patient with an initial probability of infection greater than 8% will require at least 2 tests to rule out an infection.

Is this a feasible solution?

The Role of Unreliable Tests in the COVID-19 Outbreak

The COVID-19 Diagnostic Tests Are Unreliable — Low Sensitivity

For the current outbreak of coronavirus, there have been reports of negative results for people who are actually infected — a false negative.

On the patients being tested with nucleic acid tests (NATs), used to diagnose COVID-19.

“Not all of those infected by it return positive NATs”.

“Even patients who definitely have the disease only come back positive 30 per cent to 50 per cent of the time.”

“Testing throat swabs (from potentially infected people) also returns a lot of false negatives.” — Professor Wang Chen, director of the Chinese Academy of Medical Sciences (source: Caixin Media, a Feb 8 article.)

The experts themselves are saying that the tests are not sensitive enough — the probability of a positive result for an infected patient is too low.

The tests being described by these experts are similar to the unreliable tests that I have been using as an example.

How is the containment of COVID-19 progressing?

Releasing or not accepting patients with a negative result

“Some patients whose CT scans clearly showed signs of viral infection but whose NATs tested negative were “released” back into their communities due to a shortage of hospital berths.” — anonymous doctor in the imaging department of a major Wuhan hospital.

(Source: Caixin Media)

“Though her doctor was almost certain she was infected with the virus, a throat-swab test she had taken came back negative, which meant the facility wouldn’t take her.”

(Source: Wall Street Journal)

While these events are driven more by a lack of resources, it is sad to see patients being rejected from the hospital because of low-sensitivity tests.

Ordering multiple tests, not releasing patients with a negative result

Wuhan’s top Communist Party official, Ma Guoqiang, cast doubt on the swab test on Monday, urging those with negative results to try again to be sure. “One needs to wait one day later, to get another negative, in order to rule out [the virus],” he said.

(Source: Wall Street Journal)

“Her clinical features are very typical of the coronavirus infection. We didn’t discharge her, because we still have enough beds and can’t run the risk of letting her infect someone else. And on the fifth try, the result finally came back positive.” — Bin Song, director of the radiology department at Huaxi Hospital in Chengdu.

(Source: Interview with Al Jazeera)

While multiple tests can provide confidence if a positive result is eventually reached, it has its costs, and the patients should have been quarantined and treated either way (since disconfirmation is nearly impossible).

Do tests even add any valuable information for Wuhan patients?

For a 40% sensitive, 90% specific test, we just saw that it takes 8 consecutive negative results to reduce a 50% probability of infection to less than 5% (where very low probabilities are required to reduce the risk of contagion). We also saw that it takes 2 consecutive negative results to reduce 8% to less than 5%. What is the probability of a person in Wuhan being infected, if they:

Have already shown signs in a CT scan? (surely >50% probability?)
Have a cough/fever? (surely >8% probability?)
Are related to someone who has been infected?
Have entered any hospital?

It seems like testing for the purposes of ruling out infection is not a feasible solution in an active outbreak zone. There are simply not enough resources to test everyone multiple times.

This may partially explain why…

… Authorities have already implemented large-scale containment measures — they assume that everyone is, or will be infected.

Mandatory quarantine.

Wartime controls

Zhangwan district in the city of Shiyan, Hubei, implements wartime controls. Residents are not allowed to enter or leave buildings, with the exception of medical workers and a few other workers with critical roles.

Hubei district launches wartime control order to better fight virus

Zhangwan district in Shiyan, Hubei province, launched the nation's first wartime control order for better controlling…

www.chinadaily.com.cn

Of course, unreliable diagnostic testing isn’t the only reason for all of these events, nor is it even the most important reason. There are plenty of reasons why many of these decisions were made.

In terms of official policy and guidelines…

The NHC changes diagnostic criteria to no longer require lab tests

Given that low sensitivity tests cannot rule out infections and infectious people are dangerous to the public, the following news makes sense.

The National Health Commision changed the diagnostic criteria to add clinically confirmed cases as a category — cases which do not require a positive rest result, but only suspected cases with the imaging features of pneumonia.

The CDC does not even bother to test evacuees from Wuhan

The Centers for Disease Control and Prevention in the US did not even bother to give diagnostic tests to the evacuees who escaped Wuhan. We don’t know if they are using the same test which is used in Wuhan, but we know that the test is not sensitive.

“The test for the coronavirus often results in a false negative if given too early, when the person is infected but not yet showing symptoms.”

“A false negative could provide a false sense of security” — Dr. Christopher Braden, CDC official.

(Source: CNN)

The CDC knows that negative results do not add any useful information, and positive results will not have any effect on decision making. Evacuees will be kept in quarantine regardless of results, and sent to the hospital if symptoms develop.

Has the sensitivity problem been solved?

Feb 7: a Xinhua article claims that Wuhan is able to test 4,000–5,000 samples per day.

Hubei still short of at least 2,250 medical personnel

WUHAN, Feb. 7 (Xinhua) — Central China’s Hubei Province is still in short of at least 2,250 medical personnel in face…

www.xinhuanet.com

Feb 9: A New York Times report claims that Hubei has the capability of performing 6,000 tests per day.

“The local health department says the labs can run 6,000 tests a day.”

Source: New York Times, Feb 9

Number of new confirmed cases of COVID-19 for each day. Source: Wikipedia.

We have information on:

Potential number of diagnostic tests
Reported number of confirmed cases

We want to consider:

What is the true number of infections (assuming confirmed cases require positive results)?
What is the true number of diagnostic tests?
What is the sensitivity of the diagnostic tests (has it been improved)?

We assume that they reserve the tests for the most obvious/severe cases — cases which have gone through many rounds of assessment in the clinic, radiology, etc. This justifies my next assumption that a very large proportion of the tested people are going to be infected (>90%). This assumption remains while the number of suspected cases/clinical confirmations is still significant.

Wuhan

Around the time of Feb 7, there are around 1,200–2,000 positive results per day.

Suppose that they can do 4,000 tests on 4,000 people, and the tests have 40% sensitivity, 90% specificity.

(1) Suppose all 4,000 are infected. Then on average, 1,600 people would test positive, which is exactly in line with the number of confirmed figures.

(2) Now suppose only 90% of tested people are infected — 3,600 are infected and 400 are not.

Of 3,600 infected, 1,440 will test positive.
Of the 400 not infected, 40 will test positive.
In total, 1,480 will test positive. On the lower end, but in line with confirmed figures.
An alternative is that there are more than 4000 tests per day.

(3) For 5000 tests a day and 90% infection, 4,500 people are infected.

Of 4,500 infected, 1,800 test positive.
Of 500 not infected, 50 test positive.
In total, 1,850 people test positive. Another realistic figure.

Now suppose that the tests are sensitive: 90% sensitivity and specificity.

(4) Suppose 40% of 4000 tested people are infected — 1,600 infected, 2,400 not infected.

Of 1,600 infected: 1440 test positive.
Of 2,400 not infected: 240 test positive.
In total, 1680 test positive, which is in line with confirmed figures.

Given that the article about test sensitivity was published Feb 8, the most realistic scenario is 90–100% infection rate among the tested, 30–50% sensitivity and 4000–5000 tests per day.

This seems to confirm that test sensitivity was too low at the time of February 7 — which justifies the authorities’ decision to add clinical confirmations.

Hubei

The data for Hubei around/before Feb 9 shows 2,000–3,000 new confirmed cases per day.

Suppose that they do 6,000 tests per day, with 40% sensitivity, 90% specificity.

(1) Suppose that 90% of people are infected — 5,400 infections.

Of the 5,400 infected, 2,160 test positive.
Of the 600 not infected, 60 test positive.
In total, 2,220 test positive.

While this is in the range, it is on the lower end, so there may be more infections, more testing capabilities than reported, or possibly better test sensitivity (if other parts of Hubei use different tests from Wuhan).