Free AI web copilot to create summaries, insights and extended knowledge, download it at here
3462
Abstract
s being split as per the weights assigned in the experiment configuration.</p><p id="450e"><a href="https://www.bmj.com/about-bmj/resources-readers/publications/statistics-square-one/8-chi-squared-tests"><b>Chi-square statistics</b></a> can be used to check if the likelihood of the difference between the ratios is outside of normal chance. We can generate an SRM alert when the p-value of the test falls below the threshold of 0.01.</p><figure id="8f18"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/0*idJc0H8amAwsUtrP.png"><figcaption>Chi-Square Statistics</figcaption></figure>
<figure id="79e2">
<div>
<div>
<iframe class="gist-iframe" src="/gist/kharepratyush/2ea3aa7ab5735443aa6e4c8da137ccc0.js" allowfullscreen="" frameborder="0" height="undefined" width="undefined">
</div>
</div>
</figure></iframe></div></div></figure><h2 id="98ab">Flickers</h2><p id="ce89">Flickers are respondents who have switched between control and treatment groups. For instance, an application user may have an older version of the OS and choose to upgrade it while the experiment is running, leading them to experience both the control and treatment effects. Such a user constitutes a flicker in the experiment. Ideally, there should be no flickers in an experiment but depending on the configuration, we may get them. <b>We should remove flickers before running the analysis</b> and can also report the percentage of flickers in the experiment</p><h2 id="65fc">Bias Detection</h2><p id="e2b1">To check if there is any bias between sampled variants, we can apply bias detection which evaluates the goal metric on pre-experiment data using the <b>K Sample Anderson-Darling Test</b></p><p id="8862"><a href="https://www.itl.nist.gov/div898/software/dataplot/refman1/auxillar/andeksam.htm"><b>K-sample Anderson–Darling test</b></a> is available for testing whether several collections of observations can be modeled as coming from a single population, where the distribution function does not have to be specified.</p><p id="65d8">For k samples, the statistic can be computed as follows under the assumption that the distribution function Fᵢ of <i>ith</i> sample is continuous.</p><figure id="b31b"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*GNFR0P6G0RV2FN1PEICfow.png"><figcaption>K-Sample Anderson Darling Test Statistics (Source: <a href="https://en.wikipedia.org/wiki/Anderson%E2%80%93Darling_test">Wikipedia</a>)</figcaption></figure><p id="981e"><b>where</b></p><ul><li>nᵢ is the number of observations in the ith sample</li><li>N is the total number of observations in all samples</li><li>Zᵢ is the pooled ordered sample</li><li>Mᵢⱼ is the number of observations in the ith sample that are not greater than Zᵢ</li></ul>
<figure id="b051">
<div>
<div>
<iframe class="gist-iframe" src="/gist/kharepratyush/ae6c409460e8a7c7c790089a36713ac2.js" allowfullscreen="" frameborder="0" height="undefined" width="undefined">
</div>
</div>
</figure></iframe></div></div></figure><p id="5664">If bias exists, we can run a <b>difference-in-difference method, strategized sampling, </b>or <b>propensity matching</b><i> </i>on the data to get reliable results.</p><p id="7eca"><i>To read more about the <b>Difference-in-difference method</b> ple
Options
ase refer to the next blog of the series.</i></p><h1 id="da7c">Variance Reduction using CUPED</h1><p id="51d2">CUPED, or <b>Control using Pre-Experiment Data</b>, represents an innovative approach designed to enhance the robustness of experimental analyses. <b>The method strategically incorporates pre-experiment data to effectively control for the inherent variations observed in the metric under investigation during an experiment</b>. This pre-emptive adjustment serves the crucial purpose of eliminating or reducing the impact of natural fluctuations, thereby providing a more stable foundation for statistical analyses.</p><figure id="b8b9"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*pVTOZxVFW5MpDN88o3sTOA.png"><figcaption>CUPED metric (<a href="https://booking.ai/how-booking-com-increases-the-power-of-online-experiments-with-cuped-995d186fff1d">Source</a>)</figcaption></figure><p id="9b44">The primary advantage of employing CUPED lies in its ability to facilitate statistical tests with a diminished sample size requirement. <b>By reducing the influence of natural variation, researchers can achieve greater precision in their analyses</b>, leading to more reliable and conclusive results. This reduction in sample size not only streamlines the experimental process but also contributes to resource efficiency, making experiments more cost-effective and manageable.</p><figure id="61a1"><img src="https://cdn-images-1.readmedium.com/v2/resize:fit:800/1*MaAyuybK3RBDcMR9YZUgig.png"><figcaption>CUPED Equation</figcaption></figure><p id="c239">Furthermore, the versatility of CUPED is a noteworthy feature, as it seamlessly integrates into a diverse array of A/B testing frameworks. Its computational efficiency ensures that the method does not impose a significant computational burden, making it a practical and accessible tool for researchers and data scientists. Moreover, the straightforward coding implementation of CUPED simplifies its adoption, allowing researchers to easily incorporate this technique into their experimental designs.</p><p id="c3f9"><b><i>To continue exploring the successive components of this series, please refer to <a href="https://readmedium.com/9c09437dc6fa">Part 2</a> of the blog.</i></b></p><h1 id="1619">References:</h1><ol><li><a href="https://en.wikipedia.org/wiki/A/B_testing">https://en.wikipedia.org/wiki/A/B_testing</a></li><li><a href="https://www.scribbr.com/methodology/controlled-experiment/">https://www.scribbr.com/methodology/controlled-experiment/</a></li><li><a href="https://www.bmj.com/about-bmj/resources-readers/publications/statistics-square-one/8-chi-squared-tests">https://www.bmj.com/about-bmj/resources-readers/publications/statistics-square-one/8-chi-squared-tests</a></li><li><a href="https://www.itl.nist.gov/div898/software/dataplot/refman1/auxillar/andeksam.htm">https://www.itl.nist.gov/div898/software/dataplot/refman1/auxillar/andeksam.htm</a></li><li><a href="https://booking.ai/how-booking-com-increases-the-power-of-online-experiments-with-cuped-995d186fff1d">https://booking.ai/how-booking-com-increases-the-power-of-online-experiments-with-cuped-995d186fff1d</a></li></ol><p id="ab0b">If you have read so far, a big thank you for reading! I hope you find this article to be helpful.<i> </i>If you’d like, <a href="https://www.linkedin.com/in/kharepratyush/">add me on <i>LinkedIn</i>!</a></p><p id="4b24">Good luck this week,
Pratyush</p></article></body>