<?xml version="1.0"?>
<rss version="2.0">
   <channel>
      <title>SPFT R&amp;D Statistics: Handling missing data by Mutanda Daniel (Sussex Partnership Trust)</title>
      <link>https://padlet.com/danielmutanda/prot6r45tbg8ekir</link>
      <description>A guide for some of the statistical methods used to handle missing data. Researcher must consider the patterns of missingness is there data and answer questions like, &#39;what percentage of the data is missing&#39;, &#39;how was the data collected&#39;, or &#39;what can explain why data is missing?&#39;. This resource serves as a roadmap for the journey to answering these questions and finding potential solutions for your data.A guide for some of the statistical methods used to handle missing data. Researchers must consider the patterns of missingness in their data and answer questions like, &#39;what percentage of the data is missing&#39;, &#39;how was the data collected&#39;, or &#39;what can explain why data is missing?&#39;. This resource serves as a roadmap for the journey to answering these questions and finding potential solutions for your data.</description>
      <language>en-us</language>
      <pubDate>2022-01-14 11:21:37 UTC</pubDate>
      <lastBuildDate>2022-06-06 15:25:21 UTC</lastBuildDate>
      <webMaster>hello@padlet.com</webMaster>
      <image>
         <url>https://padlet.net/icons/png/1f50d.png</url>
      </image>
      <item>
         <title>Missing data mechanisms</title>
         <author>danielmutanda</author>
         <link>https://padlet.com/danielmutanda/prot6r45tbg8ekir/wish/1992137391</link>
         <description><![CDATA[<div>To start, it is important to determine the missing data mechanism which is one of three explanations for a missing variable.</div>]]></description>
         <enclosure url="" />
         <pubDate>2022-01-14 11:25:40 UTC</pubDate>
         <guid>https://padlet.com/danielmutanda/prot6r45tbg8ekir/wish/1992137391</guid>
      </item>
      <item>
         <title>Missing Data Roadmap</title>
         <author>danielmutanda</author>
         <link>https://padlet.com/danielmutanda/prot6r45tbg8ekir/wish/1992138801</link>
         <description><![CDATA[<div>Research design can have a diverse range of methods to produce data, a common problem when it comes to analysing this data is encountering missingness and handling it in a way that is appropriate for the dataset and the researcher's expertise.</div>]]></description>
         <enclosure url="" />
         <pubDate>2022-01-14 11:26:57 UTC</pubDate>
         <guid>https://padlet.com/danielmutanda/prot6r45tbg8ekir/wish/1992138801</guid>
      </item>
      <item>
         <title>Missing Completely At Random (MCAR)</title>
         <author>danielmutanda</author>
         <link>https://padlet.com/danielmutanda/prot6r45tbg8ekir/wish/1992144664</link>
         <description><![CDATA[<div>When data is Missing Completely at Random (MCAR) it means there are no differences between participants with missing data and those with complete data.<br><br>Therefore, the reasons for the missingness are unrelated to the research questions that would be answered by analysing the data.<br><br>Consequently, when the dataset is MCAR, all observed data could be used in the analysis, such as with complete case analysis, and can produce unbiased inferences</div>]]></description>
         <enclosure url="" />
         <pubDate>2022-01-14 11:31:56 UTC</pubDate>
         <guid>https://padlet.com/danielmutanda/prot6r45tbg8ekir/wish/1992144664</guid>
      </item>
      <item>
         <title>Missing At Random (MAR)</title>
         <author>danielmutanda</author>
         <link>https://padlet.com/danielmutanda/prot6r45tbg8ekir/wish/1992145326</link>
         <description><![CDATA[<div>When data is MAR, missingness is related to the observed but not the unobserved data. Therefore, whether or not data is missing can be predicted by observed data (i.e with logistic regression). Accounting for variables that predict missingness can produce unbiased inferences</div>]]></description>
         <enclosure url="" />
         <pubDate>2022-01-14 11:32:34 UTC</pubDate>
         <guid>https://padlet.com/danielmutanda/prot6r45tbg8ekir/wish/1992145326</guid>
      </item>
      <item>
         <title>Missing Not At Random (MNAR)</title>
         <author>danielmutanda</author>
         <link>https://padlet.com/danielmutanda/prot6r45tbg8ekir/wish/1992146062</link>
         <description><![CDATA[<div>If the data is not MCAR or MAR, then we say it is Missing Not at Random (MNAR), this means the missingness of data is related to the unobserved data. If complete case analysis is biased and the causes of missing data are unmeasured variables, then this generally cannot be addressed in analysis and the inferences will likely be biased.</div>]]></description>
         <enclosure url="" />
         <pubDate>2022-01-14 11:33:13 UTC</pubDate>
         <guid>https://padlet.com/danielmutanda/prot6r45tbg8ekir/wish/1992146062</guid>
      </item>
      <item>
         <title>Example</title>
         <author>danielmutanda</author>
         <link>https://padlet.com/danielmutanda/prot6r45tbg8ekir/wish/1992147603</link>
         <description><![CDATA[<div>An example case for handling missing data, when examining the data set of a feasibility trial for group-based Behavioural Activation therapy, if we observe PHQ-9 as the primary outcome measuring depression (𝑋), GAD-7 as a secondary outcome that measures anxiety (𝑌), and group allocation (𝑍) as another variable that may be influencing the missing mechanism. For the data to be MCAR, it is necessary but not sufficient for neither GAD-7 nor group allocation to predict PHQ-9 missingness (<em>R</em>). In such a scenario, missingness may be due to an administrative error. Whereas for us to assume the data is MAR, it is necessary but not sufficient for PHQ-9 missingness to be predicted by GAD-7 and group allocation but not by the value of the PHQ-9 scores. In this scenario, being in a different group, such as an online group rather than in-person, predicts PHQ-9 missingness but not the value of the PHQ-9 score. In an MNAR scenario, group may predict PHQ-9 missingness, but the value of the PHQ-9 scores also predict its missingness, e.g. being more depressed leads to a lower likelihood of returning the PHQ-9 questionnaire.&nbsp;<br><br></div>]]></description>
         <enclosure url="" />
         <pubDate>2022-01-14 11:34:44 UTC</pubDate>
         <guid>https://padlet.com/danielmutanda/prot6r45tbg8ekir/wish/1992147603</guid>
      </item>
      <item>
         <title>Multiple Imputation</title>
         <author>danielmutanda</author>
         <link>https://padlet.com/danielmutanda/prot6r45tbg8ekir/wish/1992168714</link>
         <description><![CDATA[<div>Multiple imputation (MI) is a widely used general method for handling missingness in most types of MAR data. The general rule for using MI is the data missingness is above 5% but below 40-50%. MI could be used if other variables are also missing, but the mechanism for missingness must not be MCAR or MNAR (Jakobsen et al., 2017).<br><br></div><div>In many methods when a dataset has a dependent variable with missing values, it is important to identify auxiliary variables which enhance the estimation of the main outcomes variables.<br><br></div>]]></description>
         <enclosure url="" />
         <pubDate>2022-01-14 11:54:46 UTC</pubDate>
         <guid>https://padlet.com/danielmutanda/prot6r45tbg8ekir/wish/1992168714</guid>
      </item>
      <item>
         <title>Full Information Maximum Likelihood </title>
         <author>danielmutanda</author>
         <link>https://padlet.com/danielmutanda/prot6r45tbg8ekir/wish/1992233900</link>
         <description><![CDATA[<div>Full information maximum likelihood (FIML) is a method for dealing with missing data that is generally simpler to implement than Multiple Imputation. It can produce unbiased estimates if the data is MCAR or MAR, and it works well when adding auxiliary variables even if missingness is above 50% (Dong &amp; Peng, 2013). However, incorporating auxiliary variables into FIML is not as straightforward as it is with MI.</div>]]></description>
         <enclosure url="" />
         <pubDate>2022-01-14 12:54:21 UTC</pubDate>
         <guid>https://padlet.com/danielmutanda/prot6r45tbg8ekir/wish/1992233900</guid>
      </item>
      <item>
         <title>Auxiliary variables</title>
         <author>danielmutanda</author>
         <link>https://padlet.com/danielmutanda/prot6r45tbg8ekir/wish/1992235955</link>
         <description><![CDATA[<div>When the dataset has a dependent variable with missing values, it is important to identify auxiliary variables, which are not variables of primary interest but are used to improve sampling or to enhance estimation of the main outcomes variables (Lavrakas, 2008). If no auxiliary variables are identified, then a complete case analysis should be performed.</div>]]></description>
         <enclosure url="" />
         <pubDate>2022-01-14 12:55:46 UTC</pubDate>
         <guid>https://padlet.com/danielmutanda/prot6r45tbg8ekir/wish/1992235955</guid>
      </item>
      <item>
         <title>Bootstrapping</title>
         <author>danielmutanda</author>
         <link>https://padlet.com/danielmutanda/prot6r45tbg8ekir/wish/1992242636</link>
         <description><![CDATA[<div>Bootstrap is a versatile resampling technique that can be used to generate inferences such as standard errors and confidence intervals. A bootstrap treats our observed sample as if it were an assumed infinite population and it mimics the process of randomly sampling from the population.</div>]]></description>
         <enclosure url="" />
         <pubDate>2022-01-14 13:00:56 UTC</pubDate>
         <guid>https://padlet.com/danielmutanda/prot6r45tbg8ekir/wish/1992242636</guid>
      </item>
      <item>
         <title>Multiple Imputation with Auxiliary variables</title>
         <author>danielmutanda</author>
         <link>https://padlet.com/danielmutanda/prot6r45tbg8ekir/wish/1992376403</link>
         <description><![CDATA[<div>When handling missing data under the assumption of MAR, Collins, Schafer, &amp; Kam (2001) recommend that researchers should include many auxiliary variables in their model. This is because if more auxiliary variables are included, it is less likely that the variables causing missingness and the analytic variables with missing data will be excluded. If this exclusion were to occur, it would go against the <em>conditional independence assumption</em> posited in MAR (<em>conditional independence between R and unobserved Y</em>) and induce bias (Collins, Schafer, &amp; Kam,2001).<br><br>However, the inclusion of an auxiliary variable can induce MNAR if the missing dependant variable and an unobserved variable are initially independent of each other but become dependent within strata of an added auxiliary variable (Thoemmes, &amp; Rose, 2014).<br><br>Therefore, it is recommended that variables that only predict missingness should not be used as auxiliary variables, this is because if a variable predicts missingness but not the variables being imputed, including it in the imputation model may not affect the imputation (Jakobsen et al., 2017; Carpenter and Smuk, 2021).<br><br>Moreover, rather than selecting auxiliary variables based on statistical evidence when data are already collected, it is recommended that when researchers design studies they should purposely select covariates that might explain underlying mechanisms which might cause missingness (Thoemmes, &amp; Rose, 2014).&nbsp;</div>]]></description>
         <enclosure url="" />
         <pubDate>2022-01-14 14:17:32 UTC</pubDate>
         <guid>https://padlet.com/danielmutanda/prot6r45tbg8ekir/wish/1992376403</guid>
      </item>
      <item>
         <title>Can you ignore missing data or is missingness below 5%?</title>
         <author>danielmutanda</author>
         <link>https://padlet.com/danielmutanda/prot6r45tbg8ekir/wish/2005367089</link>
         <description><![CDATA[]]></description>
         <enclosure url="" />
         <pubDate>2022-01-21 16:31:59 UTC</pubDate>
         <guid>https://padlet.com/danielmutanda/prot6r45tbg8ekir/wish/2005367089</guid>
      </item>
      <item>
         <title>Yes</title>
         <author>danielmutanda</author>
         <link>https://padlet.com/danielmutanda/prot6r45tbg8ekir/wish/2005369136</link>
         <description><![CDATA[]]></description>
         <enclosure url="" />
         <pubDate>2022-01-21 16:33:04 UTC</pubDate>
         <guid>https://padlet.com/danielmutanda/prot6r45tbg8ekir/wish/2005369136</guid>
      </item>
      <item>
         <title>Yes</title>
         <author>danielmutanda</author>
         <link>https://padlet.com/danielmutanda/prot6r45tbg8ekir/wish/2005369442</link>
         <description><![CDATA[]]></description>
         <enclosure url="" />
         <pubDate>2022-01-21 16:33:14 UTC</pubDate>
         <guid>https://padlet.com/danielmutanda/prot6r45tbg8ekir/wish/2005369442</guid>
      </item>
      <item>
         <title>Yes</title>
         <author>danielmutanda</author>
         <link>https://padlet.com/danielmutanda/prot6r45tbg8ekir/wish/2005369868</link>
         <description><![CDATA[]]></description>
         <enclosure url="" />
         <pubDate>2022-01-21 16:33:26 UTC</pubDate>
         <guid>https://padlet.com/danielmutanda/prot6r45tbg8ekir/wish/2005369868</guid>
      </item>
      <item>
         <title>No</title>
         <author>danielmutanda</author>
         <link>https://padlet.com/danielmutanda/prot6r45tbg8ekir/wish/2005370120</link>
         <description><![CDATA[]]></description>
         <enclosure url="" />
         <pubDate>2022-01-21 16:33:35 UTC</pubDate>
         <guid>https://padlet.com/danielmutanda/prot6r45tbg8ekir/wish/2005370120</guid>
      </item>
      <item>
         <title>No</title>
         <author>danielmutanda</author>
         <link>https://padlet.com/danielmutanda/prot6r45tbg8ekir/wish/2005370610</link>
         <description><![CDATA[]]></description>
         <enclosure url="" />
         <pubDate>2022-01-21 16:33:48 UTC</pubDate>
         <guid>https://padlet.com/danielmutanda/prot6r45tbg8ekir/wish/2005370610</guid>
      </item>
      <item>
         <title>No</title>
         <author>danielmutanda</author>
         <link>https://padlet.com/danielmutanda/prot6r45tbg8ekir/wish/2005370922</link>
         <description><![CDATA[]]></description>
         <enclosure url="" />
         <pubDate>2022-01-21 16:33:56 UTC</pubDate>
         <guid>https://padlet.com/danielmutanda/prot6r45tbg8ekir/wish/2005370922</guid>
      </item>
      <item>
         <title>Is missingness substantial (over 50%)?</title>
         <author>danielmutanda</author>
         <link>https://padlet.com/danielmutanda/prot6r45tbg8ekir/wish/2005382039</link>
         <description><![CDATA[]]></description>
         <enclosure url="" />
         <pubDate>2022-01-21 16:39:12 UTC</pubDate>
         <guid>https://padlet.com/danielmutanda/prot6r45tbg8ekir/wish/2005382039</guid>
      </item>
      <item>
         <title>Complete Case Analysis (unbiased)</title>
         <author>danielmutanda</author>
         <link>https://padlet.com/danielmutanda/prot6r45tbg8ekir/wish/2005401992</link>
         <description><![CDATA[<div>Complete case analysis only includes participants for which we have no missing data on the variables of interest (excluding those with missing data). Unbiased estimates can be produced if potential impact of the missing data is negligible (i.e. missingness is &lt; 5%).</div>]]></description>
         <enclosure url="" />
         <pubDate>2022-01-21 16:49:00 UTC</pubDate>
         <guid>https://padlet.com/danielmutanda/prot6r45tbg8ekir/wish/2005401992</guid>
      </item>
      <item>
         <title>MCAR</title>
         <author>danielmutanda</author>
         <link>https://padlet.com/danielmutanda/prot6r45tbg8ekir/wish/2005410406</link>
         <description><![CDATA[]]></description>
         <enclosure url="" />
         <pubDate>2022-01-21 16:53:24 UTC</pubDate>
         <guid>https://padlet.com/danielmutanda/prot6r45tbg8ekir/wish/2005410406</guid>
      </item>
      <item>
         <title>MAR</title>
         <author>danielmutanda</author>
         <link>https://padlet.com/danielmutanda/prot6r45tbg8ekir/wish/2005411311</link>
         <description><![CDATA[]]></description>
         <enclosure url="" />
         <pubDate>2022-01-21 16:53:55 UTC</pubDate>
         <guid>https://padlet.com/danielmutanda/prot6r45tbg8ekir/wish/2005411311</guid>
      </item>
      <item>
         <title>MNAR</title>
         <author>danielmutanda</author>
         <link>https://padlet.com/danielmutanda/prot6r45tbg8ekir/wish/2005411805</link>
         <description><![CDATA[]]></description>
         <enclosure url="" />
         <pubDate>2022-01-21 16:54:05 UTC</pubDate>
         <guid>https://padlet.com/danielmutanda/prot6r45tbg8ekir/wish/2005411805</guid>
      </item>
      <item>
         <title>Complete Case Analysis (potential bias)</title>
         <author>danielmutanda</author>
         <link>https://padlet.com/danielmutanda/prot6r45tbg8ekir/wish/2005439625</link>
         <description><![CDATA[<div>If auxiliary variables have been identified and there are large amounts of missingness on the primary outcome, then a complete case analysis may provide biased results (Garson, 2015). In such cases with large missingness, the results can be discussed to generate hypotheses but are not confirmative.&nbsp;</div>]]></description>
         <enclosure url="" />
         <pubDate>2022-01-21 17:07:51 UTC</pubDate>
         <guid>https://padlet.com/danielmutanda/prot6r45tbg8ekir/wish/2005439625</guid>
      </item>
      <item>
         <title>MAR</title>
         <author>danielmutanda</author>
         <link>https://padlet.com/danielmutanda/prot6r45tbg8ekir/wish/2008481330</link>
         <description><![CDATA[]]></description>
         <enclosure url="" />
         <pubDate>2022-01-24 10:35:06 UTC</pubDate>
         <guid>https://padlet.com/danielmutanda/prot6r45tbg8ekir/wish/2008481330</guid>
      </item>
      <item>
         <title>Auxiliary variables identified?</title>
         <author>danielmutanda</author>
         <link>https://padlet.com/danielmutanda/prot6r45tbg8ekir/wish/2008495835</link>
         <description><![CDATA[]]></description>
         <enclosure url="" />
         <pubDate>2022-01-24 10:45:13 UTC</pubDate>
         <guid>https://padlet.com/danielmutanda/prot6r45tbg8ekir/wish/2008495835</guid>
      </item>
      <item>
         <title>Links and resources</title>
         <author>danielmutanda</author>
         <link>https://padlet.com/danielmutanda/prot6r45tbg8ekir/wish/2008589409</link>
         <description><![CDATA[<div><a href="https://bmcmedresmethodol.biomedcentral.com/track/pdf/10.1186/s12874-017-0442-1.pdf#page=10">When and how should multiple imputation be used for handling missing data in randomised clinical trials – a practical guide with flowcharts (biomedcentral.com)</a><a href="https://groups.google.com/g/missing-data/c/YMlbEtTfsSQ"><br></a><br>Full Information Maximum Likelihood<a href="https://groups.google.com/g/missing-data/c/YMlbEtTfsSQ"><br></a><a href="https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3701793/">Principled missing data methods for researchers (nih.gov)</a><br><br>Auxiliary variables and Bias <br><a href="https://groups.google.com/g/missing-data/c/YMlbEtTfsSQ">Auxiliary variables that predict missingness</a><br><a href="http://jakewestfall.org/misc/thoemmes2014.pdf">HMBR_A_931799_O (jakewestfall.org)</a></div>]]></description>
         <enclosure url="" />
         <pubDate>2022-01-24 11:54:24 UTC</pubDate>
         <guid>https://padlet.com/danielmutanda/prot6r45tbg8ekir/wish/2008589409</guid>
      </item>
   </channel>
</rss>
