Notice: Function _load_textdomain_just_in_time was called incorrectly. Translation loading for the wordpress-seo domain was triggered too early. This is usually an indicator for some code in the plugin or theme running too early. Translations should be loaded at the init action or later. Please see Debugging in WordPress for more information. (This message was added in version 6.7.0.) in /home2/pollicyo/public_html/archive/wp-includes/functions.php on line 6114

Warning: Cannot modify header information - headers already sent by (output started at /home2/pollicyo/public_html/archive/wp-includes/functions.php:6114) in /home2/pollicyo/public_html/archive/wp-includes/rest-api/class-wp-rest-server.php on line 1893

Warning: Cannot modify header information - headers already sent by (output started at /home2/pollicyo/public_html/archive/wp-includes/functions.php:6114) in /home2/pollicyo/public_html/archive/wp-includes/rest-api/class-wp-rest-server.php on line 1893

Warning: Cannot modify header information - headers already sent by (output started at /home2/pollicyo/public_html/archive/wp-includes/functions.php:6114) in /home2/pollicyo/public_html/archive/wp-includes/rest-api/class-wp-rest-server.php on line 1893

Warning: Cannot modify header information - headers already sent by (output started at /home2/pollicyo/public_html/archive/wp-includes/functions.php:6114) in /home2/pollicyo/public_html/archive/wp-includes/rest-api/class-wp-rest-server.php on line 1893

Warning: Cannot modify header information - headers already sent by (output started at /home2/pollicyo/public_html/archive/wp-includes/functions.php:6114) in /home2/pollicyo/public_html/archive/wp-includes/rest-api/class-wp-rest-server.php on line 1893

Warning: Cannot modify header information - headers already sent by (output started at /home2/pollicyo/public_html/archive/wp-includes/functions.php:6114) in /home2/pollicyo/public_html/archive/wp-includes/rest-api/class-wp-rest-server.php on line 1893

Warning: Cannot modify header information - headers already sent by (output started at /home2/pollicyo/public_html/archive/wp-includes/functions.php:6114) in /home2/pollicyo/public_html/archive/wp-includes/rest-api/class-wp-rest-server.php on line 1893

Warning: Cannot modify header information - headers already sent by (output started at /home2/pollicyo/public_html/archive/wp-includes/functions.php:6114) in /home2/pollicyo/public_html/archive/wp-includes/rest-api/class-wp-rest-server.php on line 1893
{"id":956,"date":"2020-02-27T08:10:25","date_gmt":"2020-02-27T08:10:25","guid":{"rendered":"https:\/\/archive.pollicy.org\/?p=956"},"modified":"2024-07-21T23:42:12","modified_gmt":"2024-07-21T23:42:12","slug":"data-misinformation-pt2-a-step-by-step-guide-to-identify-and-avoid-statistical-fallacies","status":"publish","type":"post","link":"https:\/\/archive.pollicy.org\/2020\/02\/data-misinformation-pt2-a-step-by-step-guide-to-identify-and-avoid-statistical-fallacies\/","title":{"rendered":"Data & Misinformation Pt2: A Step-by-step Guide to Identify and Avoid Statistical Fallacies"},"content":{"rendered":"

Statistics refers to a branch of mathematics concerned with the collection, classification, analysis, and interpretation of numerical facts and deals mostly with drawing inferences about a population characteristic basing on a sample. This field is important because the world we live in today relies on data to formulate policies, make decisions, carry out planning, and plenty more. For example, If the government of Uganda wants to give out free mosquito nets to people living in a certain area in Uganda, they would need to know how many households exist there and how many people are living in each household, their monthly income to gauge who can and can\u2019t afford a mosquito net, etc. If the government implemented a free education policy, they would need to know how many students benefited from that policy and how that free education impacted their lives in order to measure or monitor progress. All this shows that data and statistics are crucial and that we need them to make sense of society as a whole and measure progress in an objective way. If we don\u2019t have this data, how can we measure our day-to-day problems and be able to fix them?<\/p>\n

But when it comes to numbers, you should be skeptical. Like Mark Twain once stated \u201cThere are three kinds of lies: lies, damned lies, and statistics<\/strong>\u201d, one needs to be able to tell which numbers are reliable and which ones aren\u2019t. This can be achieved by educating yourself on how to spot bad statistics. This blog is here to help you achieve that and explores how misinformation can occur with statistics.<\/p>\n

Misleading statistics<\/strong><\/h2>\n

This is by far the most common form in which misinformation can occur. It often happens when a user makes up a statistic that is spread out from one person to another. For example, let\u2019s say a certain minister while speaking to a group of people at a public event states that\u00a068 percent of Ugandans are engaged in subsistence farming<\/a>. Uganda has a population of over 40 million people and this would translate to over 27 million people being engaged in subsistence farming. Yet in an actual sense, it\u2019s not 68 percent of Ugandans that are engaged in subsistence but rather 68 percent of those farmers engaged in agriculture in Uganda. Their number would be around 6 million.<\/p>\n

Such claims made publicly by public officials should always be checked before being passed on to others in news reports to avoid misinformation. In Uganda,\u00a0PesaCheck<\/a>\u00a0is an initiative that aims at addressing such scenarios by verifying these claims and publishing correct information.<\/p>\n

Neglecting the baseline<\/strong><\/h2>\n

This is another case of using statistics to spread misleading information or misinformation and lies to users. This can happen when a user compares statistics of different areas without considering the underlying factors. For example when comparing crime rates between two or more districts in Uganda, one can claim that district A has a higher crime rate than district B simply because district A recorded more criminal cases than district B. Ignoring the fact that there could be underlying factors that could be causing this such as; district A having higher population and therefore more cases compared to B. A simple fix for such a scenario would be to obtain the crime per capita figure instead.<\/p>\n

\n
\n
\"\"<\/div>\n<\/div>\n<\/figure>\n

So all these underlying details should always be included when reporting your statistics and users should always look out for such information to avoid getting misinformed.<\/p>\n

Selection\/Sampling Bias<\/strong><\/h2>\n

Just because 80 percent of the people who responded to your poll selected president B doesn\u2019t mean that the same percentage of people will choose the same candidate elsewhere. As a user, always lookout for more information about how a private study was conducted before using their data or reporting their statistics. We have all seen election results where one candidate dominates one region and the other dominates another and this can apply to any study.<\/p>\n

\n
\n
\n
\n
\n
<\/div>\n

\"\"<\/p>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/figure>\n

However, it is also wrong to consider that statistics from polls or private studies are unreliable because you were not contacted to provide your answer or because not everybody is included in a poll or study. It is not impossible to get a view of millions of people by just interviewing a few thousand of them. A poll if perfectly unbiased with truthful answers obtained can always provide meaningful and reliable results\/statistics.<\/p>\n

Data Communication and Data Visualization<\/strong><\/h2>\n

Data visualization exists to help communicate data findings in an easily understandable format that many users of different backgrounds can easily digest. But these can be very misleading and at times can be used to spread misinformation directly or indirectly. As a data visualization user, always lookout for the following features which must be part of the data visualization you are viewing;<\/p>\n