Chapter 9 Data Quality Monitoring Report

If survey teams collect data with Kobo/Ona/MoDa/ODK and regularly send submissions to a server, then analysts can quickly generate a data quality report with Rmarkdown to regularly track survey results and detect problems that may arise in the field with data collection.

The example report below has been generalized to visualize survey progress by geographic areas and calculated results of key CH indicators by geographic areas and by enumerators using the raw data set downloaded from the Kobo/Ona/MoDa/ODK server.

For privacy/data security, the code of the enumerator should be used rather than reporting the enumerators name. If a particular operator needs to be identified or contacted - you can refer to the data collection form to match the enumerator code with the enumerator name.

9.1 Overview of the steps

Download a .sav (spss) version of the dataset from Kobo/Ona/MoDa/ODK - use exampledataEnglish_raw as an example.

Create a folder on your machine (“C:” is recommended but not required) with the name of the survey (i.e. NGA_022021_EFSA) and create a sub-folder “1_RawData” and save your SPSS dataset in the “1_RawData” folder.

Open up example_dqmonitoring_EN and verify the code (below) refers to the correct name of file and folder location.

#set working directory 
setwd("C:/exampledataquality")

#import dataset and call it "dataset"
dataset <- read_sav("1_RawData/exampledataEnglish_raw.sav")

Also, if the variable name of the Enumerators is not EnumName, insert the following code to rename the variable

#standardized enumerator name variable
dataset <- dataset %>% rename(EnumName = OLDNAMEOFENUMERATORVARIABLE) 

For calculations of the Food Consumption Groups - modify the code to use FCSCat21 for 21/35 thresholds or FCSCat28 for 28/42 thresholds as appropriate.

Modify the title and author fields to your survey.

title: "Example Data Quality Monitoring Report"
author: "WFP "

Click the Knit button and the report should generate (it might take a few minutes).

An .html file is produced in the survey folder - you can hover the cursor over the figures for more information.

9.1.1 Video Example

9.2 Data Quality Monitoring Tools

9.2.1 Example data set & Syntax

Here is the zipped file with the raw example data set and syntax:

exampledq_monitoring_EN

9.3 Troubleshooting & Going Further

This report has been designed to work with standardized variable names and indicators illustrated earlier in this guide. If the variable names in the raw data set do not match the code - their will be some adjustment needed.

Experienced R users are free to modify the report to meet their specific needs. Share what you create!