Skip to content

Project to better understand population movement drivers in the Northern Triangle (Honduras, El Salvador & Guatemala).

Notifications You must be signed in to change notification settings

unhcr-americas/LAPOP-SMA

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Latin American Public Opinion Project (LAPOP) Analysis

Project to better understand population movement drivers in the Northern Triangle (Honduras, El Salvador & Guatemala). The Analysis Executive summary is here

The analysis is based on longitudinal analysis of Latin American Public Opinion Project (LAPOP) survey from 2004 to 2018 (i.e. 8 survey dataset per country) and complements the exploration already available for Honduras, El Salvador & Guatemala.

The AmericasBarometer is a periodic study of 34 countries in the Western Hemisphere, with stratified nationally representative samples drawn in each country, a common questionnaire core, and country-specific modules. It is the only scientifically rigorous comparative survey of democratic values and behaviors that covers all independent countries in North, Central, and South America, as well as a significant number of countries in the Caribbean. The Americas Barometer measures attitudes, evaluations, experiences, and behavior in the Americas using national probability samples of voting-age adults. Standardizing methods and a common core questionnaire permit valid comparisons across countries and over time on topics including the economy, rule of law, state capacity, trust in institutions, individual values, corruption, security, and more.

Each country survey is implemented based on a national probability design. In some cases, oversamples are collected to allow precise analysis of opinion within sub-national regions. These data were supplied by the Latin American Public Opinion Project at Vanderbilt University, which takes no responsibility for any interpretation of the data.

This projects build from analysis of other shared repositories such as LAPOP-SMA from Craig Jolley, Snr Data Scientist @ USAID, lapop-dashboard from Vanderbilt Data Science Team or LAPOP-predictive-models

The scripts from those various repository were re-organized in an R package in order to ease analysis reproducibility. This package also aims at building capacity of humanitarian data analyst in operations

Key questions

The main focus of the study is organized around 3 linked questions that can influence population movement mitigation program:

Research Programme Design
What influence the intention to move to another country: push & pull factor? How different measurement of perception are correlated: protection-related push factor vs economic pull factor? Are economic conditions sufficient to motivate the intention to move? Resources allocation to protection intervention vs economic support
Are there specific profiles that are intending vs those who do not intend to move? Household targeting for cash intervention
Can we identify displacement hot-spots? What make those hot-spots specific? Geographic targeting and area-based approach for service delivery design

Getting Started

These are the steps you should take to begin using this repo.

Prerequisites

You will need the current version of R Statistical Language & RStudio to run this code.

Install

The package is still under development - if you want to fix- improve - contribute - please fork it in github and install locally the package.

devtools::install()

In order to use the package, the following packages will also be installed tidyverse, haven, labelled, sjmisc, assertr, janitor, rlang.

You will be then able to use all functions with the prefix: AmericasBarometer:: Once installed, you need to download and reshape the data with

## Load data from remote site and save them locally
AmericasBarometer::get_dataLAPOP()
## merge all dataset together
AmericasBarometer::merge_dataLAPOP()
## generate a mapping data dictionnary - also based on https://github.com/unhcr-americas/LAPOP-SMA/blob/master/data/LAPOP_Dictionnary.xlsx
AmericasBarometer::merge_dicoLAPOP()

Vignettes

Vignettes are R Markdown file (.rmd) that runs interactively stored int the vignettes folder. It does not require any additional installation, as it is a built-in function in R Studio. For an in-depth tutorial, visit this R Notebook Guide. There are also cheat sheets available for how to use R Markdown.

Once data have been downloaded, you can then knit the vignettes and the full site with

devtools::document()
## Generate data crunching vignette
AmericasBarometer::build_analysis_package(form = "form.xlsx")
pkgdown::build_site()

This will generate the full package documentation in the docs folder.

The final product is the Analysis Executive summary is here.

The article in the vignettes document the exploratory analysis:

  • Checking survey waves content

  • Exploratory data crunching using koboloadeR & a rebuilt data dictionary

  • Building a predictive model of intention to migrate, as well as Out of fear of being a crime victim and victim of bribery

  • Exploring profiles of respondents that intend to migrate through Cluster Analysis

  • Creating Composite indicators to synthetise information (Authoritarianism index, Community Activity index, Sympathy with government critics index, Fear index, Transparency index, Trust in Government & Wealth index). In technical terms, this implies to perform for each of those indices missing value imputation and principal component analysis. Then for each indices, a prediction model is created, spatial heterogeneity and time trends are explored.

  • Verifying correlations between all indices, creating maps of index and checking spatial heterogeneity

Package functions

The package includes functions stored in the R folder:

  1. Pull survey data - download all of the merged files that exist for each country individually from the LAPOP Datasets. Because of the dataset licence, data are not included in the package

  2. Map and merge all survey from different year and country. This implies:

  • Adding Wave Column: Creates a column that contains the correct wave for datasets.
  • Creating Unique ID: Makes a unique ID for merged country files that follows the unique ID of 2016/17 files.
  • Lengthening and Joining: Lengthens countries into tidy format & Joins the questions and category columns by column_name
  • Finding Common Questions: Flag questions that are asked across all countries
  1. Generate a series of indices, calculated for each region based on common variable to explore trends and statistical clusters. Each of the indices are described in a dedicated vignette.

  2. Generate a series of pre-built charts to display the results and get ground ready for Joint Data Interpretation by subject mater expert

If you change the functions, regenerate the documentation before rebuilding the package

devtools::document()
devtools::check(document = FALSE)

Reference dataset / code book

The package includes for convenience the question categories and labels from different versions, based on Merge_Codebook_v1.0

The package also include a geographic dataset to ease geo-visualization & analysis

About

Project to better understand population movement drivers in the Northern Triangle (Honduras, El Salvador & Guatemala).

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • HTML 88.0%
  • R 12.0%