The aim of the Carob project is to create reproducible workflows that reshape primary agricultural research data from experiments and surveys into a standard format, and to aggregate individual data sets into larger collections that can be used in further research.
We do this by writing an R script for each individual dataset. Feel free to improve these scripts, or provide new ones through a pull request (see below for more info).
Compiled versions of the dataset can be downloaded from carob-data.org and some will eventually be made available on the carob dataverse.
You can also compile your own version by cloning the repo and running
remotes::install_github("reagro/carobiner")
ff <- carobiner::make_carob(path)
where path
is the folder of the cloned repo (e.g. "d:/github/carob"
)
if you use the data, run carobiner::get_citations(data)
to get references for the data set used.
Carob is the Extract, Transform, and Load (ETL) framework supported by CGIAR to support predictive agronomy analytics. Contributions are welcome from anyone, and they can be made via pull-requests. You can also raises issues on this github site. See these preliminary guidelines. A good place to discover new data sets is the Gardian website.