ColOpenData is a package designed to access curated and wrangled Colombian demographic, geospatial, climate and population projections data, retrieved from various open Colombian data sources. The package addresses the challenge of scattered Colombian data across multiple web sources by providing functions that enable users to select and load desired datasets without the need for extensive data acquisition processes. Additionally, the tidy data structure offered for demographic and climate data facilitates analysis and visualization.
ColOpenData is developed at Universidad de Los Andes as part of the Epiverse-TRACE program.
You can install the CRAN version of ColOpenData with:
install.packages("ColOpenData")
You can also install the development version of ColOpenData from GitHub with:
# install.packages("pak")
pak::pak("epiverse-trace/ColOpenData")
ColOpenData contains data from two public data sources: Departamento Administrativo Nacional de Estadística (DANE) and Instituto de Hidrología, Meteorología y Estudios Ambientales (IDEAM). The available data is divided into four categories:
- Demographic: Demographic and socioeconomic data come from the National Population and Dwelling Census (CNPV) of 2018, the most recent census available to date. The information is organized around three questions: How many are we? Where are we? How do we live?
- Geospatial: This data is retrieved from the National Geostatistical Framework (MGN), which includes maps and a summarized version of the 2018 CNPV, aggregated to spatial geometries. The data is available at several aggregation levels: Blocks, Urban and Rural Sections, Urban and Rural Sectors, Urban Areas, Municipalities and Departments.
- Climate: Climate data is recovered from backup information provided by IDEAM. It contains historical data from the first station in the country (January 1, 1920) until May 31, 2023, and includes temperature, precipitation, sunshine duration and wind direction, among other variables.
- Population projections: This data contains population projections and back projections from 1950 to 2070, including the post-COVID-19 update, which was calculated from the results of the 2018 CNPV.
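As a rough illustration of how one of these categories might be explored, the sketch below asks the package for its catalogue of demographic datasets. The function name `list_datasets()` and its `module` and `language` arguments are assumptions recalled from the package reference, not something stated in this README, so check the documentation before relying on them. The call is guarded so the script still runs when the package is missing or the call fails.

```r
# Minimal sketch: browse the demographic catalogue, if the package is available.
pkg <- "ColOpenData"
catalogue <- NULL

if (requireNamespace(pkg, quietly = TRUE)) {
  # ASSUMPTION: list_datasets() and its arguments (module, language) are
  # taken from memory of the package reference; verify them in the docs.
  catalogue <- tryCatch(
    ColOpenData::list_datasets(module = "demographic", language = "EN"),
    error = function(e) {
      message("Could not list datasets: ", conditionMessage(e))
      NULL
    }
  )
}

if (is.null(catalogue)) {
  message("No catalogue retrieved; install ColOpenData and check its reference.")
} else {
  # Show only the first rows; the catalogue may be long.
  print(utils::head(catalogue))
}
```

If the call succeeds, the printed catalogue should show which dataset names are available. Those names would then be passed to the package's download functions, which are documented in the package reference.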
Documentation for each module is available in the package vignettes.
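One way to find the vignettes and functions shipped with an installed copy of the package is sketched below. It uses only base R and `utils` helpers and assumes no ColOpenData function names. The variables `exported` and `vignette_names` are illustrative.

```r
# Minimal sketch: discover what an installed ColOpenData offers.
pkg <- "ColOpenData"

if (requireNamespace(pkg, quietly = TRUE)) {
  # Names of all functions and objects exported by the package.
  exported <- sort(getNamespaceExports(pkg))
  # Names of the vignettes installed with the package (may be empty).
  vignette_names <- unname(utils::vignette(package = pkg)$results[, "Item"])
} else {
  exported <- character(0)
  vignette_names <- character(0)
  message("ColOpenData is not installed; nothing to list.")
}

print(exported)
print(vignette_names)
```

Any name printed in `vignette_names` can then be opened with `vignette("<name>", package = "ColOpenData")`.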
Similar R packages exist for other countries and regions, allowing users to download census, geospatial and climate data:
- cancensus: Canada
- censobr: Brazil
- chilemapas: Chile
- geobr: Brazil
- geoAr: Argentina
- geouy: Uruguay
- tidycensus: US
- geofi: Finland
- climate
ColOpenData stores and distributes open data originally obtained from Departamento Administrativo Nacional de Estadística (DANE) and Instituto de Hidrología, Meteorología y Estudios Ambientales (IDEAM). The data retrieved from these sources has been processed, transformed, and adapted to meet the functional requirements of this package. These modifications may change the structure, format, or content of the original data, so the data provided by this package does not represent the official datasets as maintained by IDEAM or DANE. The development of this package is completely independent of these institutions, and neither IDEAM, DANE, nor any other Colombian government body endorses, sponsors, or is involved in the creation or maintenance of this package. Consequently, they bear no responsibility for the data as presented within this package. Furthermore, the authors of ColOpenData assume no liability for any outcomes resulting from the use or misuse of the data provided. Users are solely responsible for how they use the data and for any conclusions drawn, analyses performed, or actions taken based on it. This package and its authors make no claims to represent or act on behalf of IDEAM, DANE, or any other official entity.
This package is currently stable, as defined by the RECON software lifecycle: it is functional, documented and tested. However, it may still change over time.
Contributions are welcome via pull requests.
Please note that the ColOpenData project is released with a Contributor Code of Conduct. By contributing to this project, you agree to abide by its terms.
This work is part of the TRACE-LAC research project funded by the International Development Research Centre (IDRC), Ottawa, Canada [109848-001-].