- New function
occ_download_sql()
for downloading occurrence data using SQL queries. (#752)
occ_download_cached()
bug fixed. (#748)
- New article GBIF SQL Downloads
- Added many missing
occ_search()
terms. (#698) - New function
occ_download_describe()
for getting information about download formats. (#721)
- Added
constituentKey
toname_lookup()
. (#729) - Added support for
gbifId
downloads (#711)
check_inputs()
bug fixed. (#706)
- New article Effectively using occ_search
- Guidance for reversing WKT winding order. (#724)
gbif_citation()
datasetKey methods no longer supported (#716)- "axe" feature in
occ_data()
is no longer supported. (#718) occ_data()
is soft deprecated and supported for legacy reasons only, and will no longer add new features.
There have been many additions for accessing dataset metadata.
-
dataset_export()
downloads all of the results of adataset_search()
. -
New functions for getting dataset metadata from a datasetkey (uuid) :
dataset_get()
,dataset_process()
,dataset_networks()
,dataset_constituents()
,dataset_comment()
,dataset_contact()
,dataset_endpoint()
,dataset_identifier()
,dataset_machinetag()
,dataset_tag()
,dataset_metrics()
. -
New function for getting more obscure dataset metadata, such as machineTags:
dataset()
. -
New functions for listing dataset metadata :
dataset_noendpoint()
,dataset_duplicate()
. -
dataset_doi()
gets dataset metadata from the dataset's doi.
- Error message improvements for
occ_count()
. (#686)
- New article Getting Dataset Metadata From GBIF
-
There are no longer static data files in rgbif. This data is better fetched fresh from the appropriate endpoints. (#690) (#688)
-
datasets()
is soft deprecated, since the interface was overloaded and confusing. See functional replacements above.
- rgbif has a new logo. (#679)
map_fetch()
now returns a base map as amagick::magick-image
. This allows for the creation of high quality images from the GBIF maps API. (#675)occ_download()
terms added to key lookup. (#661) (#589)pred_default()
is anocc_download()
pred function that allows users to easily filter out commonly unwanted occurrence records. (#611)
-
Stream error fixed ("HTTP/2 stream 15 was not closed cleanly before end of the underlying stream"). Now
map_fetch()
,occ_data()
,occ_search()
, andocc_download_wait()
havecurlopts = list(http_version=2)
, which fixes the error. This might need to be the default setting for the whole package. (#656) -
name_suggest()
now gives a warning at prevents setting thelimit
> 100, since this is the GBIF API max. (#657)
New article Creating maps from occurrences, which explains how to use map_fetch()
.
occ_issues()
is now deprecated, since it is difficult to maintain, and not widely used. (#651)
- Fixes test that was causing errors on CRAN.
occ_count()
parametertype
is now deprecated and will no longer work correctly. Please seeocc_count_country()
,occ_count_pub_country()
,occ_count_year()
,occ_count_basis_of_record()
for replacements. (#622)
occ_count()
parametersgeoreferenced
,type
,date
,to
,from
are no longer supported and not guaranteed to work correctly. (#622)occ_facet()
andcount_facet()
are now deprecated useocc_count(facet="x")
instead.
lit_search()
now supports searching the GBIF literature API. (#591)occ_count()
now supports almost allocc_search()
queries. (#622)occ_count()
now supports the facets interface throughocc_count(facet="x")
. (#622)organizations()
(aka publishers) now supports the use of getting lists of publishers bycountry
. (#606)occ_download()
andocc_search()
now support downloading and getting occurrences a certain distance from known country/area centroids via the parameterdistanceFromCentroidInMeters
. (#594)
occ_search()
now supports more multi-valued parameters. (#617)- Removed dependencies on
randgeo
andconditionz
. (#624) (#625)
New article explaining occ_count()
changes and new features Getting Occurrence Counts From GBIF.
name_backbone_checklist()
now acceptsstrict=TRUE
, meaning that only non-fuzzy matches are returned. (#565)name_backbone_checklist()
now accepts default values for high taxonomy, such as kingdom, phylum, family, ect. (#515)name_backbone_checklist()
now returns a columnis_alternative
whenverbose=TRUE
, which lets the user know if a name was originally considered to be an alternative choice by the name matcher. (#515)
- Updated README to be more inviting to new users (#574)
- Added data quality section to article Getting Occurrence Data From GBIF. (#575)
- removed
sp
andrgeos
dependencies. (#578)
name_usage
now has the ability to fetch iucn red list categories usingdata=iucnRedListCategory
. (#547)
name_backbone_checklist
updated definition ofverbose
argument. (#564)- "Too many choices" warning added to article Working With Taxonomic Names. (#536)
dataset_gridded
bug fixed when inputting only one non-gridded dataset. (#546)
- New CRAN checks badge URL. (#555)
- Update min vcr requirement to (>= 1.2.0). (#559)
- Updated r-lib actions to v2 (#566)
- Added missing search parameters for
occ_data()
andocc_search()
(#530) - Added missing
occ_download()
terms to key lookup (#541) - Support for identifying "gridded datasets" with experimental API using
dataset_gridded()
(#516) - Look up the datasets in a GBIF network with
network_constituents()
(#527) - Added support for using GBIF experimental reverse geocoding API
gbif_geocode()
(#521)
networks()
is deprecated and callednetwork()
instead. (#527)parsenames()
is deprecated and calledname_parse()
for better alignment with othername_*
functions. (#504)
occ_search
fixed bug related to networkKey in the column names (#524)
- Removing
wellknown
dependency and switching towk
(#512)
name_backbone_checklist()
: bug fix related two square brackets in url (#509)
name_backbone_checklist()
: bug fixes (#501) (#505)
There is a new rgbif maintainer: John Waller.
derived_dataset()
: New function to register a cleaned or modified dataset on GBIF for citation. (#467)name_backbone_checklist()
: New function that takes a list, vector, or data.frame of scientific names and asynchronously matches them to the backbone. (#475)pred_isnull()
: New predicate function that includes NULL values from a column in the download. (#489)occ_download.print()
: Now prints out much more information including a DOI and citation. (#494)
gbif_citation.gbif()
: it is no longer considered best practice to generate a citation fromocc_search()
orocc_data()
. We recommendocc_download()
orderived_dataset()
instead. (#494)
occ_download_wait()
andocc_download_meta()
: now accept a class character download key directly. The keys does do not need to be class "occ_download". (#487)name_backbone()
: now returns new columns "verbatim_name", "verbatim_genus" ect. that the user has supplied. This makes it easier for the user to track what has been matched. The verbose argument also has been un-retired. Ifverbose=TRUE
, more results will be returned in a single data.frame. (#475)gbif_citation()
: will now accept a download key directly.occ_download_get()
: Does not throw an error if the data is already present andoverwrite=FALSE
, it will just give a warning and return the already present dataset. This allows users to runocc_download_get(key) %>% occ_download_import()
multiple times without re-downloading the same file withoverwrite=TRUE
.download_predicate_dsl()
: "publishingOrg" now added as a download key. (#496)key_lkup
now includes GBIF-style uppercase keys as well. Sopred("TAXON_KEY",212)
andpred("taxonKey",212)
will both work.
Wrote new articles highlighting new features and encouraging the use of occ_download()
over occ_search()
.
New articles:
- Citing GBIF Mediated Data
- Set Up Your GBIF Username and Password
- Getting Occurrence Data From GBIF
- Downloading A Long Species List
- Working With Taxonomic Names
occ_download_import()
: fixed bug related to select argument. (#479)map_fetch()
: fixed bug related tosp::CRS
(#497)
- typo in download predicate functions fixed -
mulitpoint
->multipoint
(#460) thanks @damianooldoni for catching that - added three new predicate keys:
stateProvince
(#458),gadm
(#462), andoccurrenceStatus
(#465)
- add two new occurrence issues:
FOOTPRINT_SRS_INVALID
andFOOTPRINT_WKT_INVALID
(#454) occ_download_import()
docs: more information ondata.table::fread
parameters and particular ones that would be useful to sort out data read issues (#461)
- fix
occ_download_get()
: downloaded files used to have a certain content type in response header we checked for, but its changed at least once even in successful responses, so that step has been removed (#464) - fix
occ_download_import()
: country code for Namibia isNA
- this was turning into the R missing valueNA
- now fixed (#463)
- in occurrence download predicate builder checks, to better help users, give the name of the key that fails upon failure instead of just the string 'key' (#450)
- occurrence download predicates: new key
coordinateUncertaintyInMeters
added, e.g. usage:pred_lt("coordinateUncertaintyInMeters",10000)
(#449) pred_and()
andpred_or()
slight change: now required that more than one predicate is passed to each of these functions because it doesn't make sense to do anand
oror
predicate with only one predicate (#452)- fix for use of
pred_not(pred_notnull())
(#452)
- add a new occurrence issue (
TAXON_MATCH_AGGREGATE
) and a new name issue (BACKBONE_MATCH_AGGREGATE
) (#453)
- remove geoaxe references in man-roxygen template doc files - not using pkg anymore here and that pkg is cran archived too (#448)
- remove package wicket - use package wellknown instead - no user facing changes related to this (#447)
- remove package geoaxe (to be archived on CRAN soon) - use package sf instead (#447)
- fix to download predicate function
pred_not()
: it was not constructing the query correctly, fixed now. user facing change as well: it now expects a predicate to be passed, and only a single predicate as GBIF not predicate only accepts one predicate (#446)
- Add new occurrence issue
DIFFERENT_OWNER_INSTITUTION
(#444) - re-record all test fixtures
- fix bug in
occ_search()
(#443)
- Documentation: clarify for
occ_search()
andocc_data()
what parameters accept many values and which do not; in addition, we clarify which parameters accept multiple values in the same HTTP request, and those that accept multiple values but apply each in separate HTTP requests. See also?many-values
manual file (#369) gbif_issues()
gains 9 new occurrence issues (#435)- for
occ_search()
andocc_data()
,basisOfRecord
parameter now supports multiple values, both in one request and in different requests, depending on input format (see "Multiple values passed to a parameter" section in?occ_search
) (#437) - remove vignettes from cran to avoid cran checks - still available on our docs site (#438)
occ_download_get()
: GBIF slightly altered download behavior - we now explicitly follow any redirects to get a download (#439)print.occ_download_meta
(used when you runocc_download_meta()
) was printingNA
for number of results found if no results were ready yet - now prints0
instead ofNA
(#440)
count_facet()
fixes: fixed internal fxn forcount_facet
for parsing results, was dropping values for facets; added assertions to check parameter types input by user for the fxn; changed so that keys and basisofrecord can be passed together (#436)
- added two new occurrence issues to
gbif_issues()
:GEOREFERENCED_DATE_INVALID
andGEOREFERENCED_DATE_UNLIKELY
(#430)
- fixed an error in
occ_data()
caused by GBIF adding a new field of data to the output of/occurrence/search/
: gadm. cleaned up internals ofocc_data()
to drop gadm, and other fields that are complex and take time to parse (useocc_search()
if you want all the data fields) (#427) gbif_names()
fix: was ending up with invalid URLs to GBIF species pages because we had taxon keys with leading spaces somehow. now all leading and trailing spaces in taxon keys removed before making URLs (#429)
gbif_issues()
changes: three new occurrence issues added; one name issue removed that's deprecated (#423)gbif_citation()
rights field was empty unless pulling from a downloaded file; now fill in withlicense
key; also a fix for when occurrence key passed to the function (#424)establishmentMeans
now supported inocc_download
/pred
(#420)
- fix for
occ_download_get()
: response content-type header changed recently, fixed (#422)
- finally delete code originally extracted from
plyr::rbind.fill
- usedata.table::rbindlist
in all cases (#417) - fix failing test on cran for
dataset_search()
(#418) - fix xd refs note on cran (non-file package anchored links) for curl pkg function (#419)
occ_download_cancel_staged()
fix: was broken cause we were indexing to a column in a table with[,"key"]
(#416)
- Many functions (
occ_search
,occ_get
,name_usage
,name_lookup
,name_suggest
,name_backbone
, anddataset_search
) have areturn
parameter to toggle what is returned from the function call. To simplify rgbif maintenance, we've deprecated thereturn
parameter. We've left it in each of the functions, but it no longer does anything, other than raising a warning if used. This means that function calls to these functions now always return the same data structure, making it easier to reason about for the user, as well as for us developers trying to make sure the package works as expected under a variety of conditions. If you have been using thereturn
parameter, do the same function call as before, but now index to the output you need. This is a breaking change, thus the major version bump (#413)
- new function
occ_download_cached()
, which takes the same input asocc_download()
, but instead of starting a query, it checks if you've recently made the same request (with configureable settings for what "recent" means). This can save time when you're doing occurrence download requests that you may have done in the recent past (#308)
- configured package to be able to use two different base urls,
api.gbif-uat.org
andapi.gbif.org
. We have only used the latter previously, but now can configure rgbif to use the former, mostly for testing purposes (#398) occ_download_import()
gainsencoding
parameter that is passed down todata.table::fread
to make it very clear that encoding can be configured (even though you could have before via...
) (#414)
- fix tibble construction (#412)
- max records you can return for
/occurrence/search
route is now 100,000 (used inocc_data()
andocc_search()
). updated docs throughout accordingly (#405) - improved docs in
occ_download_queue()
for how we determine when a job is done. see new section "When is a job done?" (#409) - print methods
print.occ_download_prep
andprint.occ_download
improved. previously well-known text strings were printed in their entirety. now they are handled to only print so many characters; also applies to any download predicate string that's long (#407) occ_download_get()
now supports using a progress bar by passing inhttr::progress()
(#402)occ_data()
andocc_search()
gain two new parameters:recordedByID
andidentifiedByID
(#403)
- fix in
occ_download_queue
: an emptyocc_download_meta()
lead to problems; now removing anyNULL
's from a list ofocc_download_meta()
outputs before further work (#408) - fix in
occ_download_queue
: we were not accounting for job status "cancelled" (#409) occ_download_import()
fix:fill
parameter was set toTRUE
by default, changed toFALSE
. improved docs for this fxn on passing down parameters todata.table::fread
(#404)
- add a section Download status to the
?downloads
manual file listing all the different download status states a download can have and what they mean (#390) - fix
gbif_issues
/gbif_issues_lookup
: added four missing occurrence issues to the package (COORDINATE_PRECISION_INVALID, COORDINATE_UNCERTAINTY_METERS_INVALID, INDIVIDUAL_COUNT_INVALID, and INTERPRETATION_ERROR) (#400) - doing real tests now for
occ_download()
via vcr (#396)
- fix
name_lookup()
: we were attempting to rearrange columns when no results found, leading to an error (#399)
- the
spellCheck
parameter has been removed from the occurrence routes; thus, theocc_spellcheck()
function is now defunct - and the parameterspellCheck
has been removed fromocc_data()
andocc_search()
(#397)
- docs fix for
occ_data()
: remove...
parameter definition as it wasn't used in the function (#394)
- download predicate fxns fix: "within" wasnt being handled properly (#393) thanks @damianooldoni
- The download query user interface for
occ_download()
has changed in a breaking fashion (thus the major version bump). After installation, see?download_predicate_dsl
. Much more complex queries are now possible withocc_download()
. TL;DR: you now construct queries with functions likepred("taxonKey", 3119195)
rather than passing in strings liketaxonKey = 3119195
, andpred_gt("elevation", 5000)
instead of"elevation > 5000"
(#362) - gains new function
occ_download_wait()
to re-runocc_download_meta()
until the download is ready - kinda likeocc_download_queue()
but for a single download (#389) occ_download_dataset_activity()
gains pagination parameterslimit
andstart
to paginate through results (#382)gbif_citation()
now works with the output ofocc_data()
in addition to the other existing inputs it accepts (#392)
- typo fix in the geometry section of the
occ_download()
manual file (#387) - vignettes fixes (#391)
gbif_citation()
tests needed preserve body bytes for vcr (#384)- fix to
occ_count()
andcount_facet()
: isGeoreferenced/georeferenced variable needed booleans converted to lowercase before being sent to GBIF (#385) (#386)
- gains new function
mvt_fetch()
for fetching Map Vector Tiles (MVT). mvt used to be an option inmap_fetch()
, but we only returned raw bytes for that option. Withmvt_fetch()
we now leverage theprotolite
package, which parses MVT files, to give back an sf object (#373) thanks to @jeroen for the protolite work to make this work - associated with above,
map_fetch()
loses theformat = ".mvt"
option; and thus now only returns aRasterLayer
occ_issues()
andname_issues()
reworked. Both now use the same underlying internal logic, with occ_issues pulling metadata specfic to occurrence issues and name_issues pulling metadata specific to name issues. name_issues used to only be a data.frame of name issues, but can now be used similarly to occ_issues; you can pass the output ofname_usage()
to name_issues to filter/parse name results by their associated name issues. Associated with this, new functiongbif_issues_lookup
can be used to lookup either occurrence or name issues by their full name or code (#363) (#364)
- fix examples and tests that had WKT in the wrong winding order (#361)
- parsing GBIF issues in the output of
name_usage()
wasn't working (#328) (#363) (#364) name_lookup()
gains an additional parametersissue
for filtering name results by name issues (#335) (#363) (#364)- fixed definitions of
x
,y
,z
parameters inmap_fetch()
manual file (#375) - added examples to
gbif_citation()
manual file for accessing many citations (#379) - fixed a test for
occ_download_queue()
(#365) name_*
function outpus have changed, so be aware if you're using those functions
- fixed issue with
map_fetch()
: when srs wasEPSG:3857
, the extent we set was incorrectly set asraster::extent(-180, 180, -85.1, 85.1)
. Now the extent israster::extent(-20037508, 20037508, -20037508, 20037508
(#366) (#367) thanks @dmcglinn for reporting and @mdsumner for fixing! - fix for Windows platforms for
gbif_citation()
forocc_download_get
objects. we weren't correctly creating the path to a file on windows (#359) - fix to
print.gbif_data
(#370) (#371) occ_download()
was erroring with a useless error when users try to use the fxn with the same parameter input types asocc_search
/occ_data
; when this happens now there is a useful error message (#381)- fix to
occ_download()
: whentype = "in"
was used, we weren't creating the JSON correctly, fixed now (#362)
occ_download()
andocc_download_prep()
gain a new parameterformat
for specifying the type of download. options are DWCA (default), SIMPLE_CSV, or SPECIES_LIST. SIMPLE_CSV and SPECIES_LIST are csv formats, while DWCA is the darwin core format (#352)- now throughout the package you can pass
NA
in addition toNULL
for a missing parameter - both are removed before being sent to GBIF (#351)
- replace
tibble::as_data_frame
/tibble::data_frame
withtibble::as_tibble
(#350) key
andgbifID
in the output ofocc_data
/occ_search
/occ_get
have been changed so that both are character class (strings) to match how GBIF encodes them (#349)- fix some test fixtures to use preserve exact bytes so that cran checks on debian clang devel don't fail (#355)
- fix to
occ_download
: fail with useful message when user does not pass in queries as character class (#347) - fix to
occ_download
: fail with useful message now when user/pwd/email not found or given (#348)
- pkgdown documentation site (#336) (#337) all work done by @peterdesmet
- package gains hex logo (#331) (#332) thanks @peterdesmet
- big change to
elevation()
function: the Google Maps API requires a form of payment up front, and so we've decided to move away from the service.elevation()
now uses the Geonames service https://www.geonames.org/; it does require you to register to get a username, but its a free service. Geonames has a few different data models for elevation and can be chosen in theelevation_model
parameter (#344) (#345) - biggish change to
occ_data()
/occ_search()
output: the data.frame in thedata
slot now always has the first column as the occurrence key (key
), and the second column is now the scientific name (scientificName
). the previously usedname
column still exists in the data.frame, so as not to break any user code, but is simply a duplicate of thescientificName
column. in a future version of this package thename
column will be dropped (#329)
- README gains full list of code contributors and any folks involved in github issues (#339) (#343) thanks @peterdesmet
- update pkg citation, include all authors (#338)
- added more to
occ_search()
/occ_data()
/occ_download()
documentation on WKT (well-known text) with respect to winding order. GBIF requires counter-clockwise winding order; if you submit clockwise winding order WKT toocc_search()
orocc_data()
you should get data back but the WKT is treated as an exclusion, so returns data outside of that shape instead of within it; if you submit clockwise winding order WKT toocc_download()
you will get no data back (#340)
- fix bug in
occ_download()
, was failing in certain cases because of some bad code in an internal functioncatch_err()
(#333) occ_download()
was not returning user name and email in it's print method (#334)occ_issues()
was failing withocc_data()
orocc_search()
input whentype="many"
(i.e., when > 1 thing was passed in) (#341)
- tests that make HTTP requests are now cached via the
vcr
package so do not require an internet connection (#306) (#327) - added name usage issues (similar to occurrence issues) data. in part fixes
name_usage()
problem, more work coming to allow users to use the name issues data like we allow for occurrence issues throughocc_issues()
(#324)
map_fetch()
changes following changes in GBIF maps API: new parameterstaxonKey
,datasetkey
,country
,publishingOrg
,publishingCountry
and removed parameterssearch
andid
; note that this changes how queries work with this function (#319)- added note to
map_fetch()
docs thatstyle
parameter does not necessarily use the style you give it. not sure why (#302) - fixed messaging in
occ_download_queue()
to report an accurate number of jobs being processed; before we were just saying "kicking off first 3 requests" even if there were only 1 or 2 (#312)
- fix to
occ_get()
whenverbatim=TRUE
(#318) elevation()
function now fails better. when the API key was invalid the function did not give an informative message; now it does (#322)
- significant change to
occ_download_queue()
: sleep time between successive calls to check on the status of download requests is now 10 seconds or greater. This shouldn't slow down your use ofocc_download_queue()
much because most requests should take more than the 10 seconds to be prepared (#313) - add tests for download queue method (#315)
- explicitly
@importFrom
fxns used fromlazyeval
package to avoid check note (#316) - remove
reshape2
andmaps
packages from Suggests (#317)
- fix bug in
name_usage()
: we were screwing up parsing of issues column when single taxon keys passed in (#314)
occ_issues()
now works with download data and arbitrary data.frame's (#193)- New downloads queueing tools: gains functions
occ_download_prep()
for preparing a download request without executing it, andocc_download_queue()
for kicking off many download jobs while respecting GBIF's downloads rate limits. See also internal R6 classes for dealing with queuing:DownReq
,GifQueue
. See?occ_download_queue
to get started (#266) (#305) (#311) - New function
map_fetch()
working with the GBIF maps API https://www.gbif.org/developer/maps. See?map_fetch
to get started (#238) (#269) (#284) thanks to @JanLauGe for the work on this name_lookup()
gainsorigin
parameter (#288) (#293) thanks @peterdesmet and @damianooldoniname_lookup()
andname_usage()
gain internal paging - just asocc_search()
/occ_data()
have (#291) (see also #281) thanks @damianooldoni- new import
lazyeval
, and new suggestspng
andraster
occ_search()
/occ_data()
gain parameterskip_validate
(boolean) to skip or not stkip WKT validation by thewicket
package
- removed warnings about parameters that were removed in previous versions of the package (#189)
- add citation file (#189)
- updated
name_usage()
to check params that now only allow 1 value: name, language, datasetKey, rank (#287) occ_count()
losesnubKey
,catalogNumber
, andhostCountry
as those parameters are no longer accepted by GBIF
- fixed bug in
name_usage()
, was screwing something up internally (#286) - fixed bug in
occ_data()
: curl options weren't being passed through (#297) - fixed geometry usage in
occ_search()
/occ_data()
- skipping the wicket validation and constructing WKT by hand from bounding box (if bounding box given) - the validation that wicket does isn't what GBIF wants (#303) - add
fill
parameter toocc_download_import()
to pass on tofill
indata.table::fread
, and setfill=TRUE
as default. (#292) - better failure for
occ_download()
(#300) - fix bug in
occ_download()
in which a singletaxonKey
passed in was failing (#283) name_usage()
was ignoringdatasetKey
anduuid
parameters (#290)
gbifmap()
has been removed, see the packagemapr
for similar functionality andmap_fetch()
in this package to use the GBIF map API (#298)
- Gains new functions
occ_download_datasets
andocc_download_dataset_activity
to list datasets for a download, and list the downloads activity of a dataset (#275) (#276) - Gains a new vignette covering working with GBIF downloads
in
rgbif
(#262)
- Guidance added to docs for downloads functions on length of the request body (#263)
- Changed authentication details (user name, password, email) for downloads to allow any of the options: pass in as arguments, store as R options, store as environment variables (#187)
gbif_citation()
function gains an S3 method for passing the output ofocc_download_meta()
to it. In addition, for downloadsgbif_citation()
now returns a citation for the entire download (including) its DOI, in addition to citations for each dataset (#274) thanks @dnoesgaard
- Fix documentation bug in
occ_count()
:georeferenced
had a misleading description of what the valueFALSE
did (#265) - Fixed bug in
gbifmap()
- was failing in some cases - better error handlingn now (#271) thanks @TomaszSuchan - Fixed
occ_download_cancel_staged()
: it wasn't passing on authentication parameters correctly (#280)
- The GBIF API supports passing in many instances of the same
parameter for some parameters on some routes. Previously we
didn't support this feature, but now we do. See the
?many-values
manual file for details. added docs to individual functions that support this, and added additional tests (#200) (#260) (#261) - We've removed
V8
dependency and replaced with C++ based WKT parser packagewicket
. We still usergeos
for some WKT parsing. rgbif functions that use wicket:gbif_bbox2wkt
,gbif_wkt2bbox
,check_wkt
(#243) httr
replaced withcrul
for HTTP reqeusts. As part of this change, the...
parameter was replaced in most functions bycurlopts
which expects a list. Some functions require a...
parameter for facet inputs, so...
is retained with the addition ofcurltops
parameter. A result of this change is that whereas in the past parameters that were not defined in a function that also had a...
parameter would essentially silently ignore that undefined parameter, but with functions where...
was removed a misspelled or undefined parameter will cause an error with message (#256)
- moved to markdown docs (#258)
- namespacing calls to base R pkgs instead of importing them
- Fixed problem in
occ_download_import()
to allow import of csv type download in addition to darwin core archive. additional change toocc_download_get
to addformat
attribute stating which format (#246) - fix to
occ_download_import
addingfill=TRUE
to thedata.table::fread
call (#257)
occ_dowload
gains new parameterbody
to allow users to pass in JSON or a list for the query instead of passing in statements to...
. See examples in?occ_dowload
.
- Now using
tibble
for compact data.frame output forocc_download_import
instead of bespoke internal solution (#240) - Moved all GBIF API requests to use
https
instead ofhttp
(#244) - Improved print method for
occ_download_meta
- Fix to
occ_download
to structure query correctly whentype=within
andgeometry
used because the structure is slightly different than when not usinggeometry
(#242) - Fixed
occ_download
to allowOR
queries for many values of a parameter, e.g.,taxonKey=2475470,2480946
will be queried correctly now as essentiallytaxonKey=2475470
ortaxonKey=2480946
(#245)
- Fixed a bug in
parsenames()
caused by some slots in the list beingNULL
(#237) - Fixed some failing tests:
occ_facet()
tests were failing due to changes in GBIF API (#239) - Fixes to
gbif_oai_get_records()
for slight changes inoai
dependency pkg (#236)
occ_search()
now has faceted search. This feature is not inocc_data()
as that function focuses on getting occurrence data quickly, so will not do get facet data. This means that a new slot is available in the output object fromocc_search()
, namelyfacets
. Note thatrgbif
has had faceted search for the species search route (name_lookup()
) and the registry search route (dataset_search()
) for quite a while. (#215)- new function (
occ_facet()
) to facilitate retrieving only facet data, so no occurrence data is retrieved. (#215) (#229) - A suite of new parameters added to
occ_search()
andocc_data()
following addition the GBIF search API:subgenusKey
,repatriated
,phylumKey
,kingdomKey
,classKey
,orderKey
,familyKey
,genusKey
,establishmentMeans
,protocol
,license
,organismId
,publishingOrg
,stateProvince
,waterBody
,locality
(#216) (#224) - New parameter
spellCheck
added toocc_search()
andocc_data()
that ifTRUE
spell checks anything passed to thesearch
parameter (same asq
parameter on GBIF API; which is a full text search) (#227) - New function
occ_spellcheck
to spell check search terms, returnsTRUE
if no spelling problems, or a list with info on suggestions if not. - Both
occ_search()
andocc_data()
now have ability to support queries wherelimit=0
, which for one should be possible and not fail as we did previously, and second, this makes it so that you can do faceted searches (See above) and not have to wait for occurrence records to be returned. (#222) MULTIPOLYGON
well known text features now supported in the GBIF API. Previously, you could not querygeometry
with more than one polygon (POLYGON
), but now you can. (#222)
- Improved docs for
occ_count()
, especially for the set of allowed parameter options that the GBIF count API supports occ_count()
gains new parametertypeStatus
to indicate the specimen type status.- When no results found, the
data
slot now returnsNULL
instead of a character string
- Fixes to
gbif_photos()
: 1) Mapbox URLs to their JS and CSS assets were out of date, and API key needed. 2) In RStudio, thetable
view was outputting errors due to serving files onlocalhost:<port>
instead of simply opening the file; fixed now by checking platform and using simple open file command appropriate for the OS. (#228) (#235)
- Now using
tibble
in most of the package when the output is a data.frame (#204) - New vignette Taxonomic Names for discussing some common names problems users may run into, and some strategies for dealing with taxonomic names when using GBIF (#208) (#209)
- Replaced
is()
withinherits()
, no longer importingmethods()
(#219) - Improved docs for registry functions. Not all options were listed
for the
data
parameter, now they are (#210) - Fixed documentation error in
gbifmap()
man file (#212) thanks to @rossmounce
- Fixed bug in internal parser within
occ_download()
, in which strings to parse were not being parsed correctly if spaces weren't in the right place, should be more robust now, and added tests (#217). Came from https://discuss.ropensci.org/t/rgbif-using-geometry-in-occ-download/395 - The parameter
type
was being silently ignored in a number of registry functions. fixed that. (#211)
occ_data()
andocc_search()
gain ability to more flexibly deal with inputs to thegeometry
parameter. Previously, long WKT strings passed toocc_search()
orocc_data()
would fail because URIs can only be so long. Another option is to use the download API (see?downloads
). This version adds the ability to choose what to do with long WKT strings via thegeom_big
parameter:asis
(same as previous version),bbox
which detects if a WKT sting is likely too long, and creates a bounding box from the WKT string then once data is retrieved, clips the result to the original WKT string;axe
uses thegeoaxe
package to chop up the input WKT polygon into many, with toggles in the new parametersgeom_size
andgeom_n
. (#197) (#199)- As part of this change, when >1 geometry value passed, or if
geom_big="axe"
, then named elements of the output get namesgeom1
,geom2
,geom3
, etc. instead of the input WKT strings - this is because WKT strings can be very long, and make for very awkward named access to elements. The original WKT strings can still be accessed viaattr(result, "args")$geometry
- code tidying throughout the package
- Fix parsing bug in
name_usage()
function, see commit e88cf01cc11cb238d44222346eaeff001c0c637e - Fix to tests to use new
testthat
fxn names, e.g.,expect_gt()
instead ofexpect_more_than()
- Fix to
occ_download()
to parse error correctly when empty body passed from GBIF (#202)
- New function
occ_data()
- its primary purpose to perform faster data requests. Whereasocc_search()
gives you lots of data, including taxonomic hierarchies and media records,occ_data()
only gives occurrence data. (#190)
- Replaced
XML
withxml2
(#192) - Speed ups to the following functions due to use of
data.table::rbindlist()
for fast list to data.frame coercion:name_lookup()
,name_backbone()
,name_suggest()
,name_usage()
, andparsenames()
(#191) - Changes to
httr
usage to comply with changes inhttr >= v1.1.0
: now setting encoding explicitly toUTF-8
and parsing all data manually, using the internal functionfunction(x) content(x, "text", encoding = "UTF-8")
(#195)
- Fix to internal function
move_col()
to not fail on fields that don't exist. Was failing sometimes when no latitude or longitude columns were returned. (#196)
- New set of functions (
gbif_oai_*()
) for working with GBIF registry OAI-PMH service. Now importingoai
package to make working with GBIF's OAI-PMH service easier (#183) - Added code of conduct (#180)
- Now sending user-agent header with all requests from this
package to GBIF's servers indicating what version of rgbif
and that it's an ropensci package. Looks like
r-curl/0.9.4 httr/1.0.0 rOpenSci(rgbif/0.9.0)
, with whatever versions of each package you're using. We also pass a user-agent string with the headerX-USER-AGENT
in case theuseragent
header gets stripped somewhere along the line (#185) - New function
gbif_citation()
helps get citations for datasets eith using the occurrence search API viaocc_search()
or the downloads API viaocc_downlad()
(#178) (#179)
- Using
importFrom
instead ofimport
in all cases now. - Parameter
collectorName
changed torecordedBy
(#184)
- Fix to
occ_download_meta()
print method to handle 1 or more predicate results (#186) - Fix to
occ_issues()
to work withreturn=data
andreturn=all
occ_search()
output (#188)
- Updated
terraformer.js
javascript code included in the package along with an update in that codebase (#156) - The
email
parameter nowNULL
by default in the functionocc_download()
, so that if not provided or not set in options, then function fails. (#173) - Additional explanation added to the
?downloads
help file. - Added internal checks to
elevation()
to check for coordinates that are impossible (e.g., latitude > 90), not complete (e.g., lat given, long not given), or points at0,0
(just warns, doesn't stop). (#176) thanks @luisDVA - General code tidying across package
- A route changed for getting images for a taxon within the
/species
route, fix to functionname_usage()
(#174) - Fix to
occ_search()
to remove a block of code to do synonym checking. This block of code was used if the parameterscientificName
was passed, and checked if the name given was a synonym; if yes, we used the accepted name according to the GBIF backbone taxonomy; if no, we proceeded with the name given by the user. We removed the block of code because the GBIF API now essentially does this behind the scenes server side. See https://github.com/gbif/gbif-api for examples. (#175)
- Additional tests added for
gbif_photos()
andgbif_names()
(#170)
- Fixed a few tests that were not passing on CRAN.
- New set of functions with names
occ_download*()
for working with the GBIF download API. This is the same service as using the GBIF website, but via an API. See?downloads
. (#154) (#167)
- Explicitly import non-base R pkg functions, so importing from
utils
,methods
, andstats
(#166)
- Fixed problem with
httr
v1
where empty list not allowed to pass to thequery
parameter inGET
(#163)
- New functions for the
/enumerations
GBIF API route:enumeration()
andenumeration_country()
. Many parts of the GBIF API make use of enumerations, i.e. controlled vocabularies for specific topics - and are available via these functions. (#152)
elevation()
now requires an API key (#148)- The
V8
package an Import now, used to do WKT read/create with use of the Javascript library Terraformer (http://terraformer.io/). Replaces packagessp
andrgeos
, which are no longer imported (#155) - Changed
occ_search()
parameterspatialIssues
tohasGeospatialIssues
(#151) - Added note to docs about difference between
/search
and/count
services, and how they work. (#150) - Added tests for habitat parameter in
name_lookup()
(#149) - Dropped
plyr
from Imports (#159) - Dropped
stringr
from Imports (#160) - Dropped
maps
andgrid
packages from Imports (#161)
- Looping over records with
limit
andstart
parameters was in some cases resulting in duplicate records returned. Problem fixed. (#157)
- All example moved to
\dontrun
(#139) - README fixes for html (#141)
- Fixed documentation in
occ_search()
to give correct values for default and max limit and start parameters (#145) - Changed internal
GET
helper function to properly pass on error message (#144) - Replaced
assertthat::assert_that()
withstopifnot()
to have one less dependency (#134) - Fixed
occ_search()
to allow ability to query by only publishingCountry, that is, with no other parameters if desired (#137)
- Fixed bug in internal
GET()
helper function to just passNULL
to thequery
parameter when the list of length 0 passed, since it caused requests to fail in some cases. - Fix to
name_lookup()
to force a logical entry for certain parameters - before this fix if the correct logical param was not passed, the GBIF API went with its default parameter (#135) - Fixed bug in
name_backbone()
due to change innamelkupparser()
helper function - fixes parsing for verbose output (#136) - Fixed some broken URLs in
occ_search()
documentation (#140)
- New function
occ_issues()
to subset data fromocc_search()
based on GBIF issues. (#) (#122) - Related to the last bullet, GBIF issues now are returned by default in
occ_search()
results, and are intentionally moved to the beginning of the column order of the data to be more obvious. (#102) occ_search()
now returns all data fields by default. The default setting for thefields
parameter isall
- but can be changed. See?occ_search
- New function
gbif_names()
to view highlighted terms in name results from a call toname_lookup()
. (#114) - New functions:
occ_issues_lookup()
to lookup GBIF issues based on code name or full issue name, andgbif_issues()
to print the entire issues table.
- Completely replaced
RCurl
withhttr
- Completely replaced
RJSONIO
withjsonlite
. Should see slight performance in JSON parsing withjsonlite
. - Default number of records in
occ_search()
now 500; was 25. (#113) - Vignette for old version of GBIF API removed.
- New vignette for cleaning data via GBIF issues added. (#132)
- Functions for working with old GBIF API removed, now defunct. (#116)
- Now better parsing for some functions (
organizations()
,datasets()
,networks()
,nodes()
,installations()
) to data.frames when possible. (#117) - Added further help to warn users when searching on ranges in latitude or longitude in
occ_search()
(#123) callopts
parameter changed to...
throughout all functions. Now pass on options tohttr
as named lists or functions. (#130)- Beware that GBIF data is becoming Darwin Core compliant - so many parameters throughout this package have changed from sentence_case to camelCase.
dataset_search()
anddataset_suggest()
gain new parameterpublishingOrg
- Default for
limit
parameter changed to 100 for dataset functions:dataset_search()
,dataset_suggest()
, anddatasets()
. - Default for
limit
parameter changed to 100 for registry functions:installations()
,networks()
,organizations
, andnodes()
. - Parameter changes in
networks()
:name
,code
,modifiedsince
,startindex
, andmaxresults
gone; new parametersquery
,identifier
,identifierType
,limit
, andstart
- Parameter changes in
nodes()
: new parametersidentifier
,identifierType
,limit
, andstart
occ_search()
failed sometimes on species that were not found. Fixed. (#112)- Added better handling of some server errors to pass on to user. (#115) (#118)
- Fixed incorrect parsing for some cases in
occ_search()
(#119) - Fixed bad parsing on output from
name_lookup()
(#120) - Fixed single map option in
gbif_photos()
that caused map with no data. (#121) - Fixed some parameter names in
name_()
functions according to changes in the GBIF API spec, and fixed documentation to align with GBIF API changes, and added note about maximum limit. (#124) (#127) (#129) Thanks to @willgearty ! - Fixed internals of
occ_search()
so that user can pass in multiple values to theissue
parameter. (#107) - Fixed URL to tutorial on ropensci website (#105) Thanks @fxi !
occ_search()
now has adplyr
like summary output whenreturn='all'
. See?occ_search
for examples. You can still easily access all data, by indexing tometa
,hierarchy
,data
, ormedia
via e.g.,$data
,['data']
, or[['data']]
. (#95)- Media now returned from the GBIF API. Thus, in
occ_search()
, we now return a media slot in the output list by default. - New function
gbif_photos()
to view media files (photos in the wild or of museum specimens). Two options are available,which='map'
creates a single map which presents the image when the user clicks on the point, andwhich='table'
in which a table has one row for each image, presenting the image and an interactive map with the single point. (#88) - Two new packages are imported:
sp
andwhisker
- GBIF updated their API, now at v1. URL endpoints in
rgbif
changed accordingly. (#92) - GBIF switched to using 2-letter country codes. Take note. (#90)
- GBIF switched all parameters to
camelCase
fromunder_score
style - changed accordingly inrgbif
. - Using package custom version of
plyr::compact()
instead of importing fromplyr
. - In
name_lookup()
removedfacet_only
parameter as it doesn't do anything - uselimit=0
instead. Further, added two new slots of output:hierarchy
andnames
(for common/vernacular names) (#96). The output can be determined by user via thereturn
parameter. - In
name_suggest()
, if the fieldhigherClassificationMap
is selected to be returned via thefields
parameter, a list is returned with a data frame, and a list of the hierarchies separately. IfhigherClassificationMap
is not selected, only a data frame is returned. occ_search()
gains new parametersmediatype
andissue
(#93), with detailed list of possible options for theissue
parameter. Gains new examples for searching for images, examples of calls that will throw errors.- Updated the vignette.
- Added better error message to
check_wkt()
. facet_only
parameter removed fromdataset_search()
function as it doesn't do anything - uselimit=0
instead.- Fixed some examples that didn't work correctly.
- Added functions
gbif_bbox2wkt()
andgbif_wkt2bbox()
to convert a bounding box to wkt and a wkt object to a bounding box, respectively. Copied from thespocc
package. Prefixes to fxn names will avoid conflicts. - Now spitting out more informative error messages when WKT strings passed in are not properly formed, either from
rgeos::readWKT
or from the returned response from GBIF.
gbifmap()
was throwing an error because it was looking for two variableslatitude
andlongitude
, which had been changed todecimalLatitude
anddecimalLongitude
, respectively, in other functions in this package. Fixed. (#81)occ_get()
was updated to include changes in the GBIF API for this endpoint. The fix included fixing the parser for verbatim results, seergbif::gbifparser_verbatim
. (#83)- Fixed bugs in
elevation()
- it was expecting column names to be latitude and longitude, whereas inputs from otherrgbif
functions have changed to decimalLatitude and decimalLongitude. - Fixed bug in
count_facet()
introduced b/c GBIF no longer accepts hostCountry or nubKey parameters.
gist()
,stylegeojson()
, andtogeojson()
functions now listed as deprecated. Their functionality moved to thespocc
package (http://cran.r-project.org/web/packages/spocc/index.html). These functions will be removed from this package in a future version. (#82)- Added a quick sanity test for
gbifmap()
. - Added tests for
occ_get()
for whenverbatim=TRUE
, which gives back different data than whenverbatim=FALSE
.
- A number of variables changed names to better follow the Darwin Core standard.
latitude
is nowdecimalLatitude
.longitude
is nowdecimalLongitude
.clazz
is nowclass
. Code in this package changed to accomodate these changes.date
is noweventDate
.georeferenced
is nowhasCoordinate
. Beware of these changes in your own code usingrgbif
- find and replace for these should be easy. - Changed
altitude
parameter inocc_search()
toelevation
- should have beenelevation
the whole time. occ_count()
function with parameter changes:nubKey
parameter in changed totaxonKey
. New parameterprotocol
. ParametercatalogNumber
gone. ParameterhostCountry
gone. These parameters are still in the function definition, but if called they throw a useful warning telling you the correct parameter names. (#76)- Fixed bug in
name_lookup()
function that was labeling facet outputs incorrectly. (#77)
- Better checking and parsing of response data from GBIF: Across all functions, we now check that the response content type is
application/json
, then parse JSON ourselves usingRJSONIO::fromJSON
(instead of httr doing it). - Across all functions, we now return all potential character class columns as character class (instead of factor), by passing
stringsAsFactors = FALSE
to alldata.frame()
calls. - Now using assertthat package in various places to give better error messages when the wrong input is passed to a function.
- Four parameters have name changes in the
occ_search()
function. These parameters are still in the function definition, but if called they throw a useful warning telling you the correct parameter names. (#75) - Updated docs in
name_usage
,name_backbone
,name_lookup
, andname_suggest
functions. sourceId
parameter inname_usage()
function doesn't work so error message is thrown when used.
- New function
check_wkt()
to check that well known text string is the right format. (#68) - New dataset typestatus to look up possible specimen typeStatus values. See #74 for more information.
- GBIF added some new parameters for use in the
occ_search()
function.scientificName
: search for a species by name (instead oftaxonKey
).continent
: search by continent.lastInterpreted
: search by last time GBIF modified the record.recordNumber
: search by the data collector's specimen record number - this is different from the GBIF record number.typeStatus
: search by specimen type status. (#74) - Note that given the new parameters many more options are available for implicit faceted search in which you can pass many values in a vector to do multiple searches like
parameterName = c(x, y, z)
. These parameters are:taxonKey
,scientificName
,datasetKey
,catalogNumber
,collectorName
,geometry
,country
,recordNumber
,search
,institutionCode
,collectionCode
,decimalLatitude
,decimalLongitude
,depth
,year
,typeStatus
,lastInterpreted
, andcontinent
. This isn't faceted search server side - this is just looping your different values of the parameter against the GBIF API. - Range queries are a new feature in the GBIF API. Some parameters in
occ_search()
now support range queries:decimalLatitude
,decimalLongitude
,depth
,elevation
,eventDate
,lastInterpreted
,month
, andyear
. Do a range query for example bydepth=50,100
to ask for occurrences where depth was recorded between 50 and 100 meters. Note that this syntaxdepth=c(50,100)
will perform two separate searches, one fordepth=50
and one fordepth=100
. (#71)
- Changed name of country_codes() function to gbif_country_codes() to avoid conflicts with other packages.
- Replaced sapply() with vapply() throughout the package as it is more robust and can be faster.
- Added a startup message to the package.
- gbifmap() now plots a map with ggplot2::coord_fixed(ratio=1) so that you don't get wonky maps.
- occ_count() now accepts a call to query publishingCountry with a single parameter (country), to list occurrence counts by publishing country.
- occ_get() and occ_search() lose parameter minimal, and in its place gains parameter fields, in which you can request fields='minimal' to get just name, taxon key, lat and long. Or set to 'all' to get all fields, or selection the fields you want by passing in a vector of field names.
- Updated base url for the GIBF parser function parsenames()
- isocodes dataset now with documentation.
- New function count_facet() to do facetted count search, as GBIF doesn't allow faceted searches against the count API.
- New function elevation() to get elevation data for a data.frame of lat/long points, or a list of lat/long points. This function uses the Google Elevation API (https://developers.google.com/maps/documentation/elevation/).
- New function installations() to get metadata on installations.
- Improved handling of limit parameter in occ_search() so that the correct number of occurrences are returned.
- Fixed various tests that were broken.
- Added missing limit argument in datasets() function man file, also function gains start and callopts parameters.
- Data object isocodes gains new column gbif_names, the GBIF specific names for countries.
- Added in deprecation messages throughout package for functions and arguments that are deprecated.
- tests moved to tests/testthat from inst/tests.
- Vignettes now in vignettes/ directory.
- New function dataset_suggest(), a quick autocomplete service that returns up to 20 datasets.
- New function name_backbone() looks up names against the GBIF backbone taxonomy.
- New function name_suggest(), a quick autocomplete service that returns up to 20 name usages.
- New function occ_metadata() to search dataset metadata.
- New function parsenames() that parses taxonomic names and returns their components.
- Added back in functions, and .Rd files, from old version or rgbif that interacts with the old GBIF API.
- Updated vignette to work with new GBIF API and fxns.
- Added functions to interact with the new GBIF API, notably: country_codes(), dataset_metrics(), dataset_search(), datasets(), name_lookup(), gbifmap(), gist(), name_lookup(), name_usage(), networks(), nodes(), occ_count(), occ_get(), occ_search(), organizations(), stylegeojson(), togeojson(). See the README for a crosswalk from old functions to new ones.
- test files moved from inst/tests/ to tests/testthat/
- Removed georeferencedonly parameter - is deprecated in the GBIF API
- Added S3 objects: Output from calls to occurrencelist() and occurrence list_many() now of class gbiflist, and output from calls to densitylist() now of class gbifdens.
- Slight changes to gbifmaps() function.
- url parameter in all functions moved into the function itself as the base GBIF API url doesn't need to be specified by user.
- Vignette added.
- Added function country_codes() to look up 2 character ISO country codes for use in searches.
- Added function occurrencelist_many() to handle searches of many species.
- Added functions togeojson() and stylegeosjon() to convert a data.frame with lat/long columns to geojson file format, and to add styling to data.frames before using togeojson() .
- occurrencelist() and occurrencelist_many() gain argument fixnames, which lets user change species names in output data.frame according to a variety of scenarios.
- taxonsearch() gains argument accepted_status to accept only those names that have a status of accepted. In addition, this function has significant changes, and examples, to improve performance.
- Improved code style, and simplified code in some functions.
- occurrencelist() now handles scientific notation when maxresults are given in that form.
- occurencelist() now can retrieve any number of records; was previously a max of 1000 records.
- Demo "List" was returning incorrect taxon names - corrected now.
- Removed unused parameter 'latlongdf' in occurencelist().
- Changed all functions to use RCurl instead of httr as httr was presenting some problems.
- Two function, capwords and gbifxmlToDataFrame, added with documentation as internal functions.
- Added function density_spplist to get a species list or data.frame of species and their counts for any degree cell.
- Added function densitylist to access to records showing the density of occurrence records from the GBIF Network by one-degree cell.
- Added function gbifmap to make a simple map to visualize GBIF data.
- Added function occurrencecount to count taxon concept records matching a range of filters.
DEPRECATED
- gbifdatause removed, was just a function to return the data sharing agreement from GBIF.
- released to CRAN