From 485a6b868cdf077cceea11cc34a67689838f8993 Mon Sep 17 00:00:00 2001
From: Karl Broman

Search terms
Subject classifications
arXiv has a set of 155 subject classifications, searchable with the
prefix cat:. The aRxiv package contains a dataset
-arxiv_cats containing the abbreviations and descriptions.
-Here are the statistics categories.
+## [1] category field subfield short_description long_description
-## <0 rows> (or 0-length row.names)
+arxiv_cats containing the categories, short and long
+descriptions, as well as field (and, for Physics, subfield). Here are
+the column names.
## [1] "category" "field" "subfield" "short_description" "long_description"
+Here are the statistics categories.
+
+##     category short_description
+## 150  stat.AP      Applications
+## 151  stat.CO       Computation
+## 152  stat.ME       Methodology
+## 153  stat.ML  Machine Learning
+## 154  stat.OT  Other Statistics
+## 155  stat.TH Statistics Theory
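A listing like the one above could be produced by subsetting the dataset on its category column. The helper below is a hypothetical sketch, not code taken from the vignette; only the column names `category` and `short_description` come from the output shown earlier.

```r
# Hypothetical helper (not part of aRxiv): keep only the rows whose
# category starts with "stat." and show two of the columns.
stat_categories <- function(cats) {
  keep <- grepl("^stat\\.", cats$category)
  cats[keep, c("category", "short_description")]
}

# Usage, assuming the aRxiv package is installed:
# stat_categories(aRxiv::arxiv_cats)
```

Matching on the `stat.` prefix rather than on row numbers keeps the subset stable if categories are added to the dataset.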
To search these categories, you need to either include the full term
or use the * wildcard.
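As a rough illustration of the three kinds of query this describes, the sketch below builds a bare field name, a full category term, and a wildcard term. The query strings are assumptions chosen for illustration, and `count_queries()` is a hypothetical wrapper around `arxiv_count()`; it needs network access, so it is only defined here, not called.

```r
# Illustrative query strings using the "cat:" prefix (assumed examples)
queries <- c(
  bare     = "cat:stat",     # field name alone, without a subfield
  full     = "cat:stat.AP",  # full category term
  wildcard = "cat:stat*"     # wildcard covering every stat.* category
)

# Hypothetical wrapper: count the matches for each query, one at a time.
# Requires the aRxiv package and a connection to the arXiv API.
count_queries <- function(qs) {
  vapply(qs, function(q) as.numeric(aRxiv::arxiv_count(q)), numeric(1))
}

# Usage:
# count_queries(queries)
```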
## [1] 0
-
+
## [1] 17577
-
+
## [1] 114647
2007-10-18 12:25:34. You can use * as a
wildcard for the times. For example, to get all manuscripts with initial
submission on 2007-10-18:
-
+
## [1] 196
But you can’t use the wildcard within the dates.
-
+## [1] 0
To get a count of all manuscripts with original submission in 2007,
use a date range, like [from_date TO to_date]. (If you give
a partial date, it’s treated as the earliest date/time that matches, and
the range appears to be up to but not including the second
date/time.)
## [1] 55749
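A date-range query of the kind described above might be built as in the following sketch. `year_query()` is a hypothetical helper, and the `submittedDate` prefix is an assumption about the search term for the original submission date; neither is stated in this excerpt.

```r
# Hypothetical helper (not part of aRxiv): build a query covering one
# calendar year of original submissions.
# ASSUMPTION: "submittedDate" is the prefix for the submission date.
year_query <- function(year) {
  year <- as.integer(year)
  # Partial dates: the lower bound is the start of `year`, and the upper
  # bound is the start of the following year.
  sprintf("submittedDate:[%d TO %d]", year, year + 1L)
}

query_2007 <- year_query(2007)

# Counting the matches needs the aRxiv package and network access:
# aRxiv::arxiv_count(query_2007)
```

Using the start of the following year as the upper bound relies on the range appearing to exclude the second date/time, as noted above.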
The output of arxiv_search() is a data frame with the
following columns.
## [1] "id" "submitted" "updated" "title" "abstract"
## [6] "authors" "affiliations" "link_abstract" "link_pdf" "link_doi"
## [11] "comment" "journal_ref" "doi" "primary_category" "categories"
@@ -551,9 +560,9 @@ cat:
but are searchable with a general search.
-
+
## [1] 0
-
+
## [1] 870
id_list.
Here’s an example, to sort the results by the date the manuscripts were last updated, in descending order.
-res <- arxiv_search('au:"Peter Hall" AND ti:deconvolution',
-                    sort_by="updated", ascending=FALSE)
-res$updated
+res <- arxiv_search('au:"Peter Hall" AND ti:deconvolution',
+                    sort_by="updated", ascending=FALSE)
+res$updated
## [1] "2010-03-01 11:33:37" "2008-10-27 14:27:52" "2008-04-04 12:19:05" "2007-10-18 12:25:34"
"aRxiv_delay" (in seconds). The default is 3 seconds.
To reduce the delay to 1 second, use:
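The code for this step did not survive in this excerpt. A minimal sketch, assuming the option is set through base R's `options()` with the `aRxiv_delay` name given above:

```r
# Set the delay between arXiv API requests to 1 second.
# options() returns the previous values, so they can be restored later
# with options(old_opts).
old_opts <- options(aRxiv_delay = 1)

# Check the current setting
getOption("aRxiv_delay")
```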
-
+Don’t run searches in parallel (e.g., via the parallel package); you may be locked out of the arXiv API.