The rplos package interacts with the API services of PLoS
(Public Library of Science) Journals. You used to need an API key to work with
this package - that is no longer needed!
This tutorial will go through three use cases to demonstrate the kinds
of things possible in rplos.
install.packages("rplos")
library('rplos')
searchplos is a general search; in this case it searches for the term
Helianthus and returns the DOIs of matching papers.
searchplos(q = "Helianthus", fl = "id", limit = 5)
#> $meta
#> # A tibble: 1 x 2
#> numFound start
#> <int> <int>
#> 1 572 0
#>
#> $data
#> # A tibble: 5 x 1
#> id
#> <chr>
#> 1 10.1371/journal.pone.0198869
#> 2 10.1371/journal.pone.0213065
#> 3 10.1371/journal.pone.0148280
#> 4 10.1371/journal.pone.0111982
#> 5 10.1371/journal.pone.0139188
Get only full article DOIs
searchplos(q = "*:*", fl = 'id', fq = 'doc_type:full', start = 0, limit = 5)
#> $meta
#> # A tibble: 1 x 2
#> numFound start
#> <int> <int>
#> 1 255862 0
#>
#> $data
#> # A tibble: 5 x 1
#> id
#> <chr>
#> 1 10.1371/journal.pone.0155491
#> 2 10.1371/journal.pone.0168631
#> 3 10.1371/journal.pone.0168627
#> 4 10.1371/journal.pone.0184491
#> 5 10.1371/journal.pone.0155489
Get DOIs for only PLoS One articles
searchplos(q = "*:*", fl = 'id', fq = 'journal_key:PLoSONE',
start = 0, limit = 5)
#> $meta
#> # A tibble: 1 x 2
#> numFound start
#> <int> <int>
#> 1 1877872 0
#>
#> $data
#> # A tibble: 5 x 1
#> id
#> <chr>
#> 1 10.1371/journal.pone.0058099/materials_and_methods
#> 2 10.1371/journal.pone.0030394/introduction
#> 3 10.1371/journal.pone.0030394/results_and_discussion
#> 4 10.1371/journal.pone.0002157/materials_and_methods
#> 5 10.1371/journal.pone.0030394/supporting_information
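The ids in this output end in section names such as materials_and_methods, which suggests that without the doc_type:full filter the search also returns individual article sections. As a minimal sketch, assuming a section id is always the article DOI followed by a slash and the section name, the ids can be reduced to their parent article DOIs without another request. The helper name article_doi is hypothetical and not part of rplos.

```r
# Hypothetical helper (not part of rplos): keep only the first two
# slash-separated components of an id, which is assumed to be the article DOI.
article_doi <- function(ids) {
  sub("^([^/]+/[^/]+).*$", "\\1", ids)
}

# ids copied from the output above
ids <- c(
  "10.1371/journal.pone.0058099/materials_and_methods",
  "10.1371/journal.pone.0030394/introduction",
  "10.1371/journal.pone.0030394/results_and_discussion",
  "10.1371/journal.pone.0002157/materials_and_methods",
  "10.1371/journal.pone.0030394/supporting_information"
)

# Distinct parent articles among the returned sections
unique(article_doi(ids))
```

An id that is already a plain article DOI passes through unchanged, so the helper can be applied to mixed results.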
Get DOIs for full articles in PLoS One
searchplos(q = "*:*", fl = 'id',
fq = list('journal_key:PLoSONE', 'doc_type:full'),
start = 0, limit = 5)
#> $meta
#> # A tibble: 1 x 2
#> numFound start
#> <int> <int>
#> 1 217655 0
#>
#> $data
#> # A tibble: 5 x 1
#> id
#> <chr>
#> 1 10.1371/journal.pone.0155491
#> 2 10.1371/journal.pone.0168631
#> 3 10.1371/journal.pone.0168627
#> 4 10.1371/journal.pone.0184491
#> 5 10.1371/journal.pone.0155489
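The start and limit arguments in these calls can also be used to page through a result set that is larger than one request returns. The sketch below only computes the offsets, so it runs without network access. The helper page_starts and the page size of 100 are assumptions for illustration, not part of rplos, and the total of 572 is the numFound value from the Helianthus search earlier.

```r
# Hypothetical helper (not part of rplos): offsets to pass as `start`
# in order to cover `total` hits in chunks of `page_size`.
page_starts <- function(total, page_size) {
  stopifnot(total >= 0, page_size > 0)
  if (total == 0) return(integer(0))
  seq(0, total - 1, by = page_size)
}

starts <- page_starts(total = 572, page_size = 100)
starts

# Each offset would then be used in its own request (needs network access):
# pages <- lapply(starts, function(s) {
#   searchplos(q = "Helianthus", fl = "id", start = s, limit = 100)
# })
```

Reading numFound from the $meta element of a first request is one way to obtain the total before computing the offsets.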
Search for many terms
q <- c('ecology','evolution','science')
lapply(q, function(x) searchplos(x, limit = 2))
#> [[1]]
#> [[1]]$meta
#> # A tibble: 1 x 2
#> numFound start
#> <int> <int>
#> 1 47678 0
#>
#> [[1]]$data
#> # A tibble: 2 x 1
#> id
#> <chr>
#> 1 10.1371/journal.pone.0001248
#> 2 10.1371/journal.pone.0059813
#>
#>
#> [[2]]
#> [[2]]$meta
#> # A tibble: 1 x 2
#> numFound start
#> <int> <int>
#> 1 72477 0
#>
#> [[2]]$data
#> # A tibble: 2 x 1
#> id
#> <chr>
#> 1 10.1371/journal.pbio.2002255
#> 2 10.1371/journal.pbio.0030245
#>
#>
#> [[3]]
#> [[3]]$meta
#> # A tibble: 1 x 2
#> numFound start
#> <int> <int>
#> 1 225263 0
#>
#> [[3]]$data
#> # A tibble: 2 x 1
#> id
#> <chr>
#> 1 10.1371/journal.pone.0202320
#> 2 10.1371/journal.pone.0178778
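Because lapply returns one result per term, the hit counts can be collected into a single table for comparison. The following is a sketch under the assumption that every result has the $meta element with a numFound column, as printed above. The helper count_hits is hypothetical, and the mock list stands in for the real results so the example runs offline.

```r
# Hypothetical helper (not part of rplos): pair each term with the
# numFound value stored in the $meta element of its result.
count_hits <- function(terms, results) {
  data.frame(
    term = terms,
    n_found = vapply(results,
                     function(r) as.integer(r$meta$numFound),
                     integer(1)),
    stringsAsFactors = FALSE
  )
}

terms <- c("ecology", "evolution", "science")

# Mock results mirroring the shape and counts printed above
mock <- list(
  list(meta = data.frame(numFound = 47678L, start = 0L)),
  list(meta = data.frame(numFound = 72477L, start = 0L)),
  list(meta = data.frame(numFound = 225263L, start = 0L))
)

count_hits(terms, mock)
```

With live data, the second argument would be the list returned by the lapply call above.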
A suite of functions was created as light wrappers around searchplos,
as shorthand for searching specific sections of a paper.
plosauthor searches in authors
plosabstract searches in abstracts
plostitle searches in titles
plosfigtabcaps searches in figure and table captions
plossubject searches in subject areas

plosauthor searches across authors, and in this case returns the authors of
the matching papers. The fl parameter determines what is returned.
plosauthor(q = "Eisen", fl = "author", limit = 5)
#> $meta
#> # A tibble: 1 x 2
#> numFound start
#> <int> <int>
#> 1 1020 0
#>
#> $data
#> # A tibble: 5 x 1
#> author
#> <chr>
#> 1 Myungsun Kang,Timothy J Eisen,Ellen A Eisen,Arup K Chakraborty,Herman N …
#> 2 Myungsun Kang,Timothy J Eisen,Ellen A Eisen,Arup K Chakraborty,Herman N …
#> 3 Myungsun Kang,Timothy J Eisen,Ellen A Eisen,Arup K Chakraborty,Herman N …
#> 4 Myungsun Kang,Timothy J Eisen,Ellen A Eisen,Arup K Chakraborty,Herman N …
#> 5 Myungsun Kang,Timothy J Eisen,Ellen A Eisen,Arup K Chakraborty,Herman N …
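In the output above, each row holds all authors of a paper in one comma-separated string. A minimal sketch for picking out the individual authors that match a name, assuming that comma-separated format holds in general. The helper names are hypothetical, and the sample string contains only the names that are fully visible in the truncated output.

```r
# Hypothetical helpers (not part of rplos)

# Split one comma-separated author string into individual names
split_authors <- function(x) {
  trimws(strsplit(x, ",", fixed = TRUE)[[1]])
}

# Keep only the names that contain `pattern` as a literal substring
matching_authors <- function(x, pattern) {
  authors <- split_authors(x)
  authors[grepl(pattern, authors, fixed = TRUE)]
}

# Sample built from the names fully visible in the output above
sample_authors <- "Myungsun Kang,Timothy J Eisen,Ellen A Eisen,Arup K Chakraborty"

matching_authors(sample_authors, "Eisen")
```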
plosabstract searches across abstracts, and in this case returns the id and
title of the matching papers.
plosabstract(q = 'drosophila', fl = 'id,title', limit = 5)
#> $meta
#> # A tibble: 1 x 2
#> numFound start
#> <int> <int>
#> 1 3597 0
#>
#> $data
#> # A tibble: 5 x 2
#> id title
#> <chr> <chr>
#> 1 10.1371/journal.pone… Host Range and Specificity of the Drosophila C Vir…
#> 2 10.1371/journal.pone… A Drosophila model for developmental nicotine expo…
#> 3 10.1371/journal.pone… Exogenous expression of Drp1 plays neuroprotective…
#> 4 10.1371/journal.pone… Reliable Drosophila Body Fat Quantification by a C…
#> 5 10.1371/journal.pone… "Larval Defense against Attack from Parasitoid Was…
plostitle searches across titles, and in this case returns the title and
journal of the matching papers.
plostitle(q = 'drosophila', fl = 'title,journal', limit = 5)
#> $meta
#> # A tibble: 1 x 2
#> numFound start
#> <int> <int>
#> 1 2298 0
#>
#> $data
#> # A tibble: 5 x 2
#> journal title
#> <chr> <chr>
#> 1 PLOS ONE Peptidergic control in a fruit crop pest: The spotted-wing dros…
#> 2 PLOS ONE Nematocytes: Discovery and characterization of a novel anculeat…
#> 3 PLOS ONE Tandem Duplications and the Limits of Natural Selection in Dros…
#> 4 PLoS ONE In Vivo RNAi Rescue in Drosophila melanogaster with Genomic Tra…
#> 5 PLOS ONE The behavior of adult Drosophila in the wild
plosword allows you to search for 1 to K words and visualize the results
as a histogram, comparing the number of matching papers for each word.
out <- plosword(list("monkey", "Helianthus", "sunflower", "protein", "whale"),
  vis = TRUE)
out$table
#> No_Articles Term
#> 1 13216 monkey
#> 2 572 Helianthus
#> 3 1636 sunflower
#> 4 149565 protein
#> 5 1880 whale
out$plot
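The table element is an ordinary data frame, so it can be reordered with base R before reporting. The sketch below rebuilds the table from the values printed above so that it runs offline. With a live result, out$table would take the place of tab.

```r
# Table rebuilt from the printed output above (stands in for out$table)
tab <- data.frame(
  No_Articles = c(13216, 572, 1636, 149565, 1880),
  Term = c("monkey", "Helianthus", "sunflower", "protein", "whale"),
  stringsAsFactors = FALSE
)

# Rank terms from most to fewest matching articles
ranked <- tab[order(tab$No_Articles, decreasing = TRUE), ]
ranked
```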
You can also pass in curl options; in this case, get verbose information on the curl call.
plosword('Helianthus', callopts = list(verbose = TRUE))
#> Number of articles with search term
#> 572
plot_throughtime allows you to search for up to two words and visualize the
results as a line plot, comparing the number of matching articles through
time. The plot is built with the ggplot2 package; only up to two terms are
supported for now.
library("ggplot2")
plot_throughtime(terms = "phylogeny", limit = 200) +
geom_line(size = 2, color = 'black')
See the Faceted and highlighted searches and Full text vignettes for
more rplos help.