rcorpora: A Collection of Small Text Corpora of Interesting Data

A collection of small text corpora of interesting data. It contains all data sets from https://github.com/dariusk/corpora. Some examples: names of animals: birds, dinosaurs, dogs; foods: beer categories, pizza toppings; geography: English towns, rivers, oceans; humans: authors, US presidents, occupations; science: elements, planets; words: adjectives, verbs, proverbs, US president quotes.

Version: 1.1.1
Imports: jsonlite, utils
Published: 2015-07-13
Author: Darius Kazemi, Matthew Rothenberg, Karl Swedberg, Matthew Hokanson, Nathan Lachenmyer, Aaron Marriner, Mark Sample, Casey Kolderup, Nathaniel Mitchell, Daniel D. Beck, Mike Nowak, Ryan Freebern, Ross Barclay, Ross Binden, Justin Alford, Cole Willsea, Andrew Gorman, Javier Arce, Patrick Rodriguez, Liam Cooke, Will Hankinson, K. Adam White, Garrett Miller, Zac Moody, Jordan Killpack, Brian Jones, Greg Borenstein, Noah Swartz, Nathan Black, Russell Horton, Mark Wunsch, Kay Belardinelli, Colin Mitchell, Michael Dewberry, Joe Mahoney
Maintainer: Gabor Csardi <csardi.gabor at gmail.com>
BugReports: https://github.com/gaborcsardi/rcorpora/issues
License: CC0
URL: https://github.com/gaborcsardi/rcorpora
NeedsCompilation: no
Materials: NEWS
CRAN checks: rcorpora results

Downloads:

Reference manual: rcorpora.pdf
Package source: rcorpora_1.1.1.tar.gz
Windows binaries: r-devel: rcorpora_1.1.1.zip, r-release: rcorpora_1.1.1.zip, r-oldrel: rcorpora_1.1.1.zip
OS X Snow Leopard binaries: r-release: rcorpora_1.1.1.tgz, r-oldrel: rcorpora_1.0.1.tgz
OS X Mavericks binaries: r-release: rcorpora_1.1.1.tgz
Old sources: rcorpora archive