sparkwarc: Load WARC Files into Apache Spark

Load WARC (Web ARChive) files into Apache Spark using 'sparklyr'. This allows to read files from the Common Crawl project <>.

Version: 0.1.1
Imports: sparklyr, DBI
Published: 2017-01-13
Author: Javier Luraschi [aut, cre]
Maintainer: Javier Luraschi <javier at>
License: Apache License 2.0
NeedsCompilation: no
Materials: README
CRAN checks: sparkwarc results


Reference manual: sparkwarc.pdf
Package source: sparkwarc_0.1.1.tar.gz
Windows binaries: r-devel:, r-release:, r-oldrel:
OS X El Capitan binaries: r-release: sparkwarc_0.1.1.tgz
OS X Mavericks binaries: r-oldrel: sparkwarc_0.1.1.tgz
Old sources: sparkwarc archive


Please use the canonical form to link to this page.