Provides functions for reading and parsing the internal content of EPUB e-book files. E-book formatting varies so widely across titles that no single function can parse an arbitrary collection of e-books into one consistently formatted output containing the same variables for every book. The EPUB parsing functionality in this package is intended for relatively general application to arbitrary e-books. However, poorly formatted e-books, or e-books with highly uncommon formatting, may not work with this package. Text is read 'as is'. Any additional text cleaning is left to the user's discretion, for example with functions from packages such as 'tm' or 'qdap'.
Version: | 0.4.0 |
Depends: | R (≥ 3.5.0) |
Imports: | xml2, xslt, magrittr, dplyr, purrr, tidyr |
Suggests: | testthat, knitr, rmarkdown, lintr, covr, readr |
Published: | 2018-05-30 |
Author: | Matthew Leonawicz [aut, cre] |
Maintainer: | Matthew Leonawicz <mfleonawicz at alaska.edu> |
BugReports: | https://github.com/leonawicz/epubr/issues |
License: | MIT + file LICENSE |
URL: | https://github.com/leonawicz/epubr |
NeedsCompilation: | no |
Materials: | README NEWS |
CRAN checks: | epubr results |
Reference manual: | epubr.pdf |
Vignettes: | Introduction to epubr |
Package source: | epubr_0.4.0.tar.gz |
Windows binaries: | r-devel: epubr_0.4.0.zip, r-release: epubr_0.4.0.zip, r-oldrel: not available |
OS X binaries: | r-release: epubr_0.4.0.tgz, r-oldrel: not available |
Please use the canonical form https://CRAN.R-project.org/package=epubr to link to this page.