pdftools: Text Extraction, Rendering and Converting of PDF Documents

Utilities based on 'libpoppler' for extracting text, fonts, attachments and metadata from a PDF file. Also supports high quality rendering of PDF documents info PNG, JPEG, TIFF format, or into raw bitmap vectors for further processing in R.

Version: 1.6
Imports: Rcpp (≥ 0.12.12)
LinkingTo: Rcpp
Suggests: jpeg, png, webp, testthat
Published: 2018-03-27
Author: Jeroen Ooms ORCID iD [aut, cre]
Maintainer: Jeroen Ooms <jeroen at berkeley.edu>
BugReports: https://github.com/ropensci/pdftools/issues
License: MIT + file LICENSE
URL: https://ropensci.org/blog/2016/03/01/pdftools-and-jeroen (blog) https://github.com/ropensci/pdftools#readme (devel) https://poppler.freedesktop.org (upstream)
NeedsCompilation: yes
SystemRequirements: Poppler C++ API: libpoppler-cpp-dev (deb) or poppler-cpp-devel (rpm). The unit tests also require the 'poppler-data' package (rpm/deb)
Materials: NEWS
CRAN checks: pdftools results

Downloads:

Reference manual: pdftools.pdf
Package source: pdftools_1.6.tar.gz
Windows binaries: r-devel: pdftools_1.6.zip, r-release: pdftools_1.6.zip, r-oldrel: pdftools_1.6.zip
OS X binaries: r-release: pdftools_1.6.tgz, r-oldrel: pdftools_1.6.tgz
Old sources: pdftools archive

Reverse dependencies:

Reverse depends: pdfsearch
Reverse imports: crminer, findR, fulltext, rcoreoa, readtext, tesseract, textreadr
Reverse suggests: goldi, gridGraphics, hunspell, magick, spelling, tm

Linking:

Please use the canonical form https://CRAN.R-project.org/package=pdftools to link to this page.