'Microsoft Word' 'docx' files provide an 'XML' structure that is fairly straightforward to navigate, especially when it applies to 'Word' tables and comments. Tools are provided to determine table count/structure, comment count and also to extract/clean tables and comments from 'Microsoft Word' 'docx' documents. There is also nascent support for '.doc' files.
Version: | 0.6.3 |
Depends: | R (≥ 3.6.0) |
Imports: | tools, xml2, purrr, dplyr, utils, httr, magrittr |
Suggests: | testthat, covr |
Published: | 2020-06-13 |
Author: | Bob Rudis |
Maintainer: | Bob Rudis <bob at rud.is> |
BugReports: | https://gitlab.com/hrbrmstr/docxtractr/issues |
License: | MIT + file LICENSE |
URL: | http://gitlab.com/hrbrmstr/docxtractr |
NeedsCompilation: | no |
SystemRequirements: | LibreOffice (<https://www.libreoffice.org/>) required to extract data from .doc and .ppt[x] files. |
Materials: | NEWS |
CRAN checks: | docxtractr results |
Reference manual: | docxtractr.pdf |
Package source: | docxtractr_0.6.3.tar.gz |
Windows binaries: | r-devel: docxtractr_0.6.3.zip, r-release: docxtractr_0.6.3.zip, r-oldrel: docxtractr_0.6.3.zip |
macOS binaries: | r-release: docxtractr_0.6.3.tgz, r-oldrel: docxtractr_0.6.3.tgz |
Old sources: | docxtractr archive |
Please use the canonical form https://CRAN.R-project.org/package=docxtractr to link to this page.