docxtractr: Extract Data Tables and Comments from 'Microsoft' 'Word' Documents

'Microsoft Word' 'docx' files provide an 'XML' structure that is fairly straightforward to navigate, especially when it applies to 'Word' tables and comments. Tools are provided to determine table count/structure, comment count and also to extract/clean tables and comments from 'Microsoft Word' 'docx' documents. There is also nascent support for '.doc' files.

Version: 0.6.1
Depends: R (≥ 3.2.0)
Imports: tools, xml2, purrr, dplyr, utils, httr, magrittr
Suggests: testthat, covr
Published: 2019-01-09
Author: Bob Rudis ORCID iD [aut, cre], Mark Dulhunty [ctb], Karlo Guidoni-Martins [ctb], Chris Muir [aut, ctb]
Maintainer: Bob Rudis <bob at>
License: MIT + file LICENSE
NeedsCompilation: no
SystemRequirements: LibreOffice (<>) required to extract data from .doc files.
Materials: NEWS
CRAN checks: docxtractr results


Reference manual: docxtractr.pdf
Package source: docxtractr_0.6.1.tar.gz
Windows binaries: r-devel:, r-release:, r-oldrel:
OS X binaries: r-release: docxtractr_0.6.1.tgz, r-oldrel: docxtractr_0.6.1.tgz
Old sources: docxtractr archive

Reverse dependencies:

Reverse suggests: ezpickr


Please use the canonical form to link to this page.