Simple tools for scraping webpages, extracting common html tags and parsing contents to a tidy, tabular format. Tools help with extraction of page titles, links, images, rss feeds, social media handles and page metadata.
Version: | 0.2.0 |
Depends: | R (≥ 3.5.0) |
Imports: | cld3, dplyr, httr, lubridate, magrittr, progress, R.utils, ranger, rvest, stringr, tibble, tidyr, tools, urltools, xml2 |
Suggests: | testthat |
Published: | 2021-02-21 |
Author: | Alastair Rushworth |
Maintainer: | Alastair Rushworth <alastairmrushworth at gmail.com> |
BugReports: | https://github.com/alastairrushworth/htmldf/issues |
License: | GPL-2 |
URL: | https://github.com/alastairrushworth/htmldf/ |
NeedsCompilation: | no |
Language: | en_GB |
Materials: | README |
CRAN checks: | htmldf results |
Reference manual: | htmldf.pdf |
Package source: | htmldf_0.2.0.tar.gz |
Windows binaries: | r-devel: htmldf_0.2.0.zip, r-release: htmldf_0.2.0.zip, r-oldrel: htmldf_0.2.0.zip |
macOS binaries: | r-release (arm64): htmldf_0.2.0.tgz, r-release (x86_64): htmldf_0.2.0.tgz, r-oldrel: htmldf_0.2.0.tgz |
Old sources: | htmldf archive |
Please use the canonical form https://CRAN.R-project.org/package=htmldf to link to this page.