Simple tools for scraping webpages, extracting common html tags and parsing contents to a tidy, tabular format. Tools help with extraction of page titles, links, images, rss feeds, social media handles and page metadata.
Version: | 0.1.0 |
Depends: | R (≥ 3.5.0) |
Imports: | cld3, dplyr, httr, lubridate, magrittr, progress, R.utils, ranger, rvest, stringr, tibble, tidyr, tools, urltools, xml2 |
Suggests: | testthat |
Published: | 2020-09-25 |
Author: | Alastair Rushworth |
Maintainer: | Alastair Rushworth <alastairmrushworth at gmail.com> |
BugReports: | https://github.com/alastairrushworth/htmldf/issues |
License: | GPL-2 |
URL: | https://github.com/alastairrushworth/htmldf/ |
NeedsCompilation: | no |
Language: | en_GB |
Materials: | README |
CRAN checks: | htmldf results |
Reference manual: | htmldf.pdf |
Package source: | htmldf_0.1.0.tar.gz |
Windows binaries: | r-devel: htmldf_0.1.0.zip, r-release: htmldf_0.1.0.zip, r-oldrel: htmldf_0.1.0.zip |
macOS binaries: | r-release: htmldf_0.1.0.tgz, r-oldrel: htmldf_0.1.0.tgz |
Please use the canonical form https://CRAN.R-project.org/package=htmldf to link to this page.