htmldf: Simple Scraping and Tidy Webpage Summaries

Simple tools for scraping webpages, extracting common html tags and parsing contents to a tidy, tabular format. Tools help with extraction of page titles, links, images, rss feeds, social media handles and page metadata.

Version: 0.1.0
Depends: R (≥ 3.5.0)
Imports: cld3, dplyr, httr, lubridate, magrittr, progress, R.utils, ranger, rvest, stringr, tibble, tidyr, tools, urltools, xml2
Suggests: testthat
Published: 2020-09-25
Author: Alastair Rushworth
Maintainer: Alastair Rushworth <alastairmrushworth at gmail.com>
BugReports: https://github.com/alastairrushworth/htmldf/issues
License: GPL-2
URL: https://github.com/alastairrushworth/htmldf/
NeedsCompilation: no
Language: en_GB
Materials: README
CRAN checks: htmldf results

Downloads:

Reference manual: htmldf.pdf
Package source: htmldf_0.1.0.tar.gz
Windows binaries: r-devel: htmldf_0.1.0.zip, r-release: htmldf_0.1.0.zip, r-oldrel: htmldf_0.1.0.zip
macOS binaries: r-release: htmldf_0.1.0.tgz, r-oldrel: htmldf_0.1.0.tgz

Linking:

Please use the canonical form https://CRAN.R-project.org/package=htmldf to link to this page.