tm.plugin.webmining: Retrieve Structured, Textual Data from Various Web Sources

Facilitate text retrieval from feed formats like XML (RSS, ATOM) and JSON. Also direct retrieval from HTML is supported. As most (news) feeds only incorporate small fractions of the original text tm.plugin.webmining even retrieves and extracts the text of the original text source.

Version: 1.3
Depends: R (≥ 3.1.0)
Imports: NLP (≥ 0.1-2), tm (≥ 0.6), boilerpipeR, RCurl, XML, RJSONIO
Suggests: testthat
Published: 2015-05-11
Author: Mario Annau [aut, cre]
Maintainer: Mario Annau <mario.annau at>
License: GPL-3
NeedsCompilation: no
Materials: NEWS
In views: NaturalLanguageProcessing, WebTechnologies
CRAN checks: tm.plugin.webmining results


Reference manual: tm.plugin.webmining.pdf
Vignettes: Introduction to the tm.plugin.webmining Package
Package source: tm.plugin.webmining_1.3.tar.gz
Windows binaries: r-devel:, r-release:, r-oldrel:
OS X Snow Leopard binaries: r-release: tm.plugin.webmining_1.3.tgz, r-oldrel: tm.plugin.webmining_1.3.tgz
OS X Mavericks binaries: r-release: tm.plugin.webmining_1.3.tgz
Old sources: tm.plugin.webmining archive