tm.plugin.europresse: Import Articles from 'Europresse' Using the 'tm' Text Mining Framework

Provides a 'tm' Source to create corpora from articles exported from the 'Europresse' content provider as HTML files. It reads both the text content and the metadata of each article (including source, date, title, author and pages).
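
As a rough illustration of the workflow described above, the following minimal sketch builds a corpus from a single exported file. It assumes the package exports a source constructor named EuropresseSource() that accepts a file path, and the file name europresse_export.html is purely hypothetical; consult the reference manual for the actual interface.

```r
library(tm)
library(tm.plugin.europresse)

# Hypothetical path to an HTML file exported from Europresse
file <- "europresse_export.html"

if (file.exists(file)) {
  # Assumption: EuropresseSource() is the 'tm' Source provided by the package
  corpus <- VCorpus(EuropresseSource(file))

  # Metadata of the first article (source, date, title, author, pages, ...)
  print(meta(corpus[[1]]))

  # Text content of the first article
  print(content(corpus[[1]]))
}
```

The file.exists() guard only keeps the sketch from failing when the placeholder file is absent; with a real export it can be dropped.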

Version: 1.3
Imports: utils, NLP, tm (≥ 0.6), XML
Published: 2015-07-29
Author: Milan Bouchet-Valat [aut, cre]
Maintainer: Milan Bouchet-Valat <nalimilan at>
License: GPL-2 | GPL-3 [expanded from: GPL (≥ 2)]
NeedsCompilation: no
Materials: NEWS
In views: NaturalLanguageProcessing
CRAN checks: tm.plugin.europresse results


Reference manual: tm.plugin.europresse.pdf
Package source: tm.plugin.europresse_1.3.tar.gz
Windows binaries: r-devel:, r-release:, r-oldrel:
OS X Snow Leopard binaries: r-release: tm.plugin.europresse_1.3.tgz, r-oldrel: tm.plugin.europresse_1.2.tgz
OS X Mavericks binaries: r-release: tm.plugin.europresse_1.3.tgz
Old sources: tm.plugin.europresse archive

Reverse dependencies:

Reverse suggests: RcmdrPlugin.temis