This package provides a tm Source to create corpora from articles exported from the Europresse content provider as HTML files. It is able to read both text content and meta-data information (including source, date, title, author and pages).
Version: | 1.1 |
Imports: | NLP, tm (≥ 0.6), XML |
Published: | 2014-06-11 |
Author: | Milan Bouchet-Valat [aut, cre] |
Maintainer: | Milan Bouchet-Valat <nalimilan at club.fr> |
BugReports: | https://r-forge.r-project.org/tracker/?group_id=1437 |
License: | GPL-2 | GPL-3 [expanded from: GPL (≥ 2)] |
URL: | https://r-forge.r-project.org/projects/r-temis/ |
NeedsCompilation: | no |
Materials: | NEWS |
In views: | NaturalLanguageProcessing |
CRAN checks: | tm.plugin.europresse results |
Reference manual: | tm.plugin.europresse.pdf |
Package source: | tm.plugin.europresse_1.1.tar.gz |
Windows binaries: | r-devel: tm.plugin.europresse_1.1.zip, r-release: tm.plugin.europresse_1.1.zip, r-oldrel: tm.plugin.europresse_1.0.1.zip |
OS X Snow Leopard binaries: | r-release: tm.plugin.europresse_1.1.tgz, r-oldrel: tm.plugin.europresse_1.0.1.tgz |
OS X Mavericks binaries: | r-release: tm.plugin.europresse_1.1.tgz |
Old sources: | tm.plugin.europresse archive |
Reverse suggests: | RcmdrPlugin.temis |