SnowballC: Snowball stemmers based on the C libstemmer UTF-8 library

An R interface to the C libstemmer library that implements Porter's word stemming algorithm for collapsing words to a common root to aid comparison of vocabulary. Currently supported languages are Danish, Dutch, English, Finnish, French, German, Hungarian, Italian, Norwegian, Portuguese, Romanian, Russian, Spanish, Swedish and Turkish.

Version: 0.5
Published: 2013-05-22
Author: Milan Bouchet-Valat [aut, cre]
Maintainer: Milan Bouchet-Valat <nalimilan at>
License: BSD
Copyright: Dr Martin Porter (2001) for the libstemmer C library, and Milan Bouchet-Valat (2013) for the R package contents
NeedsCompilation: yes
In views: NaturalLanguageProcessing
CRAN checks: SnowballC results


Reference manual: SnowballC.pdf
Package source: SnowballC_0.5.tar.gz
Windows binaries: r-devel:, r-release:, r-oldrel:
OS X Snow Leopard binaries: r-release: SnowballC_0.5.tgz, r-oldrel: SnowballC_0.5.tgz
OS X Mavericks binaries: r-release: SnowballC_0.5.tgz

Reverse dependencies:

Reverse depends: lsa, RWBP
Reverse imports: DeducerText, NLPutils
Reverse suggests: koRpus, movMF, qdap, rattle, RcmdrPlugin.temis, stm, tm, topicmodels