SnowballC: Snowball stemmers based on the C libstemmer UTF-8 library

An R interface to the C libstemmer library that implements Porter's word stemming algorithm for collapsing words to a common root to aid comparison of vocabulary. Currently supported languages are Danish, Dutch, English, Finnish, French, German, Hungarian, Italian, Norwegian, Portuguese, Romanian, Russian, Spanish, Swedish and Turkish.

Version: 0.5.1
Published: 2014-08-09
Author: Milan Bouchet-Valat [aut, cre]
Maintainer: Milan Bouchet-Valat <nalimilan at>
License: BSD_2_clause + file LICENSE
Copyright: Dr Martin Porter (2001) for the libstemmer C library, and Milan Bouchet-Valat (2013) for the R package contents
NeedsCompilation: yes
Materials: NEWS
In views: NaturalLanguageProcessing
CRAN checks: SnowballC results


Reference manual: SnowballC.pdf
Package source: SnowballC_0.5.1.tar.gz
Windows binaries: r-devel:, r-release:, r-oldrel:
OS X binaries: r-release: SnowballC_0.5.1.tgz, r-oldrel: SnowballC_0.5.1.tgz
Old sources: SnowballC archive

Reverse dependencies:

Reverse depends: lsa, RWBP
Reverse imports: available, bibliometrix, corpustools, DeducerText, gofastr, goldi, inpdfr, lexRankr, NLPutils, proustr, ptstem, quanteda, R.temis, revtools, rJST, SentimentAnalysis, slowraker, stmCorrViz, TAShiny, textmining, textstem, tokenizers
Reverse suggests: koRpus, movMF, qdap, rattle, RcmdrPlugin.temis, stm, textmineR, textreg, tm, topicmodels, wikisourcer


Please use the canonical form to link to this page.