SnowballC: Snowball Stemmers Based on the C 'libstemmer' UTF-8 Library

An R interface to the C 'libstemmer' library that implements Porter's word stemming algorithm for collapsing words to a common root to aid comparison of vocabulary. Currently supported languages are Danish, Dutch, English, Finnish, French, German, Hungarian, Italian, Norwegian, Portuguese, Romanian, Russian, Spanish, Swedish and Turkish.

Version: 0.6.0
Published: 2019-01-15
Author: Milan Bouchet-Valat [aut, cre]
Maintainer: Milan Bouchet-Valat <nalimilan at>
License: BSD_3_clause + file LICENSE
Copyright: Dr Martin Porter (2001) and Richard Boulton (2004, 2005) for the 'libstemmer' C library, and Milan Bouchet-Valat (2013) for the R package contents.
NeedsCompilation: yes
Materials: NEWS
In views: NaturalLanguageProcessing
CRAN checks: SnowballC results


Reference manual: SnowballC.pdf
Package source: SnowballC_0.6.0.tar.gz
Windows binaries: r-devel:, r-release:, r-oldrel:
OS X binaries: r-release: SnowballC_0.6.0.tgz, r-oldrel: SnowballC_0.6.0.tgz
Old sources: SnowballC archive

Reverse dependencies:

Reverse depends: lsa, RWBP
Reverse imports: available, bibliometrix, corpustools, DeducerText, gofastr, goldi, inpdfr, lexRankr, needmining, NLPutils, petro.One, proustr, ptstem, quanteda, R.temis, revtools, rJST, slowraker, stmCorrViz, TAShiny, TextForecast, textmining, textrecipes, textstem, tokenizers
Reverse suggests: koRpus, movMF, qdap, rattle, RcmdrPlugin.temis, SentimentAnalysis, stm, textmineR, textreg, tm, topicmodels, wikisourcer


Please use the canonical form to link to this page.