A set of tools to analyze texts. It includes, among others, functions for automatic language detection, hyphenation, several indices of lexical diversity (e.g., type-token ratio, HD-D/vocd-D, MTLD) and readability (e.g., Flesch, SMOG, LIX, Dale-Chall). Basic import functions for language corpora are also provided to enable frequency analyses (the Celex and Leipzig Corpora Collection file formats are supported) and measures such as tf-idf. Note: For full functionality, a local installation of TreeTagger is recommended. koRpus also includes a plugin for the R GUI and IDE RKWard, providing dialogs for its basic features. To use them, install RKWard from http://rkward.sf.net (plugins are detected automatically). Due to some restrictions on CRAN, the full package sources are only available from the project homepage.
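To make two of the measures named above concrete, here is a minimal sketch of the type-token ratio (distinct words divided by total words) and the LIX readability index (average sentence length plus the percentage of words longer than six letters). It is written in plain Python for illustration and does not use or mirror the koRpus API; the function names, the regex tokenizer, and the sentence splitter are all assumptions of this sketch, and koRpus's own tokenization (optionally via TreeTagger) will generally give different counts.

```python
import re


def tokenize(text):
    """Return lowercase word tokens (naive regex tokenizer, an assumption of this sketch)."""
    return re.findall(r"[^\W\d_]+", text.lower())


def type_token_ratio(tokens):
    """Type-token ratio: number of distinct tokens divided by total tokens."""
    return len(set(tokens)) / len(tokens) if tokens else 0.0


def lix(text):
    """LIX: words per sentence plus percentage of words with more than 6 letters."""
    tokens = tokenize(text)
    # Naive sentence split on ., ! and ? (assumption; abbreviations are not handled).
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    if not tokens or not sentences:
        return 0.0
    long_words = sum(1 for t in tokens if len(t) > 6)
    return len(tokens) / len(sentences) + 100.0 * long_words / len(tokens)


sample = "The cat sat on the mat. The dog sat too."
print(type_token_ratio(tokenize(sample)))  # 0.7 (7 distinct words out of 10)
print(lix(sample))                         # 5.0 (10 words / 2 sentences, no long words)
```

The plain type-token ratio falls as texts get longer, which is one reason length-corrected indices such as HD-D and MTLD are listed alongside it.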
Version: | 0.05-5 |
Depends: | R (≥ 2.10.0), methods |
Suggests: | testthat, tm, SnowballC, shiny |
Enhances: | rkward |
Published: | 2014-03-20 |
Author: | m.eik michalke [aut, cre], Earl Brown [ctb], Alberto Mirisola [ctb], Alexandre Brulet [ctb], Laura Hauser [ctb] |
Maintainer: | m.eik michalke <meik.michalke at hhu.de> |
License: | GPL (≥ 3) |
URL: | http://reaktanz.de/?c=hacking&s=koRpus |
NeedsCompilation: | no |
Citation: | koRpus citation info |
Materials: | NEWS ChangeLog |
In views: | NaturalLanguageProcessing |
CRAN checks: | koRpus results |
Reference manual: | koRpus.pdf |
Vignettes: | Using the koRpus Package for Text Analysis |
Package source: | koRpus_0.05-5.tar.gz |
Windows binaries: | r-devel: koRpus_0.05-5.zip, r-release: koRpus_0.05-5.zip, r-oldrel: koRpus_0.05-5.zip |
OS X Snow Leopard binaries: | r-release: koRpus_0.05-5.tgz, r-oldrel: koRpus_0.05-5.tgz |
OS X Mavericks binaries: | r-release: koRpus_0.05-5.tgz |
Old sources: | koRpus archive |
Reverse suggests: | qdap |