koRpus: An R Package for Text Analysis

A set of tools to analyze texts. Includes, amongst others, functions for automatic language detection, hyphenation, several indices of lexical diversity (e.g., type token ratio, HD-D/vocd-D, MTLD) and readability (e.g., Flesch, SMOG, LIX, Dale-Chall). Basic import functions for language corpora are also provided, to enable frequency analyses (supports Celex and Leipzig Corpora Collection file formats) and measures like tf-idf. Note: For full functionality a local installation of TreeTagger is recommended. 'koRpus' also includes a plugin for the R GUI and IDE RKWard, providing graphical dialogs for its basic features. The respective R package 'rkward' cannot be installed directly from a repository, as it is a part of RKWard. To make full use of this feature, please install RKWard from https://rkward.kde.org (plugins are detected automatically). Due to some restrictions on CRAN, the full package sources are only available from the project homepage.

Version: 0.05-6
Depends: R (≥ 2.10.0), methods
Suggests: testthat, tm, SnowballC, shiny
Enhances: rkward
Published: 2015-06-30
Author: m.eik michalke [aut, cre], Earl Brown [ctb], Alberto Mirisola [ctb], Alexandre Brulet [ctb], Laura Hauser [ctb]
Maintainer: m.eik michalke <meik.michalke at hhu.de>
License: GPL (≥ 3)
URL: http://reaktanz.de/?c=hacking&s=koRpus
NeedsCompilation: no
Citation: koRpus citation info
Materials: NEWS ChangeLog
In views: NaturalLanguageProcessing
CRAN checks: koRpus results


Reference manual: koRpus.pdf
Vignettes: Using the koRpus Package for Text Analysis
Package source: koRpus_0.05-6.tar.gz
Windows binaries: r-devel: koRpus_0.05-6.zip, r-release: koRpus_0.05-6.zip, r-oldrel: koRpus_0.05-6.zip
OS X Snow Leopard binaries: r-release: koRpus_0.05-6.tgz, r-oldrel: koRpus_0.05-5.tgz
OS X Mavericks binaries: r-release: koRpus_0.05-6.tgz
Old sources: koRpus archive

Reverse dependencies:

Reverse suggests: pander, qdap