koRpus: An R Package for Text Analysis

A set of tools for analyzing texts. It includes, among others, functions for automatic language detection, hyphenation, several indices of lexical diversity (e.g., type-token ratio, HD-D/vocd-D, MTLD) and readability (e.g., Flesch, SMOG, LIX, Dale-Chall). Basic import functions for language corpora are also provided to enable frequency analyses (Celex and Leipzig Corpora Collection file formats are supported) and measures like tf-idf. Note: For full functionality, a local installation of TreeTagger is recommended. koRpus also includes a plugin for the R GUI and IDE RKWard, providing dialogs for its basic features. To use them, install RKWard from http://rkward.sf.net (plugins are detected automatically). Due to some restrictions on CRAN, the full package sources are only available from the project homepage.
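
For readers unfamiliar with two of the measures named in the description, the following is a minimal, language-agnostic sketch of the type-token ratio and tf-idf. It is written in plain Python and does not use or reproduce the koRpus API; the function names (`tokenize`, `type_token_ratio`, `tf_idf`), the naive regex tokenizer, and the particular tf-idf weighting (relative term frequency times log(N / document frequency)) are all assumptions made for illustration, and the package may define these measures differently.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into word tokens (a deliberately naive tokenizer)."""
    return re.findall(r"[a-z']+", text.lower())


def type_token_ratio(tokens):
    """Number of distinct tokens (types) divided by the total number of tokens."""
    return len(set(tokens)) / len(tokens) if tokens else 0.0


def tf_idf(docs):
    """Return one {term: weight} dict per document.

    Assumed weighting: tf = count / document length,
    idf = log(number of documents / documents containing the term).
    """
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)
    # Count, for each term, how many documents contain it at least once.
    doc_freq = Counter(term for toks in tokenized for term in set(toks))
    weights = []
    for toks in tokenized:
        counts = Counter(toks)
        weights.append({
            term: (count / len(toks)) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return weights
```

With this sketch, `type_token_ratio(tokenize("The cat sat on the mat"))` divides 5 distinct words by 6 tokens, and a term that occurs in every document receives a tf-idf weight of zero because its idf is log(1).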

Version: 0.05-5
Depends: R (≥ 2.10.0), methods
Suggests: testthat, tm, SnowballC, shiny
Enhances: rkward
Published: 2014-03-20
Author: m.eik michalke [aut, cre], Earl Brown [ctb], Alberto Mirisola [ctb], Alexandre Brulet [ctb], Laura Hauser [ctb]
Maintainer: m.eik michalke <meik.michalke at hhu.de>
License: GPL (≥ 3)
URL: http://reaktanz.de/?c=hacking&s=koRpus
NeedsCompilation: no
Citation: koRpus citation info
Materials: NEWS ChangeLog
In views: NaturalLanguageProcessing
CRAN checks: koRpus results

Downloads:

Reference manual: koRpus.pdf
Vignettes: Using the koRpus Package for Text Analysis
Package source: koRpus_0.05-5.tar.gz
OS X binary: koRpus_0.05-5.tgz
Windows binary: koRpus_0.05-5.zip
Old sources: koRpus archive

Reverse dependencies:

Reverse suggests: qdap