A set of tools to analyze texts. Includes, amongst others, functions for automatic language detection, hyphenation, several indices of lexical diversity (e.g., type token ratio, HD-D/vocd-D, MTLD) and readability (e.g., Flesch, SMOG, LIX, Dale-Chall). Basic import functions for language corpora are also provided, to enable frequency analyses (supports Celex and Leipzig Corpora Collection file formats). #' Note: For full functionality a local installation of TreeTagger is recommended. Be encouraged to send feedback to the author(s)!
| Version: | 0.04-40 |
| Depends: | R (≥ 2.10.0), methods |
| Suggests: | testthat, tm, Snowball |
| Enhances: | rkward |
| Published: | 2013-04-08 |
| Author: | m.eik michalke, with contributions from Earl Brown, Alberto Mirisola, Alexandre Brulet, and Laura Hauser |
| Maintainer: | m.eik michalke <meik.michalke at hhu.de> |
| License: | GPL (≥ 3) |
| URL: | http://reaktanz.de/?c=hacking&s=koRpus |
| NeedsCompilation: | no |
| Citation: | koRpus citation info |
| In views: | NaturalLanguageProcessing |
| CRAN checks: | koRpus results |
| Package source: | koRpus_0.04-40.tar.gz |
| MacOS X binary: | koRpus_0.04-40.tgz |
| Windows binary: | koRpus_0.04-40.zip |
| Reference manual: | koRpus.pdf |
| Vignettes: |
Using the koRpus Package for Text Analysis |
| News/ChangeLog: | NEWS ChangeLog |
| Old sources: | koRpus archive |
| Reverse suggests: | qdap |