clustRcompaR: Easy Interface for Clustering a Set of Documents and Exploring Group- Based Patterns

Provides an interface to perform cluster analysis on a corpus of text. Interfaces to Quanteda to assemble text corpuses easily. Deviationalizes text vectors prior to clustering using technique described by Sherin (Sherin, B. [2013]. A computational study of commonsense science: An exploration in the automated analysis of clinical interview data. Journal of the Learning Sciences, 22(4), 600-638. Chicago. http://dx.doi.org/10.1080/10508406.2013.836654). Uses cosine similarity as distance metric for two stage clustering process, involving Ward's algorithm hierarchical agglomerative clustering, and k-means clustering. Selects optimal number of clusters to maximize "variance explained" by clusters, adjusted by the number of clusters. Provides plotted output of clustering results as well as printed output. Assesses "model fit" of clustering solution to a set of preexisting groups in dataset.

Version: 0.1.0
Depends: R (≥ 3.1.3)
Imports: quanteda, dplyr, ggplot2, ppls, tidyr
Suggests: knitr, rmarkdown
Published: 2017-01-07
Author: Josh Rosenberg, Alex Lishinski
Maintainer: Alex Lishinski <alexlishinski at gmail.com>
License: GPL-3
URL: https://github.com/alishinski/clustRcompaR
NeedsCompilation: no
Materials: README
CRAN checks: clustRcompaR results

Downloads:

Reference manual: clustRcompaR.pdf
Vignettes: Vignette Title
Package source: clustRcompaR_0.1.0.tar.gz
Windows binaries: r-devel: clustRcompaR_0.1.0.zip, r-release: clustRcompaR_0.1.0.zip, r-oldrel: clustRcompaR_0.1.0.zip
OS X El Capitan binaries: r-release: clustRcompaR_0.1.0.tgz
OS X Mavericks binaries: r-oldrel: clustRcompaR_0.1.0.tgz

Linking:

Please use the canonical form https://CRAN.R-project.org/package=clustRcompaR to link to this page.