ngram: Fast n-Gram 'Tokenization'

An n-gram is a sequence of n "words" taken from a body of text in order. This package offers utilities for creating, displaying, summarizing, and "babbling" n-grams. The 'tokenization' and "babbling" are handled by very efficient C code, which can even be build as its own standalone library. The babbler is a simple Markov chain. The package also offers a vignette with complete example 'workflows' and information about the utilities offered in the package.

Version: 3.0.1
Depends: R (≥ 3.0.0)
Imports: methods, assertthat (≥ 0.1)
Published: 2016-07-13
Author: Drew Schmidt [aut, cre], Christian Heckendorf [aut]
Maintainer: Drew Schmidt <wrathematics at gmail.com>
BugReports: https://github.com/wrathematics/ngram/issues
License: BSD 2-clause License + file LICENSE
URL: https://github.com/wrathematics/ngram
NeedsCompilation: yes
Citation: ngram citation info
Materials: README ChangeLog
CRAN checks: ngram results

Downloads:

Reference manual: ngram.pdf
Package source: ngram_3.0.1.tar.gz
Windows binaries: r-devel: ngram_3.0.1.zip, r-release: ngram_3.0.1.zip, r-oldrel: ngram_3.0.1.zip
OS X Mavericks binaries: r-release: ngram_3.0.1.tgz, r-oldrel: ngram_3.0.1.tgz
Old sources: ngram archive

Linking:

Please use the canonical form https://CRAN.R-project.org/package=ngram to link to this page.