The basic idea of latent semantic analysis (LSA) is that texts have a higher-order (latent semantic) structure which is, however, obscured by word usage (e.g. through synonymy or polysemy). Conceptual indices derived statistically via a truncated singular value decomposition (a two-mode factor analysis) of a given document-term matrix can overcome this variability problem.
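To make the truncation step concrete, here is a minimal sketch in Python with NumPy. It is not the package's own code: the toy matrix, the term names, and the helper functions `truncated_svd` and `cosine` are assumptions made up for illustration. The matrix is stored with terms as rows and documents as columns.

```python
import numpy as np

# Hypothetical toy data: rows are terms, columns are documents.
# "car" and "automobile" never occur in the same document,
# but both co-occur with "engine".
terms = ["car", "automobile", "engine", "flower", "petal"]
M = np.array([
    [1, 0, 0],  # car
    [0, 1, 0],  # automobile
    [1, 1, 0],  # engine
    [0, 0, 1],  # flower
    [0, 0, 1],  # petal
], dtype=float)


def truncated_svd(matrix, k):
    """Return the k largest singular triplets of `matrix`."""
    U, s, Vt = np.linalg.svd(matrix, full_matrices=False)
    return U[:, :k], s[:k], Vt[:k, :]


def cosine(a, b):
    """Cosine similarity between two non-zero vectors."""
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))


k = 2  # number of latent dimensions to keep (chosen by hand here)
U_k, s_k, Vt_k = truncated_svd(M, k)

# Rank-k reconstruction of the original matrix in the latent space.
M_k = U_k @ np.diag(s_k) @ Vt_k

# Compare the term vectors for "car" and "automobile" before and after.
raw_sim = cosine(M[0], M[1])
latent_sim = cosine(M_k[0], M_k[1])
unrelated_sim = cosine(M_k[0], M_k[3])  # "car" vs. "flower"

print(f"car/automobile, raw counts:   {raw_sim:.3f}")
print(f"car/automobile, rank-{k} space: {latent_sim:.3f}")
print(f"car/flower, rank-{k} space:     {unrelated_sim:.3f}")
```

In this toy case the two synonyms have no overlap in the raw counts, so their cosine similarity is 0. After truncating to two dimensions, the component that distinguishes them is dropped and their similarity rises to about 1, while "car" and "flower" remain unrelated. Real corpora need far more dimensions, and the choice of `k` is a tuning decision.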
| Version: | 0.59 |
| Depends: | Rstem |
| Date: | 2007-11-28 |
| Author: | Fridolin Wild |
| Maintainer: | Fridolin Wild <fridolin.wild at wu-wien.ac.at> |
| License: | GPL (≥ 2) |
| In views: | NaturalLanguageProcessing |
| CRAN checks: | lsa results |
Downloads:
| Package source: | lsa_0.59.tar.gz |
| MacOS X binary: | lsa_0.59.tgz |
| Windows binary: | lsa_0.59.zip |
| Reference manual: | lsa.pdf |
| Old sources: | lsa archive |