The PRISMA package is capable of loading and processing huge text
corpora processed with the sally toolbox (http://www.mlsec.org/sally/).
sally acts as a very fast preprocessor which splits the text files into
tokens or n-grams. These output files can then be read with the PRISMA
package which applies testing-based token selection and has some
replicate-aware, highly tuned non-negative matrix factorization and
principal component analysis implementation which allows the processing of
very big data sets even on desktop machines.
Version: |
0.2-5 |
Depends: |
R (≥ 2.10), Matrix, gplots, ggplot2 |
Suggests: |
tm (≥ 0.6) |
Published: |
2015-03-16 |
Author: |
Tammo Krueger, Nicole Kraemer |
Maintainer: |
Tammo Krueger <tammokrueger at googlemail.com> |
License: |
GPL-2 | GPL-3 [expanded from: GPL (≥ 2.0)] |
NeedsCompilation: |
no |
Materials: |
README |
CRAN checks: |
PRISMA results |