doc2vec: Distributed Representations of Sentences and Documents

Learn vector representations of sentences, paragraphs or documents by using the 'Paragraph Vector' algorithms, namely the distributed bag of words ('PV-DBOW') and the distributed memory ('PV-DM') model. The techniques in the package are detailed in the paper "Distributed Representations of Sentences and Documents" by Mikolov et al. (2014), available at <arXiv:1405.4053>.

Version: 0.1.1
Depends: R (≥ 2.10)
Imports: Rcpp (≥ 0.11.5), stats
LinkingTo: Rcpp
Suggests: tokenizers.bpe
Published: 2021-01-21
Author: Jan Wijffels [aut, cre, cph] (R wrapper), BNOSAC [cph] (R wrapper), hiyijian [ctb, cph] (Code in src/doc2vec)
Maintainer: Jan Wijffels <jwijffels at bnosac.be>
License: MIT + file LICENSE
URL: https://github.com/bnosac/doc2vec
NeedsCompilation: yes
Materials: README NEWS
CRAN checks: doc2vec results

Downloads:

Reference manual: doc2vec.pdf
Package source: doc2vec_0.1.1.tar.gz
Windows binaries: r-devel: doc2vec_0.1.1.zip, r-release: doc2vec_0.1.1.zip, r-oldrel: doc2vec_0.1.1.zip
macOS binaries: r-release: doc2vec_0.1.1.tgz, r-oldrel: doc2vec_0.1.1.tgz
Old sources: doc2vec archive

Linking:

Please use the canonical form https://CRAN.R-project.org/package=doc2vec to link to this page.