doc2vec: Distributed Representations of Sentences and Documents

Learn vector representations of sentences, paragraphs or documents by using the 'Paragraph Vector' algorithms, namely the distributed bag of words ('PV-DBOW') and the distributed memory ('PV-DM') model. The techniques in the package are detailed in the paper "Distributed Representations of Sentences and Documents" by Mikolov et al. (2014), available at <arXiv:1405.4053>.

Version: 0.1.1
Depends: R (≥ 2.10)
Imports: Rcpp (≥ 0.11.5), stats
LinkingTo: Rcpp
Suggests: tokenizers.bpe
Published: 2021-01-21
Author: Jan Wijffels [aut, cre, cph] (R wrapper), BNOSAC [cph] (R wrapper), hiyijian [ctb, cph] (Code in src/doc2vec)
Maintainer: Jan Wijffels <jwijffels at>
License: MIT + file LICENSE
NeedsCompilation: yes
Materials: README NEWS
CRAN checks: doc2vec results


Reference manual: doc2vec.pdf
Package source: doc2vec_0.1.1.tar.gz
Windows binaries: r-devel:, r-release:, r-oldrel:
macOS binaries: r-release: doc2vec_0.1.1.tgz, r-oldrel: doc2vec_0.1.1.tgz
Old sources: doc2vec archive


Please use the canonical form to link to this page.