chinese.misc: Miscellaneous Tools for Chinese Text Mining and More

Efforts are made to make Chinese text mining easier, faster, and robust to errors. Document term matrix can be generated by only one line of code; detecting encoding, segmenting and removing stop words are done automatically. Some convenient tools are also supplied.

Version: 0.2.3
Depends: R (≥ 3.6.0)
Imports: jiebaR, NLP, tm (≥ 0.7), stringi, slam (≥ 0.1-37), Matrix, purrr
Published: 2020-09-11
DOI: 10.32614/CRAN.package.chinese.misc
Author: Jiang Wu [aut, cre] (from Capital Normal University)
Maintainer: Jiang Wu <textidea at>
License: GPL-3
NeedsCompilation: no
CRAN checks: chinese.misc results


Reference manual: chinese.misc.pdf


Package source: chinese.misc_0.2.3.tar.gz
Windows binaries: r-devel:, r-release:, r-oldrel:
macOS binaries: r-release (arm64): chinese.misc_0.2.3.tgz, r-oldrel (arm64): chinese.misc_0.2.3.tgz, r-release (x86_64): chinese.misc_0.2.3.tgz, r-oldrel (x86_64): chinese.misc_0.2.3.tgz
Old sources: chinese.misc archive

Reverse dependencies:

Reverse imports: LDABiplots, LDAShiny


Please use the canonical form to link to this page.