Package for corpus analysis using the Corpus Workbench ('CWB', <https://cwb.sourceforge.io>) as an efficient back end for indexing and querying large corpora. The package offers functionality to flexibly create subcorpora and to carry out basic statistical operations (counts, co-occurrences, etc.). The original full text of documents can be reconstructed and inspected at any time. Beyond that, the package is intended to serve as an interface to packages implementing advanced statistical procedures. The data structures these packages require (document-term matrices, term-co-occurrence matrices, etc.) can be created from the indexed corpora.
Version: | 0.8.7 |
Depends: | R (≥ 3.5.0) |
Imports: | methods, data.table (≥ 1.12.2), fs, slam, Matrix, tm, DT, xml2, stringi, utils, jsonlite, parallel, pbapply, RcppCWB (≥ 0.5.3), magrittr, knitr, lifecycle, rlang |
Suggests: | markdown, rmarkdown, htmltools (≥ 0.4.0), highlight, sendmailR, shiny, shinythemes, shinyjs, miniUI, rhandsontable, testthat, tidytext, covr, igraph, NLP, httr, protolite, curl |
Published: | 2022-08-26 |
Author: | Andreas Blaette [aut, cre], Christoph Leonhardt [ctb], Marius Bertram [ctb] |
Maintainer: | Andreas Blaette <andreas.blaette at uni-due.de> |
BugReports: | https://github.com/PolMine/polmineR/issues |
License: | GPL-3 |
URL: | https://github.com/PolMine/polmineR |
NeedsCompilation: | no |
Citation: | polmineR citation info |
Materials: | NEWS |
CRAN checks: | polmineR results |
Reference manual: | polmineR.pdf |
Vignettes: | OpenCPU, Encodings, Introduction to polmineR |
Package source: | polmineR_0.8.7.tar.gz |
Windows binaries: | r-devel: polmineR_0.8.7.zip, r-release: polmineR_0.8.7.zip, r-oldrel: polmineR_0.8.7.zip |
macOS binaries: | r-release (arm64): polmineR_0.8.7.tgz, r-oldrel (arm64): polmineR_0.8.7.tgz, r-release (x86_64): polmineR_0.8.7.tgz, r-oldrel (x86_64): polmineR_0.8.7.tgz |
Old sources: | polmineR archive |
Please use the canonical form https://CRAN.R-project.org/package=polmineR to link to this page.