textrecipes: Extra 'Recipes' for Text Processing

Converting text to numerical features requires specifically created procedures, which are implemented as steps according to the 'recipes' package. These steps allows for tokenization, filtering, counting (tf and tfidf) and feature hashing.

Version: 1.0.6
Depends: R (≥ 3.6), recipes (≥ 1.0.7)
Imports: lifecycle, dplyr, generics (≥ 0.1.0), magrittr, Matrix, purrr, rlang, SnowballC, tibble, tokenizers, vctrs, glue
LinkingTo: cpp11
Suggests: covr, data.table, dials (≥ 1.2.0), hardhat, janitor, knitr, modeldata, rmarkdown, sentencepiece, spacyr, stopwords, stringi, testthat (≥ 3.0.0), text2vec, tokenizers.bpe, udpipe, wordpiece
Published: 2023-11-15
DOI: 10.32614/CRAN.package.textrecipes
Author: Emil Hvitfeldt ORCID iD [aut, cre], Michael W. Kearney [cph] (author of count_functions), Posit Software, PBC [cph, fnd]
Maintainer: Emil Hvitfeldt <emil.hvitfeldt at posit.co>
BugReports: https://github.com/tidymodels/textrecipes/issues
License: MIT + file LICENSE
URL: https://github.com/tidymodels/textrecipes, https://textrecipes.tidymodels.org/
NeedsCompilation: yes
SystemRequirements: "GNU make"
Materials: README NEWS
CRAN checks: textrecipes results

Documentation:

Reference manual: textrecipes.pdf
Vignettes: Working with n-grams
Cookbook - Using more complex recipes involving text
Under the hood - tokenlist

Downloads:

Package source: textrecipes_1.0.6.tar.gz
Windows binaries: r-devel: textrecipes_1.0.6.zip, r-release: textrecipes_1.0.6.zip, r-oldrel: textrecipes_1.0.6.zip
macOS binaries: r-release (arm64): textrecipes_1.0.6.tgz, r-oldrel (arm64): textrecipes_1.0.6.tgz, r-release (x86_64): textrecipes_1.0.6.tgz, r-oldrel (x86_64): textrecipes_1.0.6.tgz
Old sources: textrecipes archive

Linking:

Please use the canonical form https://CRAN.R-project.org/package=textrecipes to link to this page.