tidytext: Text Mining using 'dplyr', 'ggplot2', and Other Tidy Tools

Using tidy data principles can make many text mining tasks easier, more effective, and consistent with tools already in wide use. Much of the infrastructure needed for text mining with tidy data frames already exists in packages like 'dplyr', 'broom', 'tidyr', and 'ggplot2'. In this package, we provide functions and supporting data sets to allow conversion of text to and from tidy formats, and to switch seamlessly between tidy tools and existing text mining packages.

Version: 0.4.2
Depends: R (≥ 2.10)
Imports: cli, dplyr (≥ 1.1.1), generics, janeaustenr, lifecycle, Matrix, methods, purrr (≥ 0.1.1), rlang (≥ 0.4.10), stringr, tibble, tokenizers, vctrs
Suggests: broom, covr, data.table, ggplot2, hunspell, knitr, mallet, NLP, quanteda, readr, reshape2, rmarkdown, scales, stm, stopwords, testthat (≥ 2.1.0), textdata, tidyr, tm, topicmodels, vdiffr, wordcloud
Published: 2024-04-10
DOI: 10.32614/CRAN.package.tidytext
Author: Gabriela De Queiroz [ctb], Colin Fay ORCID iD [ctb], Emil Hvitfeldt [ctb], Os Keyes ORCID iD [ctb], Kanishka Misra [ctb], Tim Mastny [ctb], Jeff Erickson [ctb], David Robinson [aut], Julia Silge ORCID iD [aut, cre]
Maintainer: Julia Silge <julia.silge at gmail.com>
BugReports: https://github.com/juliasilge/tidytext/issues
License: MIT + file LICENSE
URL: https://juliasilge.github.io/tidytext/, https://github.com/juliasilge/tidytext
NeedsCompilation: no
Citation: tidytext citation info
Materials: README NEWS
In views: NaturalLanguageProcessing
CRAN checks: tidytext results

Documentation:

Reference manual: tidytext.pdf
Vignettes: Tidy Term Frequency and Inverse Document Frequency (tf-idf)
Converting to and from Document-Term Matrix and Corpus objects
Introduction to tidytext

Downloads:

Package source: tidytext_0.4.2.tar.gz
Windows binaries: r-devel: tidytext_0.4.2.zip, r-release: tidytext_0.4.2.zip, r-oldrel: tidytext_0.4.2.zip
macOS binaries: r-release (arm64): tidytext_0.4.2.tgz, r-oldrel (arm64): tidytext_0.4.2.tgz, r-release (x86_64): tidytext_0.4.2.tgz, r-oldrel (x86_64): tidytext_0.4.2.tgz
Old sources: tidytext archive

Reverse dependencies:

Reverse imports: akc, AnimalSequences, available, bibliometrix, CINE, crsra, dail, DistatisR, DOPE, epitweetr, ggpage, Goodreader, GSEAmining, iheiddown, LDAShiny, MadanText, MadanTextNetwork, madshapR, margaret, miaViz, miRetrieve, moodleR, naturaList, NIMAA, opitools, oRus, saotd, scicomptools, sherlock, statquotes, texter, TextForecast, TextMiningGUI, tidylda, tsentiment, TwitterAutomatedTrading, ulex, upstartr, vivainsights, WeatherSentiment, weed, widyr, wpa
Reverse suggests: aifeducation, bugphyzz, eurlex, funrar, gutenbergr, ibawds, LexisNexisTools, MetMashR, mvrsquared, newsanchor, openintro, polmineR, rfars, schrute, smartid, spacyr, spRingsteen, textmineR, tidylo, tidypmc, Twitmo, vosonSML
Reverse enhances: quanteda

Linking:

Please use the canonical form https://CRAN.R-project.org/package=tidytext to link to this page.