All functions

bind_tf_idf

Bind the term frequency and inverse document frequency of a tidy text dataset to the dataset

cast_sparse

Create a sparse matrix from row names, column names, and values in a table.

tidy glance

Tidiers for a corpus object from the quanteda package

tidy

Tidy dictionary objects from the quanteda package

cast_tdm cast_dtm cast_dfm

Casting a data frame to a DocumentTermMatrix, TermDocumentMatrix, or dfm

get_sentiments

Get a tidy data frame of a single sentiment lexicon

get_stopwords

Get a tidy data frame of a single stopword lexicon

tidy augment glance

Tidiers for LDA objects from the topicmodels package

tidy augment

Tidiers for Latent Dirichlet Allocation models from the mallet package

nma_words

English negators, modals, and adverbs

parts_of_speech

Parts of speech for English words from the Moby Project

sentiments

Sentiment lexicons from three sources

tidy augment glance

Tidiers for Structural Topic Models from the stm package

stop_words

Various lexicons for English stop words

tidy

Tidy DocumentTermMatrix, TermDocumentMatrix, and related objects from the tm package

tidy_triplet

Utility function to tidy a simple triplet matrix

tidy

Tidy a Corpus object from the tm package

tidytext

tidytext: Text Mining using 'dplyr', 'ggplot2', and Other Tidy Tools

unnest_tokens

Split a column into tokens using the tokenizers package