Preprocess text by applying a sequence of filters, including stopword removal, to a character string, list, or data.frame.

preprocess(s, ..., filters = c("strip_tags", "strip_punctuation",
  "strip_multiple_spaces", "strip_numeric", "remove_stopwords",
  "strip_short", "stem_text"), to_lower = TRUE)

# S3 method for character
preprocess(s, ..., filters = c("strip_tags",
  "strip_punctuation", "strip_multiple_spaces", "strip_numeric",
  "remove_stopwords", "strip_short", "stem_text"), to_lower = TRUE)

# S3 method for list
preprocess(s, ..., filters = c("strip_tags",
  "strip_punctuation", "strip_multiple_spaces", "strip_numeric",
  "remove_stopwords", "strip_short", "stem_text"), to_lower = TRUE)

# S3 method for data.frame
preprocess(s, text, ..., filters = c("strip_tags",
  "strip_punctuation", "strip_multiple_spaces", "strip_numeric",
  "remove_stopwords", "strip_short", "stem_text"), to_lower = TRUE)

Arguments

s

A character string, list, or data.frame.

...

Any other options.

filters

Character vector naming the filters to apply; see the Filters section.

to_lower

Whether to convert to lowercase before processing.

text

Bare (unquoted) name of the text column; used by the data.frame method only.
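
The calls below sketch how the arguments fit together for the character and data.frame methods. They assume the package that exports preprocess() is already attached. The sample documents, the df data frame, and its text column name are made up for illustration, and since the return value is not described on this page, no output is shown.

```r
# Made-up sample documents (not from the package).
docs <- c(
  "<p>The quick brown fox jumps over the lazy dog.</p>",
  "Stopwords, numbers like 42 and   extra spaces get filtered."
)
df <- data.frame(id = 1:2, text = docs, stringsAsFactors = FALSE)

# Guarded so the snippet still runs when preprocess() is not available.
if (exists("preprocess", mode = "function")) {
  # Character method with the default filters.
  res_default <- preprocess(docs)

  # Only a subset of filters, keeping the original case.
  res_subset <- preprocess(
    docs,
    filters = c("strip_tags", "strip_punctuation", "strip_multiple_spaces"),
    to_lower = FALSE
  )

  # data.frame method: the text column is passed as a bare name.
  res_df <- preprocess(df, text)
}
```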

Filters

  • strip_tags

  • strip_punctuation

  • strip_multiple_spaces

  • strip_numeric

  • remove_stopwords

  • strip_short

  • stem_text
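
The filters are listed by name only, so the following base R sketch illustrates what each name suggests. It is not the package's implementation: sketch_filters, drop_words, and sketch_preprocess are hypothetical helpers, the stopword list and the three-character minimum are assumptions, and stem_text is left out because base R has no stemmer.

```r
# Hypothetical helper: drop the words for which `drop` returns TRUE.
drop_words <- function(x, drop) {
  vapply(strsplit(x, " +"), function(words) {
    words <- words[nzchar(words)]
    paste(words[!drop(words)], collapse = " ")
  }, character(1), USE.NAMES = FALSE)
}

# Hypothetical stand-ins for the named filters, base R only.
sketch_filters <- list(
  strip_tags            = function(x) gsub("<[^>]+>", " ", x),
  strip_punctuation     = function(x) gsub("[[:punct:]]+", " ", x),
  strip_multiple_spaces = function(x) trimws(gsub("[[:space:]]+", " ", x)),
  strip_numeric         = function(x) gsub("[[:digit:]]+", "", x),
  # Assumed toy stopword list; a real one would be much longer.
  remove_stopwords      = function(x) {
    stopwords <- c("the", "a", "an", "and", "of", "is", "in")
    drop_words(x, function(w) w %in% stopwords)
  },
  # Assumed minimum word length of three characters.
  strip_short           = function(x) drop_words(x, function(w) nchar(w) < 3)
)

# Apply the chosen filters in the order given, optionally lowercasing first.
sketch_preprocess <- function(s, filters = names(sketch_filters),
                              to_lower = TRUE) {
  stopifnot(is.character(s), all(filters %in% names(sketch_filters)))
  if (to_lower) s <- tolower(s)
  for (f in filters) s <- sketch_filters[[f]](s)
  s
}

sketch_preprocess("<p>The 3 quick brown foxes, jumping in   2019!</p>")
# "quick brown foxes jumping"
```

Because the filters run in the order given, reordering the filters vector can change the result: for example, stripping short words before removing punctuation would treat "in," as a three-character word and keep it.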