preprocess.Rd
Remove stopwords from a character string.
preprocess(s, ..., filters = c("strip_tags", "strip_punctuation", "strip_multiple_spaces", "strip_numeric", "remove_stopwords", "strip_short", "stem_text"), to_lower = TRUE) # S3 method for character preprocess(s, ..., filters = c("strip_tags", "strip_punctuation", "strip_multiple_spaces", "strip_numeric", "remove_stopwords", "strip_short", "stem_text"), to_lower = TRUE) # S3 method for list preprocess(s, ..., filters = c("strip_tags", "strip_punctuation", "strip_multiple_spaces", "strip_numeric", "remove_stopwords", "strip_short", "stem_text"), to_lower = TRUE) # S3 method for data.frame preprocess(s, text, ..., filters = c("strip_tags", "strip_punctuation", "strip_multiple_spaces", "strip_numeric", "remove_stopwords", "strip_short", "stem_text"), to_lower = TRUE)
s | A Character string or data.frame. |
---|---|
... | Any other options. |
filters | Filters to apply, see filter section. |
to_lower | Whether to convert to lowercase before processing. |
text | bare name of text column. |
strip_tags
strip_punctuation
strip_multiple_spaces
strip_numeric
remove_stopwords
strip_short
stem_text