textclean: Text Cleaning Tools

Tools to clean and process text. Tools are geared at checking for substrings that are not optimal for analysis and replacing or removing them (normalizing) with more analysis friendly substrings (see Sproat, Black, Chen, Kumar, Ostendorf, & Richards (2001) <doi:10.1006/csla.2001.0169>). For example, emoticons are often used in text but not always easily handled by analysis algorithms. The 'replace_emoticon()' function replaces emoticons with word equivalents.

Version: 0.6.3
Depends: R (≥ 3.2.3)
Imports: data.table, english (≥ 1.0-2), lexicon (≥ 0.7.4), qdapRegex, stringi, textshape (≥ 1.0.1), utils
Suggests: testthat
Published: 2018-01-14
Author: Tyler Rinker [aut, cre], ctwheels StackOverflow [ctb]
Maintainer: Tyler Rinker <tyler.rinker at gmail.com>
BugReports: https://github.com/trinker/textclean/issues
License: GPL-2
URL: https://github.com/trinker/textclean
NeedsCompilation: no
Citation: textclean citation info
Materials: README NEWS
CRAN checks: textclean results


Reference manual: textclean.pdf
Package source: textclean_0.6.3.tar.gz
Windows binaries: r-devel: textclean_0.6.3.zip, r-release: textclean_0.6.3.zip, r-oldrel: textclean_0.6.3.zip
OS X El Capitan binaries: r-release: textclean_0.6.3.tgz
OS X Mavericks binaries: r-oldrel: textclean_0.5.1.tgz
Old sources: textclean archive

Reverse dependencies:

Reverse imports: sentimentr, syllable, textstem


Please use the canonical form https://CRAN.R-project.org/package=textclean to link to this page.