Tools to clean and process text. Tools are geared at checking for substrings that are not optimal for analysis and replacing or removing them (normalizing) with more analysis friendly substrings (see Sproat, Black, Chen, Kumar, Ostendorf, & Richards (2001) <doi:10.1006/csla.2001.0169>) or extracting them into new variables. For example, emoticons are often used in text but not always easily handled by analysis algorithms. The replace_emoticon() function replaces emoticons with word equivalents.
| Version: | 0.9.7 |
| Depends: | R (≥ 3.4.0) |
| Imports: | data.table, english (≥ 1.0-2), glue (≥ 1.3.0), lexicon (≥ 1.0.0), mgsub (≥ 1.5.0), qdapRegex, stringi, textshape (≥ 1.0.1), utils |
| Suggests: | hunspell, testthat |
| Published: | 2026-03-05 |
| DOI: | 10.32614/CRAN.package.textclean |
| Author: | Tyler Rinker [aut, cre], ctwheels StackOverflow [ctb], Surin Space [ctb] |
| Maintainer: | Tyler Rinker <tyler.rinker at gmail.com> |
| BugReports: | https://github.com/trinker/textclean/issues |
| License: | GPL-2 |
| URL: | https://github.com/trinker/textclean |
| NeedsCompilation: | no |
| Citation: | textclean citation info |
| Materials: | README, NEWS |
| CRAN checks: | textclean results |
| Reference manual: | textclean.html , textclean.pdf |
| Package source: | textclean_0.9.7.tar.gz |
| Windows binaries: | r-devel: textclean_0.9.7.zip, r-release: textclean_0.9.7.zip, r-oldrel: textclean_0.9.7.zip |
| macOS binaries: | r-release (arm64): textclean_0.9.7.tgz, r-oldrel (arm64): textclean_0.9.7.tgz, r-release (x86_64): textclean_0.9.7.tgz, r-oldrel (x86_64): textclean_0.9.7.tgz |
| Old sources: | textclean archive |
| Reverse imports: | fobitools, NUSS, SemanticDistance, sentimentr, spell.replacer, text2emotion, textstem, upstartr |
| Reverse suggests: | LilRhino |
Please use the canonical form https://CRAN.R-project.org/package=textclean to link to this page.
Need a high-speed mirror for your open-source project?
Contact our mirror admin team at info@clientvps.com.
This archive is provided as a free public service to the community.
Proudly supported by infrastructure from VPSPulse , RxServers , BuyNumber , UnitVPS , OffshoreName and secure payment technology by ArionPay.