Welcome to ClientVPS Mirrors

CRAN: Package llmclean

llmclean: 'LLM'-Assisted Data Cleaning with Multi-Provider Support

Detects and suggests fixes for semantic inconsistencies in data frames by calling large language models (LLMs) through a unified, provider-agnostic interface. Supported providers include 'OpenAI' ('GPT-4o', 'GPT-4o-mini'), 'Anthropic' ('Claude'), 'Google' ('Gemini'), 'Groq' (free-tier 'LLaMA' and 'Mixtral'), and local 'Ollama' models. The package identifies issues that rule-based tools cannot detect: abbreviation variants, typographic errors, case inconsistencies, and malformed values. Results are returned as tidy data frames with column, row index, detected value, issue type, suggested fix, and confidence score. An offline fallback using statistical and fuzzy-matching methods is provided for use without any API key. Interactive fix application with human review is supported via 'apply_fixes()'. Methods follow de Jonge and van der Loo (2013) <https://cran.r-project.org/doc/contrib/de_Jonge+van_der_Loo-Introduction_to_data_cleaning_with_R.pdf> and Chaudhuri et al. (2003) <doi:10.1145/872757.872796>.

Version: 0.1.0
Depends: R (≥ 4.1.0)
Imports: stats, utils, dplyr (≥ 1.0.0), rlang (≥ 1.0.0)
Suggests: knitr, rmarkdown, testthat (≥ 3.0.0), httr2 (≥ 1.0.0), jsonlite (≥ 1.8.0)
Published: 2026-04-22
DOI: 10.32614/CRAN.package.llmclean
Author: Sadikul Islam ORCID iD [aut, cre]
Maintainer: Sadikul Islam <sadikul.islamiasri at gmail.com>
License: GPL-3
NeedsCompilation: no
CRAN checks: llmclean results

Documentation:

Reference manual: llmclean.html , llmclean.pdf
Vignettes: LLM-Assisted Data Cleaning with llmclean (source, R code)

Downloads:

Package source: llmclean_0.1.0.tar.gz
Windows binaries: r-devel: llmclean_0.1.0.zip, r-release: llmclean_0.1.0.zip, r-oldrel: llmclean_0.1.0.zip
macOS binaries: r-release (arm64): llmclean_0.1.0.tgz, r-oldrel (arm64): llmclean_0.1.0.tgz, r-release (x86_64): llmclean_0.1.0.tgz, r-oldrel (x86_64): llmclean_0.1.0.tgz

Linking:

Please use the canonical form https://CRAN.R-project.org/package=llmclean to link to this page.

Need a high-speed mirror for your open-source project?
Contact our mirror admin team at info@clientvps.com.

This archive is provided as a free public service to the community.
Proudly supported by infrastructure from VPSPulse , RxServers , BuyNumber , UnitVPS , OffshoreName and secure payment technology by ArionPay.