llamaR: Interface for Large Language Models via 'llama.cpp'

Provides R bindings to 'llama.cpp' for running large language models locally, with optional GPU acceleration via 'ggmlR'. Supports text generation, embeddings, chat-based workflows, tool calling, and multimodal (vision) inference. Includes 'OpenAI'- and 'Anthropic'-compatible HTTP servers for serving local models, along with device selection and multi-GPU support.

Version: 0.2.5
Depends: R (≥ 4.1.0), ggmlR
Imports: jsonlite, stats, tools, utils
LinkingTo: ggmlR
Suggests: testthat (≥ 3.0.0), withr, drogonR, later, ellmer, callr, knitr, rmarkdown
Published: 2026-07-02
DOI: 10.32614/CRAN.package.llamaR
Author: Yuri Baramykov [aut, cre], Georgi Gerganov [cph] (Author of the 'llama.cpp' library included in src/)
Maintainer: Yuri Baramykov <lbsbmsu at mail.ru>
BugReports: https://github.com/Zabis13/llamaR/issues
License: MIT + file LICENSE
URL: https://github.com/Zabis13/llamaR
NeedsCompilation: yes
SystemRequirements: C++17, GNU make
Materials: README, NEWS
CRAN checks: llamaR results

Documentation:

Reference manual: llamaR.html, llamaR.pdf
Vignettes: Chat and Agents (source), Getting Started with llamaR (source)

Downloads:

Package source: llamaR_0.2.5.tar.gz
Windows binaries: r-devel: llamaR_0.2.4.zip, r-release: llamaR_0.2.4.zip, r-oldrel: llamaR_0.2.4.zip
macOS binaries: r-release (arm64): llamaR_0.2.5.tgz, r-oldrel (arm64): llamaR_0.2.5.tgz, r-release (x86_64): llamaR_0.2.5.tgz, r-oldrel (x86_64): llamaR_0.2.5.tgz
Old sources: llamaR archive

Linking:

Please use the canonical form https://CRAN.R-project.org/package=llamaR to link to this page.