Interface for Large Language Models via 'llama.cpp' [R package llamaR version 0.2.5]

Yuri Baramykov

llamaR: Interface for Large Language Models via 'llama.cpp'

Provides R bindings to 'llama.cpp' for running large language models locally, with optional GPU acceleration via 'ggmlR'. Supports text generation, embeddings, chat-based workflows, tool calling, and multimodal (vision) inference. Includes 'OpenAI'- and 'Anthropic'-compatible HTTP servers for serving local models, along with device selection and multi-GPU support.

Version:	0.2.5
Depends:	R (≥ 4.1.0), ggmlR
Imports:	jsonlite, stats, tools, utils
LinkingTo:	ggmlR
Suggests:	testthat (≥ 3.0.0), withr, drogonR, later, ellmer, callr, knitr, rmarkdown
Published:	2026-07-02
DOI:	10.32614/CRAN.package.llamaR
Author:	Yuri Baramykov [aut, cre], Georgi Gerganov [cph] (Author of the 'llama.cpp' library included in src/)
Maintainer:	Yuri Baramykov <lbsbmsu at mail.ru>
BugReports:	https://github.com/Zabis13/llamaR/issues
License:	MIT + file LICENSE
URL:	https://github.com/Zabis13/llamaR
NeedsCompilation:	yes
SystemRequirements:	C++17, GNU make
Materials:	README, NEWS
CRAN checks:	llamaR results

Documentation:

Reference manual:	llamaR.html , llamaR.pdf
Vignettes:	Chat and Agents (source) Getting Started with llamaR (source)

Downloads:

Package source:	llamaR_0.2.5.tar.gz
Windows binaries:	r-devel: llamaR_0.2.4.zip, r-release: llamaR_0.2.4.zip, r-oldrel: llamaR_0.2.4.zip
macOS binaries:	r-release (arm64): llamaR_0.2.5.tgz, r-oldrel (arm64): llamaR_0.2.5.tgz, r-release (x86_64): llamaR_0.2.5.tgz, r-oldrel (x86_64): llamaR_0.2.5.tgz
Old sources:	llamaR archive

Linking:

Please use the canonical form https://CRAN.R-project.org/package=llamaR to link to this page.