llama : support LiquidAI LFM2-MoE hybrid model #16464
Merged
+192 −15
Conversation
Add support for the [LiquidAI/LFM2-8B-A1B](https://huggingface.co/LiquidAI/LFM2-8B-A1B) model. For more information about the model, please read [the blog post](https://www.liquid.ai/company/news). [HF PR](huggingface/transformers#41401) · [GGUFs](https://huggingface.co/LiquidAI/LFM2-8B-A1B-GGUF)
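As a quick smoke test, a minimal loader against the public `llama.h` API can confirm that a converted GGUF is recognized. This is a sketch, not part of the PR; the model filename is a placeholder assumption for a locally downloaded quant.

```cpp
// Minimal smoke test: load an LFM2-8B-A1B GGUF via the public llama.h API
// and print its description (architecture, parameter count, etc.).
#include <cstdio>
#include "llama.h"

int main() {
    llama_backend_init();

    llama_model_params mparams = llama_model_default_params();
    // Hypothetical local filename for one of the uploaded GGUF quants.
    llama_model * model = llama_model_load_from_file("LFM2-8B-A1B-Q4_K_M.gguf", mparams);
    if (model == NULL) {
        fprintf(stderr, "failed to load model\n");
        return 1;
    }

    char desc[256];
    llama_model_desc(model, desc, sizeof(desc));
    printf("loaded: %s\n", desc);

    llama_model_free(model);
    llama_backend_free();
    return 0;
}
```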
I will remove it. upd: addressed in fe3b812
CISC approved these changes on Oct 7, 2025.
GGUFs are updated! Tested that
anyshu pushed a commit to anyshu/llama.cpp that referenced this pull request on Oct 10, 2025:
* master: (113 commits)
  - webui: updated the chat service to only include max_tokens in the req… (ggml-org#16489)
  - cpu : optimize the ggml NORM operation (ggml-org#15953)
  - server : host-memory prompt caching (ggml-org#16391)
  - No markdown in cot (ggml-org#16483)
  - model-conversion : add support for SentenceTransformers (ggml-org#16387)
  - ci: add ARM64 Kleidiai build and test support (ggml-org#16462)
  - CANN: Improve ACL graph matching (ggml-org#16166)
  - kleidiai: kernel interface refactoring (ggml-org#16460)
  - [SYCL] refactor soft_max, add soft_max_back (ggml-org#16472)
  - model: EmbeddingGemma Adding Support for SentenceTransformers Dense Modules (ggml-org#16367)
  - refactor: centralize CoT parsing in backend for streaming mode (ggml-org#16394)
  - Disable CUDA host buffers on integrated GPUs (ggml-org#16308)
  - server : fix cancel pending task (ggml-org#16467)
  - metal : mark FA blocks (ggml-org#16372)
  - server : improve context checkpoint logic (ggml-org#16440)
  - ggml webgpu: profiling, CI updates, reworking of command submission (ggml-org#16452)
  - llama : support LiquidAI LFM2-MoE hybrid model (ggml-org#16464)
  - server : add `/v1/health` endpoint (ggml-org#16461)
  - webui : added download action (ggml-org#13552) (ggml-org#16282)
  - presets : fix pooling param for embedding models (ggml-org#16455)
  - ...
boshjerns added a commit to boshjerns/llama.rn that referenced this pull request on Oct 11, 2025:
Updates llama.cpp from b6638 to b6709, adding LFM2-MoE architecture support.

Changes:
- Updated third_party/llama.cpp submodule to b6709
- Synced cpp/ directory via scripts/bootstrap.sh
- Added LLM_ARCH_LFM2MOE for LiquidAI hybrid models
- Updated version.ts to build 6709

Tested with LiquidAI LFM2-1.2B models on iOS.

References:
- Release: https://github.com/ggml-org/llama.cpp/releases/tag/b6709
- PR: ggml-org/llama.cpp#16464
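For context, new architectures in llama.cpp are registered by extending the internal `llm_arch` enum and its name table in `src/llama-arch.{h,cpp}`. The sketch below illustrates that pattern only; the surrounding entries and the exact `"lfm2moe"` string are assumptions based on the upstream code layout, not copied from this PR's diff.

```cpp
// Sketch of llama.cpp's architecture-registration pattern (simplified).
// The table contents here are illustrative assumptions, not the PR's diff.
#include <map>

enum llm_arch {
    LLM_ARCH_LFM2,
    LLM_ARCH_LFM2MOE, // new: LiquidAI LFM2-MoE hybrid
    LLM_ARCH_UNKNOWN,
};

// Maps each architecture to the string stored under the GGUF
// general.architecture metadata key.
static const std::map<llm_arch, const char *> LLM_ARCH_NAMES = {
    { LLM_ARCH_LFM2,    "lfm2"    },
    { LLM_ARCH_LFM2MOE, "lfm2moe" },
};
```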
HF PR is merged.
GGUFs are uploaded and available for testing.