Conversation

@mmnga (Contributor) commented Nov 16, 2025

This PR adds support for the PLaMo-3 series (2B, 8B, and 31B base models):

PLaMo-3 uses a hybrid architecture with Sliding Window Attention (SWA) and standard full attention layers, as well as a custom FFN layout. This PR wires those pieces into llama.cpp so that the official checkpoints can be converted to GGUF and run with the usual backends.
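To illustrate the hybrid layout described above, here is a minimal sketch (not llama.cpp code) of how a stack might interleave sliding-window and full-attention layers. The layer pattern (`swa_pattern`) and helper names here are assumptions for illustration; the actual PLaMo-3 layer schedule comes from the checkpoint's config.

```python
from typing import List, Optional


def is_swa_layer(layer_idx: int, swa_pattern: int = 4) -> bool:
    """Assumed pattern: every swa_pattern-th layer is full attention,
    the rest use SWA. The real schedule is model-config driven."""
    return (layer_idx + 1) % swa_pattern != 0


def attention_mask(n_tokens: int, sliding_window: Optional[int]) -> List[List[bool]]:
    """Causal visibility mask; optionally restricted to a sliding window.

    mask[q][k] is True when query position q may attend to key position k.
    With a window of size w, a query only sees the last w positions."""
    mask = []
    for q in range(n_tokens):
        row = []
        for k in range(n_tokens):
            visible = k <= q  # causal: no attending to the future
            if sliding_window is not None:
                visible = visible and (q - k < sliding_window)
            row.append(visible)
        mask.append(row)
    return mask
```

Under this assumed pattern, `is_swa_layer(3)` is `False` (a full-attention layer), and `attention_mask(4, 2)` limits each query to itself and the previous token, which is the mechanism that keeps SWA layers' KV cache bounded.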

@github-actions github-actions bot added model Model specific python python script changes labels Nov 16, 2025
@mmnga mmnga closed this Nov 16, 2025
@mmnga mmnga reopened this Nov 16, 2025
@mmnga mmnga marked this pull request as ready for review November 17, 2025 09:49
@mmnga mmnga requested review from CISC and ggerganov as code owners November 17, 2025 09:49
@CISC (Collaborator) commented Nov 17, 2025

Any non-gated models available?

@mmnga (Contributor, Author) commented Nov 17, 2025

There are no non-gated models available at the moment.

@mmnga (Contributor, Author) commented Nov 17, 2025

Sorry, the checks failed, so I’m reverting it to draft for now.

@mmnga mmnga marked this pull request as draft November 17, 2025 13:34
@CISC (Collaborator) commented Nov 17, 2025

> Sorry, the checks failed, so I’m reverting it to draft for now.

The nvidia-vulkan-cm CI failures are unrelated if that's what you're referring to...

@mmnga mmnga marked this pull request as ready for review November 18, 2025 15:32
@mmnga
Copy link
Contributor Author

mmnga commented Nov 18, 2025

I’ve reopened this PR. Thank you in advance.
