go-mlx

Native Apple Metal GPU inference via mlx-c CGO bindings, implementing the inference.Backend and inference.TextModel interfaces from go-inference for Apple Silicon (M1-M4). Supports Gemma 3, Gemma 4 (dense and MoE), Qwen 2/3, and Llama 3 architectures from HuggingFace safetensors directories and GGUF checkpoints, with fused Metal kernels for RMSNorm, RoPE, scaled dot-product attention, KV cache management, LoRA fine-tuning with AdamW, and batch inference. The root package also exposes an RFC-style direct model API (mlx.LoadModel, model.Generate, model.GenerateStream) and a non-LLM frame-compute API (mlx.NewSession, Session.BeginFrame, Session.FinishFrame, PixelBuffer, KernelRGB565ToRGBA8, KernelNearestScale, KernelScanlineFilter, KernelCRTFilter, KernelSoftenFilter, KernelSharpenFilter) for Apple GPU-accelerated image and emulator workloads. A Python subprocess backend (mlxlm) is provided as a CGO-free alternative. Platform-restricted: darwin/arm64 only; a no-op stub compiles on all other platforms.

Module: dappco.re/go/mlx Licence: EUPL-1.2 Language: Go 1.26

Quick Start

import (
    "context"
    "fmt"

    "dappco.re/go/inference"
    _ "dappco.re/go/mlx"  // registers "metal" backend via init()
)

model, err := inference.LoadModel("/Volumes/Data/lem/safetensors/gemma-3-1b/")
if err != nil {
    panic(err)
}
defer model.Close()

for tok := range model.Generate(context.Background(), "Hello", inference.WithMaxTokens(256)) {
    fmt.Print(tok.Text)
}

Root API

import (
    "fmt"

    mlx "dappco.re/go/mlx"
)

model, err := mlx.LoadModel("/path/to/model",
    mlx.WithContextLength(8192),
    mlx.WithQuantization(4),
    mlx.WithDevice("gpu"),
)
if err != nil {
    panic(err)
}
defer model.Close()

reply, err := model.Generate("Explain Gemma 4 shared KV layers", mlx.WithMaxTokens(128))
if err != nil {
    panic(err)
}
fmt.Println(reply)

Frame Compute

import mlx "dappco.re/go/mlx"

session, err := mlx.NewSession(mlx.WithSessionLabel("frame-pipeline"))
if err != nil {
    panic(err)
}
defer session.Close()

src, _ := session.NewPixelBuffer(mlx.PixelBufferDesc{
    Width:  320,
    Height: 224,
    Stride: 640,
    Format: mlx.PixelRGB565,
})
rgba, _ := session.NewPixelBuffer(mlx.PixelBufferDesc{
    Width:  320,
    Height: 224,
    Stride: 1280,
    Format: mlx.PixelRGBA8,
})
scaled, _ := session.NewPixelBuffer(mlx.PixelBufferDesc{
    Width:  960,
    Height: 672,
    Stride: 3840,
    Format: mlx.PixelRGBA8,
})

frameBytes := make([]byte, src.Descriptor().SizeBytes())
if err := src.Upload(frameBytes); err != nil {
    panic(err)
}
if err := session.BeginFrame(); err != nil {
    panic(err)
}
if err := session.Run(mlx.KernelRGB565ToRGBA8, mlx.KernelArgs{
    Inputs:  map[string]mlx.Buffer{"src": src},
    Outputs: map[string]mlx.Buffer{"dst": rgba},
}); err != nil {
    panic(err)
}
if err := session.Run(mlx.KernelNearestScale, mlx.KernelArgs{
    Inputs:  map[string]mlx.Buffer{"src": rgba},
    Outputs: map[string]mlx.Buffer{"dst": scaled},
}); err != nil {
    panic(err)
}
if err := session.Run(mlx.KernelScanlineFilter, mlx.KernelArgs{
    Inputs:  map[string]mlx.Buffer{"src": scaled},
    Outputs: map[string]mlx.Buffer{"dst": scaled},
    Scalars: map[string]float64{"strength": 0.3},
}); err != nil {
    panic(err)
}
frameMetrics, err := session.FinishFrame()
if err != nil {
    panic(err)
}

finalFrame, err := scaled.Read()
if err != nil {
    panic(err)
}
_ = finalFrame
_ = frameMetrics

Documentation

Compute Guide — frame-oriented Metal compute sessions, pixel buffers, kernels, metrics
Architecture — CGO binding, model architectures, weight loading, KV cache, attention, batch inference, LoRA training, mlxlm backend
Models — model loading, supported architectures, tokenisation, chat templates
Training — LoRA fine-tuning, AdamW, gradient computation, checkpoints
Development Guide — prerequisites (mlx-c CMake build), CGO flags, test patterns, benchmarks
Project History — completed phases, commit hashes, known limitations

Build & Test

git submodule update --init --recursive
go generate ./...        # builds mlx-c C library (required first time)
go test ./...
go build ./...

Licence

European Union Public Licence 1.2 — see LICENCE for details.

Name		Name	Last commit message	Last commit date
Latest commit History 198 Commits
.core		.core
.forgejo/workflows		.forgejo/workflows
cmd/violet		cmd/violet
cpp		cpp
dist/include		dist/include
docs		docs
internal		internal
lib		lib
mlxlm		mlxlm
pkg/daemon		pkg/daemon
tests/cli		tests/cli
.editorconfig		.editorconfig
.gitignore		.gitignore
.gitmodules		.gitmodules
.golangci.yml		.golangci.yml
CLAUDE.md		CLAUDE.md
CMakeLists.txt		CMakeLists.txt
CONTRIBUTING.md		CONTRIBUTING.md
README.md		README.md
adapter.go		adapter.go
adapter_test.go		adapter_test.go
api_common.go		api_common.go
api_darwin.go		api_darwin.go
api_shape_common.go		api_shape_common.go
api_shape_test.go		api_shape_test.go
api_stub.go		api_stub.go
api_test.go		api_test.go
api_tokenizer_darwin.go		api_tokenizer_darwin.go
api_tokenizer_stub.go		api_tokenizer_stub.go
api_tokenizer_test.go		api_tokenizer_test.go
attention_snapshot_test.go		attention_snapshot_test.go
attention_test.go		attention_test.go
backend_common.go		backend_common.go
backend_common_test.go		backend_common_test.go
compute.go		compute.go
compute_darwin.go		compute_darwin.go
compute_darwin_test.go		compute_darwin_test.go
compute_stub.go		compute_stub.go
compute_test.go		compute_test.go
gguf_info.go		gguf_info.go
gguf_info_test.go		gguf_info_test.go
go.mod		go.mod
go.sum		go.sum
medium.go		medium.go
mlx.go		mlx.go
mlx_stub.go		mlx_stub.go
mlx_test.go		mlx_test.go
options_darwin.go		options_darwin.go
register_metal.go		register_metal.go
register_metal_stub.go		register_metal_stub.go
register_metal_test.go		register_metal_test.go
test_cgo_d		test_cgo_d
tokenizer_common.go		tokenizer_common.go
training.go		training.go
training_stub.go		training_stub.go
unsupported_stub_test.go		unsupported_stub_test.go

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

go-mlx

Quick Start

Root API

Frame Compute

Documentation

Build & Test

Licence

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

go-mlx

Quick Start

Root API

Frame Compute

Documentation

Build & Test

Licence

About

Resources

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages