GitHub - bastiangx/wordserve: Prefix completion engine, designed for any msgpack clients in need of a fast and minimal Go server!

Lightweight prefix completion library | server, designed for any MessagePack clients

Example usage of wordserve suggestions engine in a client app

Report a Bug · Request a Feature

What's it about?

WordServe is a minimalistic and high performance prefix completion library with a server executable written in Go.

Its designed to provide auto-completion for various clients, especially those using MessagePack as a serialization format.

Why?

So many tools and apps I use on daily basis do not offer any form of word completion, AI/NLP driven or otherwise, there are times when I need to quickly find a word or phrase that I know exists in my vocabulary, but I have no idea how to spell it or don't feel like typing for that long.

Why not make my own tool that can power any TS/JS/etc clients with a completion server?

Similar to?

Think of this as a elementary nvim-cmp or vscode Intellisense daemon, but for any plugin/app that can use a MessagePack client. (which is super easy to implement and use compared to JSON parsing btw, in fact, about 411% improvement in speed and 40% reduction in payload sizes)

This is my first attempt on creating a small scaled but usable Go server/library. Expect unstable or incomplete features, as well as some bugs. I primarily made this for myself so I can make a completion plugin for Obsidian but hey, you might find it useful too!

Prerequisites

Go 1.22 or later
Luajit 2.1 (only for dictionary build scripts)
- A simple words.txt file for building the dictionary with most used words and their corresponding frequencies -- see dictionary for more info

Installation

Go

using go install (Recommended):

go install github.com/bastiangx/wordserve/cmd/wordserve@latest

Library Dependency

use go get to add wordserve as a dependency in your project:

go get github.com/bastiangx/wordserve

and then import it in your code:

import "github.com/bastiangx/wordserve/pkg/suggest"

Releases

Download the latest precompiled binaries from the releases page.

wordserve automatically downloads and initializes the needed dictionary files from GitHub releases
The dictionary files (dict_*.bin) are packaged in data.zip and the word list (words.txt) is available as a separate download
If automatic download fails, you can manually download data.zip and words.txt from the releases page and extract them to the data/ directory

If you're not sure, use 'go install'.

Building from source

You can also clone via git and build the old fashioned way:

git clone https://github.com/bastiangx/wordserve.git
cd wordserve
# -w -s strips debug info & symbols | alias wserve
go build -ldflags="-w -s" -o wserve ./cmd/wordserve/main.go

The build process for the dict files is handled by the wordserve binary, If you encounter any issues, you can manually run the build script located in scripts/build-data.lua using LuaJIT.

Make sure the data/ directory exists and has the words.txt file in it before running this.

luajit scripts/build-data.lua

What can it do?

Batched Word Suggestions

WordServe returns suggestions in batches using a radix trie. Memory pools handle rapid queries without triggering garbage collection.

Responsive

The IPC server communicates through stdin/stdout channels with minimal protocol overhead.

Goroutines handle multiple client connections simultaneously.

Capital letters

It just works!

Compact MessagePack Protocol

Binary MessagePack encoding keeps request and response payloads as small as possible.

Many Many Words

Start with a simple words.txt file containing 65,000+ entries.

WordServe chunks the dictionary into binary trie files and loads only what's needed, dynamically managing memory based on usage patterns.

Small memory usage

Memory usage of WordServe shown to be around 20MB with 50K words loaded in

WordServe's memory usage remains low even with large dictionaries, typically around 20MB for 50,000 words default. Even after expanding many nodes and normal usage for few hours, it stays under 60MB and has checks to shrink periodically.

What can it not do?

As this is the early version and Beta, there are many features that are yet not implemented

simple fuzzy matching
string searching algo (haystack-needle)
integrated spelling correction (aspell)
support conventional dict formats like .dict

Will monitor the issues and usage to see if enough people are interested.

Usage

Standalone server

you can run wordserve as a dependency in your Go project, a standalone IPC server. A simple CLI is also provided for testing and debugging.

Library

The library provides simple to use API for prefix completion requests and dictionary management.

Read all about using them in the API doc

More comprehensive and verbose Go Package docs

completer := suggest.NewLazyCompleter("./data", 10000, 50000)

if err := completer.Initialize(); err != nil {
    log.Fatalf("Failed to initialize: %v", err)
}

suggestions := completer.Complete("amer", 10)

or for static check:

completer := suggest.NewCompleter()

completer.AddWord("example", 500)
completer.AddWord("excellent", 400)

suggestions := completer.Complete("ex", 5)

You can inspect the informal flow diagram on the core internals:

A flow diagram of WordServe's core internals

Client Integration

The Client doc gives some guide on how to use WordServe in your TS/JS app.

import { spawn, ChildProcess } from "child_process";
import { encode, decode } from "@msgpack/msgpack";

class WordServeClient {
  private process: ChildProcess;
  private requestId = 0;

  constructor(binaryPath: string = "wordserve") {
    this.process = spawn(binaryPath, [], {
      stdio: ["pipe", "pipe", "pipe"],
    });
  }

  async getCompletions(
    prefix: string,
    limit: number = 20,
  ): Promise<Suggestion[]> {
    const request = {
      id: `req_${++this.requestId}`,
      p: prefix,
      l: limit, // (optional)
    };

    const binaryRequest = encode(request);
    this.process.stdin!.write(binaryRequest);

    return new Promise((resolve, reject) => {
      this.process.stdout!.once("data", (data: Buffer) => {
        try {
          const response = decode(data) as CompletionResponse;
          const suggestions = response.s.map((s, index) => ({
            word: s.w,
            rank: s.r,
            frequency: 65536 - s.r, // Convert rank back to freq score
          }));
          resolve(suggestions);
        } catch (error) {
          reject(error);
        }
      });
    });
  }
}

CLI

Learn how to use it in the CLI doc

Flags

wordserve [flags]

Flag	Description	Default Value
-version	Show current version	false
-config	Path to custom config.toml file	""
-data	Directory containing the binary files	"data/"
-v	Toggle verbose mode	false
-c	Run CLI -- useful for testing and debugging	false
-limit	Number of suggestions to return	10
-prmin	Minimum Prefix length for suggestions (1 < n <= prmax)	3
-prmax	Maximum Prefix length for suggestions	24
-no-filter	Disable input filtering (DBG only) - shows all raw dictionary entries (numbers, symbols, etc)	false
-words	Maximum number of words to load (use 0 for all words)	100,000
-chunk	Number of words per chunk for lazy loading	10,000

Dictionary

Read more about the dictionary design and how it works.

Configuration

Refer to the config doc on how to manage server, send commands to it and change dictionary on runtime.

Development

See the open issues for a list of proposed features (and known issues).

Contributions are welcome! Refer to the contributing guidelines

License

WordServe is licensed under the MIT license. Feel free to edit and distribute this library as you like.

See LICENSE

Acknowledgements

The Beautiful Rosepine theme used for graphics and screenshots throughout the readme.
The Incredible mono font, Berkeley Mono by U.S. Graphics used in screenshots, graphics, gifs and more.

Name		Name	Last commit message	Last commit date
Latest commit History 136 Commits
.github		.github
cmd		cmd
data		data
docs		docs
internal		internal
pkg		pkg
scripts		scripts
.gitignore		.gitignore
.goreleaser.yaml		.goreleaser.yaml
CHANGELOG.md		CHANGELOG.md
LICENSE		LICENSE
README.md		README.md
go.mod		go.mod
go.sum		go.sum

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

What's it about?

Why?

Similar to?

Prerequisites

Installation

Go

Library Dependency

Releases

Building from source

What can it do?

Batched Word Suggestions

Responsive

Capital letters

Compact MessagePack Protocol

Many Many Words

Small memory usage

What can it not do?

Usage

Standalone server

Library

Client Integration

CLI

Flags

Dictionary

Configuration

Development

License

Acknowledgements

About

Topics

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 3

Sponsor this project

Uh oh!

Contributors

Uh oh!

Languages