Venesa

An autonomous, programmable AI platform for Windows that transforms natural language into executable system workflows.

Windows only. MIT licensed. Entirely local except for the LLM and speech APIs you configure.

What is Venesa?

Venesa is a desktop AI assistant for Windows built on a fundamentally different premise than most voice assistants or chatbots. It does not treat the AI as a conversational interface - it treats the AI as an execution engine. When you ask Venesa to do something, the language model formulates a structured plan and the platform executes it as real system operations: launching apps, managing files, controlling settings, running scripts, and more.

The result is an assistant that can take a complex, loosely-worded request and carry it out end-to-end - autonomously, silently, or with confirmation steps where appropriate - without you needing to specify every instruction manually.

You interact with Venesa by voice (wake word or push-to-talk) or through the text interface. The assistant remembers your preferences and context across sessions, so it gets more useful the more you use it.

What Venesa Can Do

System Automation

Venesa ships with a full set of built-in capabilities covering common Windows operations:

Launch and close applications
Control system volume, display brightness, and power state
Manage windows (minimize, snap, focus, close all)
Read and write the clipboard
Search, open, move, copy, and delete files
Run sandboxed PowerShell scripts
Take screenshots
Open URLs and perform Google searches
Set reminders

Multi-Step Workflows

Where Venesa stands apart is in orchestration. Rather than executing one command at a time, it can decompose a single request into a serialized sequence of steps - a plan - and execute them in order, passing results between steps automatically.

Examples of what this enables:

"Copy everything in my clipboard and search Google for it."

"Close all open apps, set the volume to zero, and lock the screen."

"Take a screenshot of my desktop, save it to my Documents folder, and open it in Paint."

"Find all PDF files modified in the last week and list them out."

Each step in a plan carries a visibility marker - it can run silently in the background, announce its result, or pause and ask for your confirmation before proceeding. This gives you fine-grained control over how automated the assistant behaves for any given action.

Voice Interaction

Venesa runs a continuous, offline wake-word listener using the Vosk speech model. When the wake phrase is detected, it captures your speech, transcribes it via ElevenLabs Scribe, sends it through the reasoning engine, and responds through a floating voice overlay - speaking the response aloud with ElevenLabs Flash v2.5 synthesis. The voice overlay is always-on-top and unobtrusive.

The wake word, voice style, and speech behavior are all configurable.

Persistent Memory

Every interaction can write to named memory buckets - preferences, aliases, reminders, and conversation history - that are injected into the AI's context on every subsequent query. Venesa remembers your name, your preferences, custom command aliases, and anything else you tell it to keep. There is no cloud sync; everything is stored locally in %USERPROFILE%\.venesa\memory.json.

Dynamic Interfaces

When a query warrants structured output - a list of running processes, disk usage, network adapters, installed apps - the AI renders it as a formatted UI element directly in the interface: tables, key-value grids, card lists, or command lists. The layout is determined by the response content, not hardcoded.

Flexibility: The Capability System

Venesa's built-in features are only the starting point. The platform is designed to be extended with capabilities - single JavaScript files that give the AI a new skill. Once a capability is installed, it becomes part of the AI's tool chain and can be invoked by name in any query or as a step in a plan.

How Capabilities Work

A capability is a CommonJS module that exports an object describing a skill: its name, a description that gets injected into the AI's system prompt so the model knows when to use it, a Zod schema for parameter validation, and an async handler that performs the actual work.

When the AI decides to use a capability, the platform validates the parameters against the schema and calls the handler. Errors are isolated - a broken capability cannot take down the rest of the system.

Capabilities can return data for the AI to reason about further, trigger UI renders, write to memory, perform system actions, or do a combination of all of these.

Community Capabilities

The venesa-capabilities repository is the official registry of community-built capabilities. It currently includes skills for:

Checking disk usage, network adapters, and system info
Listing running processes and installed applications
Fetching weather forecasts
Searching YouTube and Google
Managing persistent notes
Retrieving saved WiFi credentials
Cleaning up temporary files and junk

...and more. The registry is growing.

Installing community capabilities requires no downloads or file management. From inside Venesa:

Open Settings → Skills & Capabilities
Select the Community tab
Browse the live registry, then click Install on any capability
It's ready to use now.

Installed capabilities are indistinguishable from built-in core skills at runtime.

Getting Started

Requirements:

Windows 10 or 11 (x64)
Node.js v18.0.0 or higher
pnpm (npm install -g pnpm)
A Google AI Studio API key (Gemini)
An ElevenLabs API key (STT + TTS)

Install and run:

git clone https://github.com/mianjunaid1223/Venesa.git
cd Venesa
pnpm install
pnpm start

The first launch opens a setup wizard where you configure your API keys and preferences. No manual .env editing required, though you can also create one:

GEMINI_API_KEY=your_key
ELEVENLABS_API_KEY=your_key

Append _1, _2, etc. to any key to enable automatic round-robin rotation when rate limits are hit.

For verbose logging:

pnpm dev

To build a distributable installer:

pnpm build

Contributing

There are two ways to contribute to Venesa:

1. Contribute to the Platform

If you want to improve the core assistant - fix bugs, improve orchestration, extend the speech pipeline, add new built-in capabilities, or improve the UI - open a pull request on this repository.

Before contributing, read ARCHITECTURE.md to understand the execution pipeline and module responsibilities, and standards.md for code conventions, capability schema rules, and security requirements. All contributions must conform to the unified protocol standard described there.

2. Publish a Community Capability

If you want to give Venesa a new skill without modifying the core platform, build and submit a capability to the venesa-capabilities repository. One file. No dependencies on the main project required. Venesa handles dependencies (which you just provide in its manifest) and versions automatically.

A capability is a single .js file dropped into the /capabilities/ folder of that repository. Once merged, it becomes available to all Venesa users via the in-app community browser.

Minimal example:

const { z } = require("zod");

module.exports = {
  name: "myCapability",
  description:
    "Describe what this does - the AI reads this to decide when to use it.",
  returnType: "action", // data | action | ui | memory | hybrid
  marker: "announce", // silently | announce | confirm
  schema: z.object({
    input: z.string().describe("The input value"),
  }),
  handler: async ({ input }) => {
    return { success: true, result: `Processed: ${input}` };
  },
};

Full specification - schema options, return types, lifecycle hooks, UI rendering hints - is documented in the venesa-capabilities repository. The same rules apply whether a capability is built-in or community-contributed.

Documentation

Document	Contents
ARCHITECTURE.md	Full internal architecture: pipeline diagrams, module responsibilities, IPC design, memory system, speech stack, capability lifecycle
standards.md	Capability schema specification, code conventions, security requirements, governance rules
venesa-capabilities	Community capability registry and development guide

License

MIT. See LICENSE for details.

Developed by Mian Junaid

Name		Name	Last commit message	Last commit date
Latest commit History 67 Commits
.github/ISSUE_TEMPLATE		.github/ISSUE_TEMPLATE
assets		assets
models		models
src		src
.env.example		.env.example
.env.production		.env.production
.gitattributes		.gitattributes
.gitignore		.gitignore
ARCHITECTURE.md		ARCHITECTURE.md
README.md		README.md
electron-builder.json		electron-builder.json
package-lock.json		package-lock.json
package.json		package.json
pnpm-lock.yaml		pnpm-lock.yaml
pnpm-workspace.yaml		pnpm-workspace.yaml
standards.md		standards.md
venesa.standard.json		venesa.standard.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Venesa

What is Venesa?

What Venesa Can Do

System Automation

Multi-Step Workflows

Voice Interaction

Persistent Memory

Dynamic Interfaces

Flexibility: The Capability System

How Capabilities Work

Community Capabilities

Getting Started

Contributing

1. Contribute to the Platform

2. Publish a Community Capability

Documentation

License

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Venesa

What is Venesa?

What Venesa Can Do

System Automation

Multi-Step Workflows

Voice Interaction

Persistent Memory

Dynamic Interfaces

Flexibility: The Capability System

How Capabilities Work

Community Capabilities

Getting Started

Contributing

1. Contribute to the Platform

2. Publish a Community Capability

Documentation

License

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages