main into nvidia branch by meghsat · Pull Request #4 · meghsat/lemonade

meghsat · 2026-01-06T19:24:59Z

No description provided.

* fix: bypass OpenAI client for chat and completion requests to avoid Pydantic serialization issues * fix: remove unused json import in wrapped_server.py * chore: update .gitignore to include Claude, Bruno, and Cursor files * fix: update OpenAI package version and adjust API calls in wrapped_server.py * fix: add logprobs parameter to response output in Server class

* Remove python nsis installer and related tests * Remove the 'beta' tag from the C++ server * rev version numbers * Update install instructions * Remove most instances of lemonade-server-dev * Documentation updates * Add python deprecation notice * Improve deprecation notice * Fix make_http_request() bug * Change default host to localhost and fix bugs * Status function checks ipv4 and 6

…container (lemonade-sdk#723) * add: added docker build/run instruction for lemonade cpp * add: updated ubuntu to support the new cmake version requirements * add: added refrence to devcontainers * add: added note regarding folder structure: --------- Co-authored-by: Jeremy Fowers <80718789+jeremyfowers@users.noreply.github.com>

jeremyfowers and others added 30 commits November 7, 2025 07:51

C++: linux CLI fixes and new tests (lemonade-sdk#534)

0c2bc73

* C++: Improve linux CLI experience * Show error messages on cli failures * Fix status/stop on linux run * Clean up the run command * extend testing

Never use SSL libs during build (lemonade-sdk#535)

aa16561

Fix dll issue in RAI 1.6 (lemonade-sdk#530)

c88b274

* add dll copying to path * upgrade to new rai whl * update version check

C++: Improve model endpoint performance (lemonade-sdk#541)

e8eb275

Add enable_thinking toggle to web UI model settings (lemonade-sdk#531)

94e1482

C++: Add support for HF_HUB_CACHE (lemonade-sdk#542)

2c6d20f

Simplify and fix the web ui (lemonade-sdk#544)

30b451f

C++: Enable stats endpoint, add prompt_tokens (lemonade-sdk#543)

9445da7

* Rev version numvers for 8.2.1 / 9.0.1-beta release * Enable the stats endpoint * Remove unused code * add prompt tokens to stats * update docs * update docs again

C++: custom llamacpp args; env vars to override cli defaults (lemonade-sdk#547)

ac7a450

* C++: env vars to override cli defaults * Enable custom backend arguments for llamacpp * Update docs

C++: Users can change the llamacpp build without rebuilding lemonade (lemonade-sdk#548)

6cd1493

* C++: Make it easier to change the llamacpp build * Clean up install directory

C++: clean up the --host arg (lemonade-sdk#551)

1f25d0f

Fix: use the model's commit hash in cache path (lemonade-sdk#557)

554bf64
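
For context on the HF_HUB_CACHE and commit-hash cache-path commits above: the Hugging Face Hub cache stores each model revision under a `snapshots/<commit hash>` folder, and `HF_HUB_CACHE` overrides the cache root. The sketch below illustrates that documented layout in Python. It is not the Lemonade C++ code, which this page does not show; the helper names `hub_cache_dir` and `snapshot_dir` are hypothetical, and the `XDG_CACHE_HOME` fallback is omitted for brevity.

```python
import os
from pathlib import Path


def hub_cache_dir(env=None) -> Path:
    """Resolve the Hub cache root (hypothetical helper, for illustration)."""
    env = os.environ if env is None else env
    # HF_HUB_CACHE, when set, points directly at the cache root.
    if env.get("HF_HUB_CACHE"):
        return Path(env["HF_HUB_CACHE"])
    # Otherwise the root is "<HF_HOME>/hub", with HF_HOME defaulting to
    # ~/.cache/huggingface (XDG_CACHE_HOME handling is left out here).
    hf_home = env.get("HF_HOME") or str(Path.home() / ".cache" / "huggingface")
    return Path(hf_home) / "hub"


def snapshot_dir(repo_id: str, commit_hash: str, env=None) -> Path:
    """Build the per-revision folder: models--<org>--<name>/snapshots/<hash>."""
    folder = "models--" + repo_id.replace("/", "--")
    return hub_cache_dir(env) / folder / "snapshots" / commit_hash
```

Keying the path on the commit hash means two revisions of the same model never share a folder, so a stale download cannot be mistaken for the current one.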

Support enable_thinking arg in chat/completions (lemonade-sdk#558)

f6ef295

Only test the python when the python has changed (lemonade-sdk#559)

feaca5a

Bring C++ system-info to parity with python (lemonade-sdk#560)

f1030c0

Clean up the help menu (lemonade-sdk#569)

8cfbe34

use updated model list from flm (lemonade-sdk#561)

69d0db5

Add unzip to the .deb deps (lemonade-sdk#571)

62d88f5

Single source of truth for models dir (lemonade-sdk#570)

e3e921e

Improve the user experience for embeddings (lemonade-sdk#568)

8f02bbe

Bug fixes: flm install, user_models.json, list command (lemonade-sdk#579)

7c413bf

Update open-hands.md (lemonade-sdk#564)

6daa0c5

* Update open-hands.md * Update open-hands.md

add: Version: to log (lemonade-sdk#576)

ff5cd5a

Use WiX to make an MSI installer (lemonade-sdk#580)

ce86ee4

Add project roadmap to README (lemonade-sdk#583)

e51d410

Infinite timeout for inference requests (lemonade-sdk#590)

37702b0

Organize CMakeLists and exclude zstd from deb package (lemonade-sdk#587)

e331db8

Update README to change default host address (lemonade-sdk#588)

85cf255

It had the old value

vgodsoe and others added 29 commits December 10, 2025 16:04

Add core Lemonade features to homepage (lemonade-sdk#702)

f144b16

Update index.html

d3426cb

Update links to icons to fix rate limiting issues (lemonade-sdk#712)

ad38f95

Update wall of logos on the website (lemonade-sdk#714)

f6863d1

Resolve cmake warnings (lemonade-sdk#713)

7c86541

Update Runner Labels (lemonade-sdk#728)

92e649c

* Update runner labels * Update labels in documentation * Suggested changes

Fix bug that resulted in user-model registrations, Add a Model bugs (lemonade-sdk#729)

659eb70

* Fix bug that resulted in user-model registrations * Eliminate redundant methods and data structures * Fix model pull bugs --------- Co-authored-by: Daniel Holanda <holand.daniel@gmail.com>

Improve FLM installation and fix bugs (lemonade-sdk#730)

048cc7d

* Overhaul flm install flow * Adjust model list cache for correctness * Fix FLM and rev lemonade version * Change FLM driver req to .304. Update website.

Overhaul the "not-found" error message (lemonade-sdk#735)

f580420

* More helpful 'not-found' errors * Better error message when models are filtered * Show 'model load failed' errors --------- Co-authored-by: Daniel Holanda <holand.daniel@gmail.com>

Fix app download progress bugs with 0B files (lemonade-sdk#743)

61b3413

Surpress health request prints (lemonade-sdk#742)

3bd1fc8

Add experiences for Embeddings, Reranking, and Transcription (lemonade-sdk#731)

e4f0c0b

* Show embeddings well * cleaner * cleaner rerank * Better reranker * Better embeddings * Great embedding and reranking * transcription * transcriber * nit * Cleanup styles merge * Fix chat bar

Add --extra-models-dir option (lemonade-sdk#741)

4c34aa1

* Add --extra-models-dir option * Refine the language

Add ROCm support to STX Point (gfx1150) (lemonade-sdk#740)

5c5074c

Update development status in README.md

adf6d9c

Update GitHub/Website for early 2026 (lemonade-sdk#716)

cb28837

Add nemotron 3 nano model (lemonade-sdk#751)

545a4d2

Touch up the documentation (lemonade-sdk#753)

338ccde

* Touch up the documentation * Apply suggestions from code review Co-authored-by: Ramakrishnan Sivakumar <ramkrishna2910@gmail.com> --------- Co-authored-by: Ramakrishnan Sivakumar <ramkrishna2910@gmail.com>

Fix index.html responsive design (lemonade-sdk#755)

4cbb958

Center website subtitle (lemonade-sdk#756)

965fab7

incremental logits

88ab33c

Revert "incremental logits"

e1c2e12

This reverts commit 88ab33c.

Add Ryzen AI Z2 support and LEMONADE_DISABLE_MODEL_FILTERING env var (lemonade-sdk#764)

479adaa

* add LEMONADE_DISABLE_MODEL_FILTERING env var * Add Ryzen AI Z2 to the supported NPU processors regex

upgrade flm to 0.9.24 (lemonade-sdk#781)

32d7e90

Steamline the new user experience (lemonade-sdk#788)

45dd215

* Steamline the new user experience * bug fix * new file

add support for recipe options (lemonade-sdk#775)

443cc7d

* add support for recipe options * add documentation for recipe_options.json * corrected documentation * add logging for recipe options

Lemonade API key (lemonade-sdk#774)

5657ce3

* add support for API key * document LEMONADE_API_KEY * do not require auth on HTTP OPTIONS method
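
The last bullet above matters because browsers send CORS preflight (`OPTIONS`) requests without credentials, so requiring a key on them would block web clients before the real request is made. A minimal sketch of that rule follows. It is not taken from the Lemonade source: the function name `is_authorized`, the `Authorization: Bearer <key>` scheme, and treating an unset key as "auth disabled" are all assumptions made for illustration.

```python
import hmac
from typing import Mapping, Optional


def is_authorized(method: str, headers: Mapping[str, str], api_key: Optional[str]) -> bool:
    """Hypothetical request check; names and header scheme are assumptions."""
    # Assumption: no key configured (e.g. LEMONADE_API_KEY unset) disables auth.
    if not api_key:
        return True
    # CORS preflight requests carry no credentials, so let them through.
    if method.upper() == "OPTIONS":
        return True
    # Assumption: the key arrives as "Authorization: Bearer <key>".
    auth = next((v for k, v in headers.items() if k.lower() == "authorization"), "")
    scheme, _, token = auth.partition(" ")
    if scheme.lower() != "bearer":
        return False
    # Constant-time comparison avoids leaking key contents through timing.
    return hmac.compare_digest(token.strip().encode(), api_key.encode())
```

With a key of `"secret"`, an `OPTIONS` request with no headers passes, while a `GET` without a matching bearer token is rejected.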

Merge branch 'nvidia_benchmark_nvidia-smi' into main

0f51dba

meghsat merged commit 7541eac into nvidia_benchmark_nvidia-smi Jan 6, 2026
18 of 34 checks passed

meghsat merged 108 commits into nvidia_benchmark_nvidia-smi from main


13 participants
