Skip to content

⬆️ Update dependency onnxruntime-web to ^1.24.1#1204

Draft
renovate[bot] wants to merge 1 commit intomainfrom
renovate/onnxruntime-web-1.x
Draft

⬆️ Update dependency onnxruntime-web to ^1.24.1#1204
renovate[bot] wants to merge 1 commit intomainfrom
renovate/onnxruntime-web-1.x

Conversation

@renovate
Copy link
Contributor

@renovate renovate bot commented Feb 9, 2026

This PR contains the following updates:

Package Change Age Confidence
onnxruntime-web ^1.22.0^1.24.1 age confidence

Release Notes

Microsoft/onnxruntime (onnxruntime-web)

v1.24.1: ONNX Runtime v1.24.1

Compare Source

📢 Announcements & Breaking Changes

Platform Support Changes
  • Python 3.10 wheels are no longer published — Please upgrade to Python 3.11+
  • Python 3.14 support added
  • Free-threaded Python (PEP 703) — Added support for Python 3.13t and 3.14t in Linux (#​26786)
  • x86_64 binaries for macOS/iOS are no longer provided and minimum macOS is raised to 14.0
API Version

✨ New Features

🤖 Execution Provider (EP) Plugin API

A major infrastructure enhancement enabling plugin-based EPs with dynamic loading:

  • Initial kernel-based EP support (#​26206)
  • Weight pre-packing support for plugin EPs (#​26754)
  • EP Context model support (#​25124)
  • Control flow kernel APIs (#​26927)
  • OrtKernelInfo APIs for kernel-based plugin EPs (#​26803)
🔧 Core APIs
  • OrtApi::CreateEnvWithOptions() and OrtEpApi::GetEnvConfigEntries() (#​26971)
  • EP Device Compatibility APIs (#​26922)
  • External Resource Importer API for D3D12 shared resources (#​26828)
  • Session config access from KernelInfo (#​26589)
📊 Dependencies & Integration

🖥️ Execution Provider Updates

NVIDIA
  • CUDA EP: Flash Attention updates, GQA kernel fusion, BF16 support for MoE/qMoE/MatMulNBits, CUDA 13.0 support
  • TensorRT EP: Upgraded to TensorRT 10.14, automatic plugin loading, NVFP4 custom ops
  • TensorRT RTX EP: RTX runtime caching, CUDA graph support, BFloat16, memory-mapped engines
Qualcomm QNN EP
  • QNN SDK upgraded to 2.42.0 with new ops (RMSNorm, ScatterElements, GatherND, STFT, RandomUniformLike)
  • Gelu pattern fusion, LPBQ quantization support, ARM64 wheel builds, v81 device support
Intel & AMD
  • OpenVINO EP: Upgraded to 2025.4.1
  • VitisAI EP: External EP loader, compiled model compatibility API
  • MIGraphX EP: QuickGelu, multihead attention, QLinear pooling ops
ArmNN EP

Arm is formally deprecating the Arm NN Execution Provider (EP) in ONNX Runtime. The Arm NN EP is still experimental and depends on technology that is no longer actively maintained. Keeping it available now only adds complexity and potential confusion for users.

What to expect:

  • Effective immediately, the Arm NN EP is deprecated and will no longer be maintained
  • All build options, documentation, and examples referencing ArmNN will be removed once the upstream change merges; the removal will appear in the first ONNX Runtime release that includes that change. We will confirm the release number as soon as it is known
  • Builds that still rely on Arm NN-specific options (for example --use_armnn) will fail after the change lands, so please adjust configurations in advance

🌐 Web & JavaScript

  • WebGPU EP: Flash Attention optimizations, graph capture, Split-K MatMul, qMoE support, WGSL templates
  • WebNN EP: GQA local attention, GatherBlockQuantized, ConvInteger/MatMulInteger
  • Node.js/React Native: Node.js v22, JSI for React Native, JSPI build support

🧠 CPU Improvements

  • KleidiAI: SME1/SME2 Convolution and SGemm kernels, FP32 Gemv, Windows/Arm support
  • New ops: MoE/qMoE kernels, RotaryEmbeddings opset 23, LayerNorm/RMSNorm broadcasting
  • Platform support: S390x SIMD, LoongArch64 4-bit quantization, FP16 inference improvements
  • ARM NCHWc layout support: NCHWc layout support for potential performance improvement of Conv models. Needs building from source with --enable_arm_neon_nchwc to enable this feature (#​25580 #​26838 #​26691 #​26171). This feature may be turned ON by default in a future release based on community feedback.
  • ARM perf improvements: Dedicated depthwise conv kernel (#​26688) and SiLU activation perf improvement (#​26753)

🔌 Language Bindings

C#
Python
  • add_external_initializers_from_files (#​26012)
Java

🐛 Bug Fixes

Critical Fixes
  • DoS vulnerability in FuseReluClip (#​26878)
  • Security issue loading arbitrary files as external data (#​26776)
  • Memory leak fix for KernelContext_GetAllocator (#​26883)
  • Local Attention off-by-1 bug (#​25927)
EP-Specific Fixes
  • [QNN] Clip op with min/max from QDQ (#​26601)
  • [CoreML] Gather fp16 support (#​26442)

🙏 Contributors

Thanks to our 170 contributors for this release!

@​fs-eire, @​tianleiwu, @​edgchen1, @​qjia7, @​yuslepukhin, @​hariharans29, @​Honry, @​qti-yuduo, @​adrianlizarraga, @​snnn, @​eserscor, @​vraspar, @​xiaofeihan1, @​guschmue, @​daijh, @​quic-muchhsu, @​qti-jkilpatrick, @​tirupath-qti, @​Jiawei-Shao, @​qti-hungjuiw, @​quic-ashwshan, @​titaiwangms, @​qti-mattsinc, @​chilo-ms, @​jchen10, @​xhcao, @​skottmckay, @​quic-calvnguy, @​JonathanC-ARM, @​Rohanjames1997, @​sushraja-msft, @​jambayk, @​adrastogi, @​xenova, @​quic-tirupath, @​justinchuby, @​HectorSVC, @​kunal-vaishnavi, @​wenqinI, @​prathikr, @​baijumeswani, @​preetha-intel, @​jatinwadhwa921, @​umangb-09, @​qti-ashwshan, @​carzh, @​bachelor-dou, @​ranjitshs, @​gedoensmax, @​xadupre, @​nenad1002, @​TedThemistokleous, @​keshavv27, @​zpye, @​jnagi-intel, @​jiafatom, @​mingyueliuh, @​Colm-in-Arm, @​borg323, @​chunghow-qti, @​Craigacp, @​BODAPATIMAHESH, @​AlekseiNikiforovIBM, @​hans00, @​thevishalagarwal, @​MaanavD, @​qti-kromero, @​damdoo01-arm, @​BoarQing, @​naomiOvad, @​yuhuchua-qti, @​hadiFute, @​vishalpandya1990, @​rivkastroh, @​minfhong-qti, @​kuanyul-qti, @​xieofxie, @​ankitm3k, @​RyanMetcalfeInt8, @​MayureshV1, @​bopeng1234, @​vthaniel, @​mdvoretc-intel, @​ericcraw, @​javier-intel, @​saurabhkale17, @​sfatimar, @​Kotomi-Du, @​intbf, @​n1harika, @​TejalKhade28, @​gupta-pallavi, @​cbourjau, @​nieubank, @​r-devulap, @​wszqkzqk, @​sanketkaleoss, @​amancini-N, @​fanchenkong1, @​meakbiyik, @​hisham-hchowdhu, @​shaoboyan091, @​Stonesjtu, @​qwu16, @​wangw-1991, @​bonktree, @​naetherm, @​nikhilfujitsu, @​Panxuefeng-loongson, @​selenayang888, @​moyo1997, @​chwarr, @​patryk-kaiser-ARM, @​fdwr, @​SavaLione, @​shiyi9801, @​mcost45, @​aciddelgado, @​prudhvi-qti, @​Jonahcb, @​lifang-zhang, @​zhaoxul-qti, @​gaugarg-nv, @​cocotdf, @​WangFengtu1996, @​orlmon01, @​weidu-tpvision, @​theHamsta, @​kevinch-nv, @​XXXXRT666, @​movedancer, @​melkap01-Arm, @​KingSora, @​urpetkov-amd, @​junchao-loongson, @​jixiongdeng, @​wcy123, @​GrigoryEvko, @​anujj, @​peishenyan, @​quic-ankus, @​jchen351, @​yihonglyu, @​satyajandhyala, @​co63oc, @​mschofie, @​quic-ashigarg, @​asoldano, @​nproshun, @​jiangzhaoming, @​seungtaek94, @​liqunfu, @​jaholme, @​hanbitmyths, @​quic-boyuc, @​rM-planet, @​qti-vaiskv, @​AndreyOrb, @​pkubaj, @​xhan65, @​Jaswanth51, @​quic-hungjuiw, @​jywu-msft, @​mklimenk, @​derdeljan-msft, @​ianfhunter, @​NingW101, @​feich-ms, @​Akupadhye, @​wschin


Full Changelog: v1.23.2...rel-1.24.1

v1.23.2: ONNX Runtime v1.23.2

Compare Source

v1.23.0: ONNX Runtime v1.23.0

Compare Source

Announcements

  • This release introduces Execution Provider (EP) Plugin API, which is a new infrastructure for building plugin-based EPs. (#​24887 , #​25137, #​25124, #​25147, #​25127, #​25159, #​25191, #​2524)

  • This release introduces the ability to dynamically download and install execution providers. This feature is exclusively available in the WinML build and requires Windows 11 version 25H2 or later. To leverage this new capability, C/C++/C# users should use the builds distributed through the Windows App SDK, and Python users should install the onnxruntime-winml package(will be published soon). We encourage users who can upgrade to the latest Windows 11 to utilize the WinML build to take advantage of this enhancement.

Upcoming Changes

  • The next release will stop providing x86_64 binaries for macOS and iOS operating systems.
  • The next release will increase the minimum supported macOS version from 13.4 to 14.0.
  • The next release will stop providing python 3.10 wheels.

Execution & Core Optimizations

Shutdown logic on Windows is simplified

Now on Windows some global object will be not destroyed if we detect that the process is being shutting down(#​24891) . It will not cause memory leak as when a process ends all the memory will be returned to the operating system. This change can reduce the chance of having crashes on process exit.

AutoEP/Device Management

Now ONNX Runtime has the ability to automatically discovery computing devices and select the best EPs to download and register. The EP downloading feature currently only works on Windows 11 version 25H2 or later.

Execution Provider (EP) Updates

ROCM EP was removed from the source tree. Users are recommended to use Migraphx or Vitis AI EPs from AMD.
A new EP, Nvidia TensorRT RTX, was added.

Web

EMDSK is upgraded from 4.0.4 to 4.0.8

WebGPU EP

Added WGSL template support.

QNN EP

SDK Update: Added support for QNN SDK 2.37.

KleidiAI

Enhanced performance for SGEMM, IGEMM, and Dynamic Quantized MatMul operations, especially for Conv2D operators on hardware that supports SME2 (Scalable Matrix Extension v2).

Known Problems

  • There was a change in build.py that was related to KleidiAI that may cause build failures when doing cross-compiling (#​26175) .

Contributions

Contributors to ONNX Runtime include members across teams at Microsoft, along with our community members:

@​1duo, @​Akupadhye, @​amarin16, @​AndreyOrb, @​ankan-ban, @​ankitm3k, @​anujj, @​aparmp-quic, @​arnej27959, @​bachelor-dou, @​benjamin-hodgson, @​Bonoy0328, @​chenweng-quic, @​chuteng-quic, @​clementperon, @​co63oc, @​daijh, @​damdoo01-arm, @​danyue333, @​fanchenkong1, @​gedoensmax, @​genarks, @​gnedanur, @​Honry, @​huaychou, @​ianfhunter, @​ishwar-raut1, @​jing-bao, @​joeyearsley, @​johnpaultaken, @​jordanozang, @​JulienMaille, @​keshavv27, @​kevinch-nv, @​khoover, @​krahenbuhl, @​kuanyul-quic, @​mauriciocm9, @​mc-nv, @​minfhong-quic, @​mingyueliuh, @​MQ-mengqing, @​NingW101, @​notken12, @​omarhass47, @​peishenyan, @​pkubaj, @​qc-tbhardwa, @​qti-jkilpatrick, @​qti-yuduo, @​quic-ankus, @​quic-ashigarg, @​quic-ashwshan, @​quic-calvnguy, @​quic-hungjuiw, @​quic-tirupath, @​qwu16, @​ranjitshs, @​saurabhkale17, @​schuermans-slx, @​sfatimar, @​stefantalpalaru, @​sunnyshu-intel, @​TedThemistokleous, @​thevishalagarwal, @​toothache, @​umangb-09, @​vatlark, @​VishalX, @​wcy123, @​xhcao, @​xuke537, @​zhaoxul-qti


Configuration

📅 Schedule: Branch creation - Between 09:00 PM and 11:59 PM, only on Sunday ( * 21-23 * * 7 ), Between 12:00 AM and 04:59 PM, only on Monday ( * 0-16 * * 1 ) (UTC), Automerge - At any time (no schedule defined).

🚦 Automerge: Disabled by config. Please merge this manually once you are satisfied.

Rebasing: Whenever PR becomes conflicted, or you tick the rebase/retry checkbox.

🔕 Ignore: Close this PR and you won't be reminded about this update again.


  • If you want to rebase/retry this PR, check this box

This PR was generated by Mend Renovate. View the repository job log.

@renovate renovate bot temporarily deployed to #1204: App preview February 9, 2026 02:41 Inactive
@renovate renovate bot temporarily deployed to #1204: Bundle breakdown February 9, 2026 08:54 Inactive
@github-actions
Copy link
Contributor

github-actions bot commented Feb 9, 2026

Bundle Stats

Hey there, this message comes from a GitHub action that helps you and reviewers to understand how these changes affect the size of this project's bundle.

As this PR is updated, I'll keep you updated on how the bundle size is impacted.

Total

Files count Total bundle size % Changed
235 46.46 MB → 52.26 MB (+5.8 MB)
12.12 MB → 13.36 MB (gzip)
+12.48%
Changeset
File Δ Size
bindings/select.js 🆕 +455 B 0 B → 455 B
ort.bundle.min.mjs 📈 +2.81 kB (+0.71%) 393.58 kB → 396.39 kB
node_modules/onnxruntime-web/dist/ort.bundle.min.mjs 📈 +2.26 kB (+0.58%) 387.1 kB → 389.36 kB
piexifjs/piexif.js 📉 -8 B (-0.03%) 30.49 kB → 30.48 kB
parser.js 📉 -8 B (-0.03%) 26.21 kB → 26.2 kB
compiler.js 📉 -4 B (-0.05%) 7.84 kB → 7.83 kB
resolve-block-scalar.js 📉 -2 B (-0.07%) 2.93 kB → 2.92 kB
whitespace-control.js 📉 -2 B (-0.08%) 2.58 kB → 2.58 kB
stringifyString.js 📉 -4 B (-0.08%) 4.88 kB → 4.87 kB
source-map-[hash].js 📉 -12 B (-0.10%) 11.78 kB → 11.76 kB
browser/index.js 📉 -745 B (-0.10%) 720.81 kB → 720.09 kB
results.ts 📉 -9 B (-0.11%) 8.13 kB → 8.13 kB
source-node.js 📉 -7 B (-0.18%) 3.7 kB → 3.69 kB
stringifyPair.js 📉 -4 B (-0.19%) 2.03 kB → 2.02 kB
resolve-props.js 📉 -5 B (-0.23%) 2.12 kB → 2.11 kB
javascript-[hash].js 📉 -38 B (-0.24%) 15.39 kB → 15.35 kB
resolve-flow-collection.js 📉 -9 B (-0.25%) 3.49 kB → 3.48 kB
resolve-block-map.js 📉 -6 B (-0.28%) 2.08 kB → 2.07 kB
fflate/esm/browser.js 📉 -41 B (-0.34%) 11.83 kB → 11.79 kB
handlebars.runtime.js 📉 -4 B (-0.44%) 911 B → 907 B
helpers.js 📉 -12 B (-0.45%) 2.6 kB → 2.59 kB
handlebars.js 📉 -4 B (-0.47%) 847 B → 843 B
server.js 📉 -20 B (-0.65%) 3.03 kB → 3.01 kB
storage.ts 📉 -20 B (-0.67%) 2.92 kB → 2.9 kB
inference_utils.js 📉 -34 B (-0.94%) 3.53 kB → 3.5 kB
each.js 📉 -13 B (-1.02%) 1.24 kB → 1.23 kB
foldFlowLines.js 📉 -15 B (-1.06%) 1.39 kB → 1.37 kB
runtime.js 📉 -74 B (-1.21%) 5.95 kB → 5.88 kB
util.js 📉 -98 B (-2.62%) 3.65 kB → 3.55 kB
select.js 🔥 -455 B (-100%) 455 B → 0 B
View detailed bundle breakdown

Added

Asset File Size % Changed
_app/immutable/assets/ort-wasm-simd-threaded.jsep.6MnTkKum.wasm 0 B → 23.76 MB (+23.76 MB)
0 B → 5.55 MB (gzip)
-

Removed

Asset File Size % Changed
_app/immutable/assets/ort-wasm-simd-threaded.jsep.CLPRrI3A.wasm 20.86 MB → 0 B (-20.86 MB)
4.92 MB → 0 B (gzip)
-100%

Bigger

Asset File Size % Changed
_app/immutable/workers/assets/ort-wasm-simd-threaded.jsep-[hash].wasm 20.86 MB → 23.76 MB (+2.9 MB)
4.92 MB → 5.55 MB (gzip)
+13.89%
_app/immutable/chunks/D6uZN-YA.js 389.93 kB → 392.19 kB (+2.26 kB)
107.6 kB → 108.13 kB (gzip)
+0.58%
_app/immutable/workers/start-[hash].js 1.71 MB → 1.71 MB (+1.6 kB)
584.71 kB → 585.21 kB (gzip)
+0.09%

Smaller

No assets were smaller

@renovate renovate bot temporarily deployed to #1204: E2E tests February 9, 2026 09:06 Inactive
@gwennlbh
Copy link
Member

gwennlbh commented Feb 9, 2026

gotta test if using this version fixes eventual bugs with the yolo26 model, if not, dont upgrade cuz of bundle size

@gwennlbh gwennlbh marked this pull request as draft February 9, 2026 10:08
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant