@dependabot dependabot bot commented on behalf of github Dec 29, 2025

Bumps html-to-markdown from 2.15.0 to 2.18.0.

Release notes

Sourced from html-to-markdown's releases.

v2.17.0

Added

  • Go binding now auto-downloads the native FFI library from GitHub Releases with cache/override controls.
  • Release pipeline now publishes per-platform Go FFI artifacts for Go installs.

v2.16.1

What's Changed

Full Changelog: kreuzberg-dev/html-to-markdown@v2.16.0...v2.16.1

v2.16.0

What's Changed

Full Changelog: kreuzberg-dev/html-to-markdown@v2.15.0...v2.16.0

Changelog

Sourced from html-to-markdown's changelog.

[2.18.0] - 2025-12-28

Added

  • Visitor Pattern: Complete implementation of visitor pattern for custom HTML element processing across all 8 language bindings (Python, TypeScript, Ruby, PHP, Go, Java, C#, Elixir)
    • Synchronous and asynchronous visitor support (where applicable per language)
    • 40+ visitor methods with hooks for every HTML element type (text, links, images, headings, lists, tables, code blocks, and more)
    • NodeContext provides element metadata: tag name, attributes, depth, parent tag, inline status, and sibling index
    • Control flow options: Continue, Custom (provide custom markdown), Skip, PreserveHtml, or Error
    • Element lifecycle callbacks: visit_element_start and visit_element_end for complete control
    • Python: Full async visitor support with convert_with_async_visitor() function
    • TypeScript: Async visitor with full type definitions
    • Ruby: Sync visitor implementation with complete RBS type definitions
    • PHP: Full visitor support with PHPStan level 9 compliance
    • Go: Thread-safe visitor registry with markdown post-processing
    • Java: Panama FFI visitor (JDK 21+)
    • C#: P/Invoke visitor with cross-platform compatibility
    • Elixir: Rustler NIF visitor implementation
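
As a rough illustration of the control flow options listed above (Continue, Custom, Skip), here is a minimal, self-contained sketch built on Python's standard `html.parser`. It does not use html-to-markdown itself: `MiniConverter`, `Visitor`, `VisitResult`, `LinkRewriter`, `convert`, and the `NodeContext` field names are all hypothetical stand-ins, and only the `visit_element_start` hook is modelled. The library's real signatures may differ.

```python
from dataclasses import dataclass
from html.parser import HTMLParser
from typing import Optional


@dataclass
class NodeContext:
    # Illustrative subset of the metadata named in the changelog;
    # the real attribute names are an assumption.
    tag_name: str
    attributes: dict
    depth: int
    parent_tag: Optional[str]


@dataclass
class VisitResult:
    kind: str  # "continue", "custom", or "skip"
    markdown: str = ""


class Visitor:
    """Base visitor: lets every element through unchanged."""

    def visit_element_start(self, ctx: NodeContext) -> VisitResult:
        return VisitResult("continue")


class MiniConverter(HTMLParser):
    """Toy driver that emits text only. Void elements such as <img>
    are not handled, so feed it well-nested markup."""

    def __init__(self, visitor: Visitor):
        super().__init__()
        self.visitor = visitor
        self.stack = []          # currently open tags
        self.out = []            # output fragments
        self.skip_depth = None   # depth of the element being suppressed

    def handle_starttag(self, tag, attrs):
        parent = self.stack[-1] if self.stack else None
        ctx = NodeContext(tag, dict(attrs), len(self.stack), parent)
        self.stack.append(tag)
        if self.skip_depth is not None:
            return  # inside a skipped or replaced subtree
        result = self.visitor.visit_element_start(ctx)
        if result.kind == "custom":
            self.out.append(result.markdown)  # replace the whole element
            self.skip_depth = ctx.depth
        elif result.kind == "skip":
            self.skip_depth = ctx.depth       # drop the whole element

    def handle_endtag(self, tag):
        if not self.stack:
            return
        self.stack.pop()
        if self.skip_depth is not None and len(self.stack) == self.skip_depth:
            self.skip_depth = None  # left the suppressed subtree

    def handle_data(self, data):
        if self.skip_depth is None:
            self.out.append(data)


class LinkRewriter(Visitor):
    """Example visitor: drop <nav>, render <a> as an autolink."""

    def visit_element_start(self, ctx):
        if ctx.tag_name == "nav":
            return VisitResult("skip")
        if ctx.tag_name == "a":
            return VisitResult("custom", f"<{ctx.attributes.get('href', '')}>")
        return VisitResult("continue")


def convert(html: str, visitor: Visitor) -> str:
    converter = MiniConverter(visitor)
    converter.feed(html)
    converter.close()
    return "".join(converter.out)
```

For example, `convert('<nav>menu</nav><p>See <a href="https://example.com">here</a> now</p>', LinkRewriter())` drops the navigation block and replaces the link with the visitor's custom output.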

Fixed

  • HTML parsing for modern websites: Fixed issue where JavaScript-heavy websites (like Reuters) would lose article body content during conversion (GitHub issue #167)
    • The parser was incorrectly interpreting HTML-like strings inside <script> tags as actual HTML elements
    • Script and style tags are now properly stripped during preprocessing while preserving JSON-LD metadata
    • No performance impact on conversion speed
  • Python API: Fixed missing ConversionOptionsHandle export in public API (GitHub issue #166)
    • Users can now import ConversionOptionsHandle directly from the html_to_markdown package
    • Maintains backward compatibility with existing OptionsHandle import
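
The script-handling fix above can be pictured with a small sketch. This is not the library's preprocessing code; it is an assumption-laden illustration using Python's standard `html.parser`, which already treats `<script>` and `<style>` bodies as raw text. The `ScriptStripper` class name and its `text` / `json_ld` attributes are hypothetical.

```python
import json
from html.parser import HTMLParser


class ScriptStripper(HTMLParser):
    """Drops <script>/<style> bodies but keeps parsed JSON-LD blocks."""

    def __init__(self):
        super().__init__()
        self.text = []      # visible text fragments
        self.json_ld = []   # parsed JSON-LD payloads
        self._mode = None   # None (normal), "drop", or "jsonld"
        self._buf = []      # raw JSON-LD text being collected

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            is_json_ld = (
                tag == "script"
                and dict(attrs).get("type") == "application/ld+json"
            )
            self._mode = "jsonld" if is_json_ld else "drop"
            self._buf = []

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._mode is not None:
            if self._mode == "jsonld":
                self.json_ld.append(json.loads("".join(self._buf)))
            self._mode = None

    def handle_data(self, data):
        if self._mode == "jsonld":
            self._buf.append(data)   # keep metadata
        elif self._mode is None:
            self.text.append(data)   # keep visible text
        # in "drop" mode the data is discarded


page = (
    '<script>var s = "<p>fake</p>";</script>'
    '<script type="application/ld+json">{"headline": "Hi"}</script>'
    "<p>Real body</p>"
)
stripper = ScriptStripper()
stripper.feed(page)
stripper.close()
body_text = "".join(stripper.text)
```

Here the `<p>fake</p>` string inside the first script never reaches `body_text`, while the JSON-LD payload remains available in `stripper.json_ld`.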

[2.17.0] - 2025-12-22

Added

  • Go binding now auto-downloads the native FFI library from GitHub Releases with cache/override controls.
  • Release pipeline now publishes per-platform Go FFI artifacts for Go installs.

[2.16.1] - 2025-12-22

Fixed

  • Fast-path plain-text conversions now honor escape flags (asterisks/underscores/misc/ASCII).
  • Fast-path plain-text conversions now normalize whitespace and trim trailing spaces.
  • Fast-path plain-text conversions now respect strip_newlines.
  • Python CLI proxy now only applies v1 translation defaults when v1-only flags are present.
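
To make the three fast-path fixes above concrete, the following sketch shows what honoring those options on tag-free input could look like. The function name `convert_plain_text` is hypothetical, the keyword names are assumed to mirror the library's options, and the actual Rust implementation is not shown in this PR.

```python
import re


def convert_plain_text(
    text: str,
    *,
    escape_asterisks: bool = False,
    escape_underscores: bool = False,
    strip_newlines: bool = False,
) -> str:
    """Hypothetical fast path for input that contains no HTML tags."""
    if strip_newlines:
        # Respect strip_newlines: fold line breaks into spaces.
        text = text.replace("\r", " ").replace("\n", " ")
    # Normalize whitespace: collapse runs of spaces and tabs.
    text = re.sub(r"[ \t]+", " ", text)
    # Trim trailing spaces on every line.
    text = "\n".join(line.rstrip() for line in text.split("\n"))
    # Honor escape flags so the text cannot be read as Markdown emphasis.
    if escape_asterisks:
        text = text.replace("*", r"\*")
    if escape_underscores:
        text = text.replace("_", r"\_")
    return text
```

The point of the sketch is only that a shortcut path has to apply the same options as the full conversion path; before 2.16.1 the changelog says it did not.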

[2.16.0] - 2025-12-22

Added

  • Profiling harness and workflow for Rust core and bindings with consolidated flamegraph output.
  • Benchmark scenarios for inline images, metadata extraction, and raw metadata output across fixtures.
  • WASM profiling support with warmups and stable flamegraph parsing.
  • FFI byte-based conversion path plus metadata-raw benchmark coverage.

Changed

... (truncated)

Commits
  • bdd6a30 fix(ci): add Task path triggers to all CI workflows
  • e02e35d chore: remove Elixir crash dump file
  • bc32556 fix(task): use relative paths in language task files to fix directory resolution
  • 64ba9a1 fix(ci): pin Task version to 3.46.4 to avoid installation failures
  • 5e6eb58 fix(ci): replace arduino/setup-task with go-task/setup-task to avoid rate limits
  • c95a7d5 feat(build): implement Kreuzberg pattern with modular Taskfile and test_apps
  • ce27fa5 chore: cleanup artifacts directory and add to gitignore
  • 40eb670 fix(ci): resolve Windows FFI builds and benchmark analyzer failures
  • 9628257 fix: resolve CI failures in Rust visitor tests, Python tests, and mypy valida...
  • 3e4935f chore: cleanup
  • Additional commits viewable in compare view

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.


Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

  • @dependabot rebase will rebase this PR
  • @dependabot recreate will recreate this PR, overwriting any edits that have been made to it
  • @dependabot merge will merge this PR after your CI passes on it
  • @dependabot squash and merge will squash and merge this PR after your CI passes on it
  • @dependabot cancel merge will cancel a previously requested merge and block automerging
  • @dependabot reopen will reopen this PR if it is closed
  • @dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
  • @dependabot show <dependency name> ignore conditions will show all of the ignore conditions of the specified dependency
  • @dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

Bumps [html-to-markdown](https://github.com/kreuzberg-dev/html-to-markdown) from 2.15.0 to 2.18.0.
- [Release notes](https://github.com/kreuzberg-dev/html-to-markdown/releases)
- [Changelog](https://github.com/kreuzberg-dev/html-to-markdown/blob/main/CHANGELOG.md)
- [Commits](kreuzberg-dev/html-to-markdown@v2.15.0...v2.18.0)

---
updated-dependencies:
- dependency-name: html-to-markdown
  dependency-version: 2.18.0
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
@dependabot dependabot bot added the dependencies (Pull requests that update a dependency file) and python:uv (Pull requests that update python:uv code) labels Dec 29, 2025
@DevNergis DevNergis merged commit c43a50e into latest Dec 31, 2025
10 checks passed
@dependabot dependabot bot deleted the dependabot/uv/html-to-markdown-2.18.0 branch December 31, 2025 00:17