
Dev#19

Merged
zTgx merged 3 commits into main from dev
Apr 7, 2026

Conversation

@zTgx
Member

@zTgx zTgx commented Apr 7, 2026

No description provided.

zTgx added 3 commits April 7, 2026 10:36
- Add scraper dependency for HTML5 parsing
- Implement HtmlConfig with customizable parsing options
- Create HtmlParser with heading hierarchy extraction
- Support metadata extraction (title, description, author, keywords)
- Extract content from various HTML elements (paragraphs, lists, tables)
- Add tests for HTML parsing functionality
- Update parser registry to include HTML parser
- Modify documentation to reflect HTML support
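
For readers unfamiliar with heading hierarchy extraction, here is a minimal sketch of how a flat run of headings (h1 to h6, in document order) might be folded into a tree. It is not the code from this PR: `HeadingNode`, `build_hierarchy`, and `attach` are hypothetical names, the input is assumed to be already-extracted `(level, title)` pairs, and the sketch uses only the standard library rather than the scraper crate.

```rust
/// One heading plus the headings nested beneath it.
/// Hypothetical type for illustration; not taken from the PR.
#[derive(Debug, PartialEq)]
pub struct HeadingNode {
    pub level: u8, // 1 for <h1> through 6 for <h6>
    pub title: String,
    pub children: Vec<HeadingNode>,
}

/// Fold headings listed in document order into a tree.
/// A heading becomes a child of the nearest preceding heading
/// with a strictly smaller level; otherwise it is a root.
pub fn build_hierarchy(flat: &[(u8, &str)]) -> Vec<HeadingNode> {
    let mut roots: Vec<HeadingNode> = Vec::new();
    // Stack of currently open headings; levels strictly increase toward the top.
    let mut stack: Vec<HeadingNode> = Vec::new();

    for &(level, title) in flat {
        // Close every open heading that cannot be an ancestor of the new one.
        while stack.last().map_or(false, |top| top.level >= level) {
            let done = stack.pop().unwrap();
            attach(done, &mut stack, &mut roots);
        }
        stack.push(HeadingNode {
            level,
            title: title.to_string(),
            children: Vec::new(),
        });
    }

    // Close whatever is still open at the end of the document.
    while let Some(done) = stack.pop() {
        attach(done, &mut stack, &mut roots);
    }
    roots
}

/// Attach a finished node to its parent (the new top of the stack),
/// or to the root list when no heading is open.
fn attach(node: HeadingNode, stack: &mut Vec<HeadingNode>, roots: &mut Vec<HeadingNode>) {
    match stack.last_mut() {
        Some(parent) => parent.children.push(node),
        None => roots.push(node),
    }
}
```

Under these assumptions, a skipped level (an h3 directly under an h1) simply nests under the nearest shallower heading. The parser in this PR may treat that case differently.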
Add comprehensive HTML parser example demonstrating:
- Basic HTML parsing functionality
- Metadata extraction from head elements
- Complex HTML structures including lists, tables, code blocks
- Configuration options with different presets
- Integration with Engine for indexing and querying
- Support for various HTML elements (headings, paragraphs, lists, tables)

Also update page range demo by changing variable names to use
underscore prefix convention for unused variables.
- Update package version in Cargo.toml from 0.1.17 to 0.1.18
- Prepare for new release with version increment
@zTgx zTgx merged commit 16e8215 into main Apr 7, 2026
