-
Notifications
You must be signed in to change notification settings - Fork 0
September #2 #16
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
September #2 #16
Changes from all commits
Commits
Show all changes
3 commits
Select commit
Hold shift + click to select a range
File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,68 @@ | ||
--- | ||
title: "September Bulletin #2" | ||
date: "2025-10-13" | ||
authors: ["Community Team"] | ||
excerpt: "Overview of all work happening in Vortex" | ||
published: true | ||
--- | ||
|
||
> Editor's note: Its been pointed out to us that while this post is published in October, it covers work done in September, so this will be the second September Bulletin. | ||
|
||
This is the second monthly issue of all that has been happening in Vortex! | ||
|
||
Development keeps moving quickly, we released [0.54.0](https://github.com/vortex-data/vortex/releases/tag/0.54.0) with many new features and improvements to the Rust API. This month we accepted 227 commits from 14 different contributors, with the relentless renovate-bot taking the crown for most changes merged. | ||
|
||
## Core | ||
|
||
1. Published an [RFC](https://github.com/vortex-data/vortex/pull/4726) presenting our planned changes to how arrays and compute interact with each other, introducing what we call Operators. | ||
1. Started work on GPU-powered kernels. | ||
1. Merged a few big changes to the IO APIs ([#4557](https://github.com/vortex-data/vortex/pull/4608), [#4608](https://github.com/vortex-data/vortex/pull/4608)), which introduce new push-based write API (in addition to the existing stream-based one), and writes now return both the file's footer (which can now be cached, reused and even de/serialized [#4598](https://github.com/vortex-data/vortex/pull/4608)), and the file's total compressed size. | ||
1. Added a new encoding - [fastlanes](https://github.com/spiraldb/fastlanes)-based RLE [#4588](https://github.com/vortex-data/vortex/pull/4588), [#4789](https://github.com/vortex-data/vortex/pull/4789). | ||
1. The work on the new `FixedSizeList` keeps moving forward - [#4590](https://github.com/vortex-data/vortex/pull/4590), [#4601](https://github.com/vortex-data/vortex/pull/4601). | ||
1. Added a new canonical encoding for list types, see the [tracking issue](https://github.com/vortex-data/vortex/issues/4699) for full details and work. | ||
1. Fixed a long-standing issue where we used to underestimate the uncompressed size of arrays, causing inaccurate array stats and a consistent underestimation of the overall data size. [#4963](https://github.com/vortex-data/vortex/pull/4693). | ||
1. Improved performance for very wide tables (1K+ columns), improving a new compression [benchmark](https://github.com/vortex-data/vortex/pull/4867) by over 80%. [#4863](https://github.com/vortex-data/vortex/pull/4863) [#4868](https://github.com/vortex-data/vortex/pull/4868) [#4877](https://github.com/vortex-data/vortex/pull/4877) | ||
1. Allow users to write with different compression strategies from Python [#4825](https://github.com/vortex-data/vortex/pull/4825) | ||
|
||
## Integrations | ||
|
||
### DuckDB | ||
|
||
1. Landed a release of the Vortex extension for the most recent DuckDB release (1.4). As always, you can install and load the extension with: | ||
|
||
```sql | ||
INSTALL vortex FROM community; | ||
LOAD vortex; | ||
``` | ||
|
||
1. Improved the testing of our DuckDB extension, making it easier and faster to run tests, using a debug build of DuckDB. | ||
1. Added zero-copy exporting of arrays to DuckDB [#4812](https://github.com/vortex-data/vortex/pull/4812) [#4804](https://github.com/vortex-data/vortex/pull/4804) | ||
|
||
### Apache DataFusion | ||
|
||
1. Use the built-in `FilePruner` to prune file based on the full expression, even for expressions we can't push down yet like dynamic expressions. | ||
1. Support for tables with hive-style partitioning. | ||
1. Updated our Apache DataFusion integration to the most recent release (v50.1.0) [#4577](https://github.com/vortex-data/vortex/pull/4577) | ||
|
||
## Acknowledgments | ||
|
||
We want to thank to anyone who has tried Vortex, provided feedback, asked question and filed issues. | ||
|
||
Special thanks go for all the contributors who took the time and care to contribute to Vortex this month (in descending count of commits): | ||
|
||
```text | ||
41 Adam Gutglick | ||
32 Connor Tsui | ||
22 Joe Isaacs | ||
21 Robert Kruszewski | ||
21 Alexander Droste | ||
20 Onur Satici | ||
20 Nicholas Gates | ||
10 Dan King | ||
4 Dmitrii Blaginin | ||
2 Andrew Duffy | ||
1 Will Manning | ||
1 Maksim Dergousov | ||
1 Evan Martin | ||
1 Alfonso Subiotto Marqués | ||
``` |
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.