DataFusion underperforms the Polars streaming engine on some join queries run locally (1e8 rows on a MacBook M3 with 16 GB of RAM).
Here are the join queries.
My guess is that the join operator can be optimized, much as the filtering and aggregation operations were.
Here is an example of how the median function was made faster: #13550
See this epic for more info: #13548