Skip to content

datalake: Convert protobuf repeated fields into Arrow lists#3

Draft
jcipar wants to merge 2 commits intojcipar/proto-to-arrowfrom
jcipar/proto-repeated
Draft

datalake: Convert protobuf repeated fields into Arrow lists#3
jcipar wants to merge 2 commits intojcipar/proto-to-arrowfrom
jcipar/proto-repeated

Conversation

@jcipar
Copy link
Copy Markdown
Owner

@jcipar jcipar commented Jul 30, 2024

This change parses Protobuf repeated fields into Arrow lists.

Backports Required

  • none - not a bug fix
  • none - this is a backport
  • none - issue does not exist in previous branches
  • none - papercut/not impactful enough to backport
  • v24.1.x
  • v23.3.x
  • v23.2.x

Release Notes

jcipar added 2 commits July 29, 2024 17:49
Introduces a protobuf_to_arrow converter that accepts messages in
protobuf format, parses them, and adds them to an Arrow table. The Arrow
table can be used to write a Parquet file.
This change parses Protobuf repeated fields into Arrow lists.
@jcipar jcipar force-pushed the jcipar/proto-to-arrow branch 2 times, most recently from 9a73355 to 003f82e Compare August 2, 2024 21:46
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant