feat(parquet/metadata): bloom filter implementation #336

Merged: zeroshade merged 10 commits into apache:main from the add-bloom-filters branch on Apr 1, 2025

zeroshade (Member)

Rationale for this change

Like many other Parquet readers and writers, we should support bloom filters.

What changes are included in this PR?

This PR only adds an implementation to the metadata package that represents bloom filters and handles them when reading and writing metadata. It does not yet wire them up through the actual Parquet file reader and writer; that will be done in a subsequent PR.

Are these changes tested?

Yes, unit tests are included.

Are there any user-facing changes?

Only the new functions exposed in the metadata package.

@zeroshade zeroshade requested review from kou, lidavidm and wgtmac March 28, 2025 20:59
@lidavidm lidavidm (Member) left a comment

Some initial comments. I have yet to look at the bloom filter itself.

@zeroshade zeroshade merged commit 6576e9c into apache:main Apr 1, 2025
23 checks passed
@zeroshade zeroshade deleted the add-bloom-filters branch April 1, 2025 14:51
2 participants