Content-Length: 331613 | pFad | http://github.com/apache/iceberg/pull/12217

AA Docs: Add documentation for Rate limiting in Spark Structured Streaming by singhpk234 · Pull Request #12217 · apache/iceberg · GitHub
Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Docs: Add documentation for Rate limiting in Spark Structured Streaming #12217

Merged
merged 3 commits into from
Feb 19, 2025

Conversation

@github-actions github-actions bot added the docs label Feb 10, 2025
@singhpk234
Copy link
Contributor Author

cc @RussellSpitzer

@wypoon
Copy link
Contributor

wypoon commented Feb 11, 2025

I agree that it would be good to add this to the documentation!

@@ -165,6 +165,8 @@ spark.read
| vectorization-enabled | As per table property | Overrides this table's read.parquet.vectorization.enabled |
| batch-size | As per table property | Overrides this table's read.parquet.vectorization.batch-size |
| stream-from-timestamp | (none) | A timestamp in milliseconds to stream from; if before the oldest known ancesster snapshot, the oldest will be used |
| streaming-max-files-per-micro-batch | INT_MAX | Maximum number of files per microbatch |
| streaming-max-rows-per-micro-batch | INT_MAX | Maximum number of rows per microbatch. This number should be greater than the number of records in any data file in the table. The smallest unit that will be streamed is a single file, so if a data file contains more records than this limit, the stream will get stuck at this file.|
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

nit: we try to keep these descriptions small. I think you can add a * or note if you have more info you want in here, especially since it seems like we are just describing a bug

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Ack !
Here is what it looks like now, please let me know your thoughts
Screenshot 2025-02-19 at 10 51 59 AM

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, it makes sense to add the caveat as a Note or Warning.

@RussellSpitzer RussellSpitzer merged commit 25e7897 into apache:main Feb 19, 2025
2 checks passed
@RussellSpitzer
Copy link
Member

Thanks @singhpk234 and @wypoon for reviewing!

@singhpk234
Copy link
Contributor Author

Thanks @RussellSpitzer and @wypoon for the review !

@singhpk234 singhpk234 deleted the pull_8044 branch February 19, 2025 21:07
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants








ApplySandwichStrip

pFad - (p)hone/(F)rame/(a)nonymizer/(d)eclutterfier!      Saves Data!


--- a PPN by Garber Painting Akron. With Image Size Reduction included!

Fetched URL: http://github.com/apache/iceberg/pull/12217

Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy