Create empty snapshot for metadata operations #7075

jackye1995 · 2023-03-11T04:05:55Z

Feature Request / Improvement

Creating an empty snapshot for every metadata operation, so that a table never has a state with no snapshot, might simplify various use cases.

(1) For branching, main would no longer need to be a special case compared to custom branches, which can only exist after the first data write.

(2) For time travel, the schema is currently derived from the snapshot ID at the specified time. If a table added data at t0 and had, for example, a schema update at t1, creating an empty snapshot at t1 means that traveling to t0 and to t1 would yield different results because the schema has changed, which makes more sense.
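To make point (2) concrete, here is a minimal toy model of resolving a schema through the snapshot that was current at a given time. It is a sketch under stated assumptions, not Iceberg's API: `Snapshot`, `snapshot_as_of`, `schema_as_of`, and the timestamps `T0`/`T1` are all hypothetical names introduced for illustration.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Snapshot:
    # Hypothetical stand-in for a table snapshot, not Iceberg's class.
    snapshot_id: int
    timestamp: int  # commit time
    schema_id: int  # schema in effect when the snapshot was committed


def snapshot_as_of(history: List[Snapshot], ts: int) -> Optional[Snapshot]:
    """Return the latest snapshot committed at or before ts, or None."""
    candidates = [s for s in history if s.timestamp <= ts]
    return max(candidates, key=lambda s: s.timestamp) if candidates else None


def schema_as_of(history: List[Snapshot], ts: int) -> Optional[int]:
    """Resolve the schema id through the snapshot found for ts."""
    snap = snapshot_as_of(history, ts)
    return None if snap is None else snap.schema_id


T0, T1 = 100, 200  # assumed times: data write at T0, schema update at T1

# Current behavior in this model: the schema update at T1 adds no snapshot.
without_empty = [Snapshot(snapshot_id=1, timestamp=T0, schema_id=0)]

# Proposed behavior: the schema update at T1 also commits an empty snapshot.
with_empty = without_empty + [Snapshot(snapshot_id=2, timestamp=T1, schema_id=1)]
```

In this model, `schema_as_of(without_empty, T1)` resolves to the same snapshot as at `T0`, so both times see the old schema, while `schema_as_of(with_empty, T1)` picks up the updated one.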

However, doing so might have other implications and affect the behavior of existing operations such as snapshot expiration.

Also, we will have to keep backwards compatibility and still deal with tables with no snapshot, so maybe we would not gain much and would have to live with the current situation.

Would like to know what others think.

cc @rdblue @aokolnychyi @RussellSpitzer @danielcweeks

Query engine

None

rdblue · 2023-03-11T19:35:14Z

I don't think we need to create a new snapshot for every metadata operation, but I think it would be reasonable to create empty snapshots when we need to create a branch and there is no current snapshot. And I also think it would be reasonable to create a snapshot when the schema changes to signal when in history that happened.
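A minimal sketch of the first case, creating an empty snapshot only when a branch is requested and there is no current snapshot. `ToyTable` and its methods are hypothetical and model the idea only; they are not Iceberg's API or the way any engine implements it.

```python
from typing import Dict, Optional


class ToyTable:
    """Hypothetical stand-in for a table: snapshots plus named refs."""

    def __init__(self) -> None:
        self.snapshots: Dict[int, int] = {}  # snapshot id -> number of data files
        self.refs: Dict[str, int] = {}       # branch name -> snapshot id
        self.current_snapshot_id: Optional[int] = None

    def _commit(self, data_files: int) -> int:
        # Record a new snapshot and advance main to it.
        snapshot_id = len(self.snapshots) + 1
        self.snapshots[snapshot_id] = data_files
        self.current_snapshot_id = snapshot_id
        self.refs["main"] = snapshot_id
        return snapshot_id

    def create_branch(self, name: str) -> int:
        # Only an empty table needs the extra empty snapshot;
        # otherwise the branch simply points at the current snapshot.
        if self.current_snapshot_id is None:
            self._commit(data_files=0)
        self.refs[name] = self.current_snapshot_id
        return self.refs[name]
```

With this shape, a branch on an empty table and main both point at the same empty snapshot, and later branch creations add no further snapshots.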

manuzhang · 2023-07-31T08:43:03Z

How do we revert a table to its "empty state" if an empty table has no snapshot? Do I have to rebuild the table?

s-akhtar-baig · 2023-08-24T21:46:33Z

@rdblue, my team and I came across a similar problem with schema updates.

The mentioned pull request handles creating empty snapshots for empty tables, but I don't see changes that address creating snapshots for schema updates. If that is the case and it is not already being worked on, can my team and I contribute the remainder of this issue?

FYI @mderoy @rafoid.

namrathamyske · 2024-03-26T17:53:48Z

Can I take this up if it is not already done?

github-actions · 2024-10-04T00:14:44Z

This issue has been automatically marked as stale because it has been open for 180 days with no activity. It will be closed in next 14 days if no further activity occurs. To permanently prevent this issue from being considered stale, add the label 'not-stale', but commenting on the issue is preferred when possible.

github-actions · 2024-10-19T00:14:31Z

This issue has been closed because it has not received any activity in the last 14 days since being marked as 'stale'

ConeyLiu mentioned this issue Jul 15, 2023

Spark: Supports creating a branch on an empty table #8072

Merged

namrathamyske mentioned this issue Mar 28, 2024

branch schema affected by main table schema #9737

Closed

github-actions bot added the stale label Oct 4, 2024

github-actions bot closed this as not planned Oct 19, 2024
