Data Lake v1 Retirement

What's happening?

GRAX is retiring Data Lake v1 in favor of the newer, faster, and safer Data Lake v2. As of today, new GRAX applications have only v2 available by default. Data Lake v1 will be automatically removed from any app that has had all v1 objects disabled for more than 30 days. As of October 1st, 2025, no GRAX users will be able to enable new objects on v1. As of April 1st, 2026, v1 will be removed entirely from the GRAX product.

Why is this happening?

GRAX has invested substantial effort into optimizing Data Lake in many ways:

  • No missed writes

  • Shorter time from backup to Data Lake write

  • Faster backfills

  • Higher object concurrency

  • Lower resource utilization

  • Easier integration with downstream tools (Athena, Glue, DuckDB, etc.)

  • Improved field formats/types

Many of these improvements necessitated structural changes to the Data Lake product as well as changes to the final Parquet structure and content. Pipelines ingesting and querying v1 Parquet will require modification to properly ingest v2.
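When planning pipeline changes, it can help to list exactly which columns were added, removed, or retyped between the two Parquet outputs. The following is a minimal sketch of such a comparison, not a GRAX-provided tool. The function name `diff_schemas` and the column names and types in the sample data are hypothetical placeholders; they do not describe the actual v1 or v2 layouts, which should be read from your own files.

```python
from typing import Dict, List, Tuple


def diff_schemas(old: Dict[str, str], new: Dict[str, str]) -> Dict[str, list]:
    """Compare two {column name: type name} mappings.

    Returns the columns that exist only in `old`, the columns that exist
    only in `new`, and the columns present in both whose type changed.
    """
    removed: List[str] = sorted(set(old) - set(new))
    added: List[str] = sorted(set(new) - set(old))
    retyped: List[Tuple[str, str, str]] = sorted(
        (name, old[name], new[name])
        for name in set(old) & set(new)
        if old[name] != new[name]
    )
    return {"removed": removed, "added": added, "retyped": retyped}


# Hypothetical schemas for illustration only; NOT the real v1/v2 layouts.
v1_schema = {"record_id": "string", "created_at": "string", "amount": "string"}
v2_schema = {
    "record_id": "string",
    "created_at": "timestamp",
    "amount": "double",
    "is_deleted": "bool",
}

report = diff_schemas(v1_schema, v2_schema)
for kind, entries in report.items():
    print(kind, entries)
```

If PyArrow is installed, mappings like these could be built from real files by reading each file's schema with `pyarrow.parquet.read_schema` and converting every field's type to a string. The resulting report gives a concrete checklist of queries and ingestion steps to revisit.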

Compared with v2, Data Lake v1 now provides a user and developer experience that is less reliable, slower, and more expensive. To ensure that all users get the most value out of their GRAX application, we've decided to set deadlines for migrating to v2.

How does it impact me?

If you or your business depend on Data Lake v1 for analytics or other downstream uses, we recommend restructuring your project to use Data Lake v2 as soon as possible. You will not be able to enable new objects on v1 as of October 1st, 2025, and any pipeline still using v1 will stop working entirely when v1 is removed on April 1st, 2026.

How do I get more information?

If you have questions or need more information, please open a support ticket.
