Delta Lake 3.0 Unveiled: Unifying Data Storage Formats with Groundbreaking Features from The Linux Foundation
October 18, 2023
In a game-changing move for the data world, The Linux Foundation has announced the general availability of Delta Lake 3.0, the latest iteration of the open-source Delta Lake project. This release brings a significant leap forward in lakehouse storage formats, offering unparalleled flexibility and control to users.
Delta Lake 3.0 introduces Delta Universal Format (UniForm), an innovative feature designed to erase the dilemma of choosing between Apache Iceberg and Apache Hudi for data storage. UniForm allows reading data in the format demanded by applications, ensuring heightened compatibility and a broader ecosystem. With UniForm, users can now effortlessly generate essential metadata for both formats, eliminating the need for manual conversions and providing unmatched flexibility and efficiency.
Another noteworthy addition is Delta Kernel, which simplifies the process of building Delta connectors for developers. By offering a simplified API, Delta Kernel masks the intricate details of the Delta protocol specification, enabling developers to focus on harnessing Delta's full potential without being bogged down by its underlying complexities.
Performance improvements are also at the forefront of Delta Lake 3.0, with MERGE operations now leveraging data skipping and an improved execution for up to 2x better performance in various scenarios. DELETE operations have seen more than a 2x increase in speed, translating to reduced wait times when purging or updating data, making data management and processing more agile.
Jim Zemlin, Executive Director of The Linux Foundation, expressed his excitement about the release: "Delta Lake 3.0 is a testament to the power of community-driven development in the data world. This groundbreaking version represents not just technological advancement but also the potential of open ecosystems to drive change and foster innovation."
Users worldwide can now take advantage of Delta Lake 3.0's feature set and optimized performance for their data management needs. For more information on this revolutionary release, explore the detailed release notes or read the blog post.
The Linux Foundation, founded in 2000, is a leading home for collaboration on open source software, hardware, standards, and data. Its projects, including Linux, Kubernetes, Node.js, ONAP, Hyperledger, RISC-V, and more, are critical to the world's infrastructure. For further details, visit linuxfoundation.org.
Contact: Noah Lehman [email protected]
Source: The Linux Foundation