Elevating Data Lakes for All
Struggling to choose the right table format for your use case? You're not alone! Each table format brings a unique set of advantages. At Onehouse, we've built our managed lakehouse on Hudi to provide the fastest and most scalable platform for processing mutable data. With Apache XTable, we also make that data consumable in Iceberg and Delta Lake formats, maximizing openness and interoperability. No more table format compromises!
Powered By Apache Hudi™
Onehouse is made by the creators of Apache Hudi, the pioneering lakehouse technology now used industry-wide.
Enhancing Onehouse's Data Lake Capabilities
Enhanced Performance Across Formats
Experience up to 10x faster upserts with Hudi-powered Onehouse, while ensuring seamless read compatibility for your Iceberg and Delta Lake consumers.
Seamless Streaming and Batch Unification
Onehouse leverages Hudi's support for both streaming and batch data ingestion, offering a consistent data management experience across various data processing paradigms.
Streamlined Asynchronous Table Services
Onehouse builds upon Hudi's asynchronous operations to provide the Onehouse Table Optimizer, offering intelligent, asynchronous table services that reduce operational burden and improve performance.
Advanced Data Processing Features in Onehouse
Universal Incremental Processing
Hudi's core strength, processing only the data that has changed, enables Onehouse to dramatically reduce your processing time and costs. Iceberg and Delta Lake users can benefit from this efficiency while maintaining their existing table formats.
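To make the idea of processing only changed data concrete, here is a minimal, self-contained Python sketch. It is a toy model, not Hudi's or Onehouse's API: the `ToyTable` class, its methods, and the timestamp-style instant strings are all hypothetical names chosen for illustration. The point it shows is that a downstream job that remembers its last checkpoint only has to read records written after that checkpoint, rather than rescanning the whole table.

```python
from dataclasses import dataclass, field


@dataclass
class Commit:
    instant: str   # sortable timestamp string (hypothetical format)
    records: dict  # record key -> value written by this commit


@dataclass
class ToyTable:
    """Toy stand-in for a table with a commit timeline (not a real Hudi API)."""
    commits: list = field(default_factory=list)

    def upsert(self, instant, records):
        # Each write is recorded as a commit on the timeline.
        self.commits.append(Commit(instant, dict(records)))

    def snapshot(self):
        # Full read: replay every commit in order; later writes win.
        state = {}
        for commit in sorted(self.commits, key=lambda c: c.instant):
            state.update(commit.records)
        return state

    def incremental(self, begin_instant):
        # Incremental read: only records written strictly after the checkpoint.
        changed = {}
        for commit in sorted(self.commits, key=lambda c: c.instant):
            if commit.instant > begin_instant:
                changed.update(commit.records)
        return changed


table = ToyTable()
table.upsert("20240101000000", {"a": 1, "b": 2, "c": 3})
table.upsert("20240102000000", {"b": 20})
table.upsert("20240103000000", {"d": 4})

# A downstream job that last ran at the first commit only sees what changed.
checkpoint = "20240101000000"
changes = table.incremental(checkpoint)
full = table.snapshot()
```

In this sketch the downstream job touches two records instead of four; on a real table the gap between changed rows and total rows is typically far larger, which is where the time and cost savings come from.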
Flexible Indexing and Updates
Leverage Hudi's advanced indexing through Onehouse for enhanced performance, or maintain your current Iceberg or Delta Lake indexing strategies. The choice is yours.
Intelligent Data Optimization
Thanks to Hudi's built-in data clustering, Onehouse automatically optimizes your data layout, minimizing the manual maintenance often required by other formats and improving query performance for all of your consumers.
Advanced Data Auditing and Processing
Onehouse implements Hudi's time travel and incremental query capabilities, enabling you to perform advanced data auditing, run efficient downstream processing, and easily restore data to a previous state if something goes wrong.
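As a rough illustration of time travel and restore, the following Python sketch models a table as an ordered timeline of commits. Everything here is an assumption made for the example: `ToyTimeline`, `as_of`, and `restore` are hypothetical names, not Hudi or Onehouse calls. It shows how keeping the commit history lets you read the table as it was at an earlier instant, and roll back past a bad write.

```python
from bisect import bisect_right


class ToyTimeline:
    """Toy commit timeline (hypothetical; not a real Hudi API)."""

    def __init__(self):
        self._instants = []  # commit instants, kept in increasing order
        self._writes = []    # self._writes[i] holds the upserts made at self._instants[i]

    def commit(self, instant, upserts):
        if self._instants and instant <= self._instants[-1]:
            raise ValueError("instants must be strictly increasing")
        self._instants.append(instant)
        self._writes.append(dict(upserts))

    def as_of(self, instant):
        # Time travel: rebuild state from commits at or before `instant`.
        end = bisect_right(self._instants, instant)
        state = {}
        for upserts in self._writes[:end]:
            state.update(upserts)
        return state

    def restore(self, instant):
        # Roll back: discard every commit made after `instant`.
        end = bisect_right(self._instants, instant)
        del self._instants[end:]
        del self._writes[end:]


timeline = ToyTimeline()
timeline.commit("20240101000000", {"order-1": "placed"})
timeline.commit("20240102000000", {"order-1": "shipped", "order-2": "placed"})
timeline.commit("20240103000000", {"order-1": "CORRUPTED"})  # a bad write

# Audit: what did the table look like just before the bad write?
before_bad_write = timeline.as_of("20240102235959")

# Recover: drop everything after the last good commit.
timeline.restore("20240102000000")
latest = timeline.as_of("99999999999999")
```

After the restore, the latest view of the table matches the audited pre-failure state, which is the behavior the paragraph above describes.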
“My real hope here is that together, we can create an ecosystem where customers can go to whatever is the best solution without being shackled by the underlying data.”
Raghu Ramakrishnan, CTO for Data @ Microsoft
The Data Lake Foundation for Onehouse
Hudi consistently excels in petabyte-scale mutable workloads for industry giants, leveraging innovations such as Merge-on-Read and pluggable indexes to manage billions of daily record updates.
Unlock best-in-class performance for your Hudi deployment by leveraging the Onehouse team's expertise in operating Hudi for the world's largest organizations.
Hudi's efficient processing and automated maintenance, further optimized by Onehouse, help you significantly reduce infrastructure costs, while the fully managed platform saves countless engineering hours.
Enjoy the benefits of Hudi's vibrant open-source community alongside Onehouse's enterprise-grade support to maximize your data's value and reliability.