Trino Fest
Meet with us to see how Onehouse enables you to ingest your data once, query anywhere.
Meet with Onehouse at Data Council 24
Learn Universal Data Lakehouse allows you ingest once and query from anywhere.
Austin, March 26-28@ AT&T Conference Center
Onehouse delivers a new bedrock for your data, through a cloud-native, fully-managed lakehouse service built on Apache Hudi. Onehouse makes it possible to blend the ease of use of a warehouse with the scale of a data lake. Now you can build your lake in minutes, process your data in seconds, and own your data in open formats instead of being locked away to individual vendors.
This year, Trino Fest promises to deliver content on all things Trino. Onehouse delivers the Universal Data Lakehouse, which gives you a single source of truth for all your data - making it available to any query engine in minutes, not hours (or days), and at a 50% or greater reduction in cost.
Onehouse helps you create fast, simple, direct low-code and no-code pipelines that get your data exactly where it needs to go. You can represent data in Apache Hudi, Delta Lake, and/or Apache Iceberg formats, and get interoperability across all data catalogs, query engines, and other parts of your data infrastructure. Use the same data to power business intelligence, line-of-business apps – and AI!
See why companies such as Apna, Conductor, and NOW Insurance use Onehouse — including for breakthrough AI and machine learning applications.
Join Ethan Guo, Lead Software Engineer at Onehouse for “Enhancing Trino's query performance and data management with Hudi: innovations and future”
In the ever-evolving landscape of big data and analytics, efficient data management and retrieval systems are paramount. In this talk, Ethan will embark on an enlightening journey through the development and innovation of the Hudi connector in Trino, tracing its roots back to the inception via the Hive connector. He will also dive deep into the Hudi connector's unique capabilities that set it apart from conventional file listing and partition pruning methods for query optimization. He'll explore the specialized features in Hudi, such as its multi-modal indexing framework which incorporates support for Column Statistics and Record Index, highlighting how these features enhance query performance for both point and range lookups. The presentation will outline the ambitious roadmap for the Hudi connector, including the expansion of the multi-modal indexing framework, Alluxio-powered file system caching, and the introduction of DDL/DML support. These advancements promise to further refine data management capabilities with the Hudi connector in Trino, offering more flexibility and efficiency in handling large-scale data operations.
Want to see us in action? Book a meeting and get a free swag pack.
This offer is exclusively for professionals in data management or engineering with expertise in data lakes, warehousing, management, or engineering.