IT Brief Australia - Technology news for CIOs & IT decision-makers
Story image
Oracle achieves industry-first with MySQL HeatWave Lakehouse
Tue, 25th Jul 2023

Oracle has announced the general availability of MySQL HeatWave Lakehouse, enabling customers to query data in object storage as fast as querying data inside the database.

MySQL HeatWave Lakehouse supports a variety of object store file formats such as CSV, Parquet, and export files from other databases, and can combine object storage file data and MySQL database transactional data together in the same query, the company states.

Object store files are queried directly by HeatWave without copying the data into the MySQL database. As a result, MySQL HeatWave Lakehouse sets new standards for scalability and performance for query processing, speed of loading data, cluster provisioning time, and automation to query data in object storage.

Edward Screven, Chief Corporate Architect, Oracle, says, "More than 80% of data is stored in file systems and that number is growing. Customers want to integrate and analyse this varied external data with their internal transactional data, but its often too complex or too expensive to process.

"MySQL HeatWave Lakehouse makes it easy for customers to get valuable real-time insights by combining their data in object storage with database data while gaining significantly higher query performance and much faster data loading at a lower cost."

Querying data in object storage is as fast as querying data inside the database 

As demonstrated by a 10 TB TPC-H benchmark, querying data in object storage in popular file formats with MySQL HeatWave Lakehouse is as fast as querying data in the MySQL database. This is made possible by MySQL Autopilot, a built-in capability of MySQL HeatWave that provides machine learning-powered automation, which learns from the execution of queries and improves the execution plan of future queries.

MySQL Autopilot is an innovation in MySQL HeatWave that is not available anywhere else. MySQL HeatWave on Oracle Cloud Infrastructure (OCI) is powered by AMD EPYC processors.

Best performance for lakehouse use cases 

As demonstrated by a 500 TB TPC-H benchmark, the query performance of MySQL HeatWave Lakehouse is: 9X faster than Amazon Redshift; 17X faster than Snowflake; 17X faster than Databricks; and 36X faster than Google BigQuery.

The performance to load data from the object store with MySQL HeatWave Lakehouse is: 9X faster than Amazon Redshift; 2X faster than Snowflake; 6X faster than Databricks; and 8X faster than Google BigQuery.

MySQL HeatWave's performance is a result of its scale-out architecture that enables massive parallelism to provision the cluster, load data, and process queries with up to 512 nodes, the company states. In addition, enhancements to MySQL Autopilot automate metadata creation for object files and dynamically adapt to the performance of the underlying object store to provide the best performance in any OCI region.

MySQL HeatWave is the only cloud service that provides transaction processing, real-time analytics, machine learning, data lake querying, and machine learning-based automation within a single MySQL database service, the company states.

A core part of Oracle's distributed cloud strategy, MySQL HeatWave is available in OCI, natively on Amazon Web Services, as part of the Oracle Database Service for Azure, and in customers data centers with OCI Dedicated Region.