× TECHNOLOGY
SOFTWARE NETWORKING CLOUD IT SERVICES MOBILE SECURITY STORAGE IOT DATA ANALYTICS CYBER SECURITY ORACLE SAP BIG DATA
BUSINESS
DIGITAL MARKETING ERP RETAIL HEALTHCARE TELECOM
MAGAZINE
CURRENT ISSUE ARCHIVE
OPINION ABOUT US CONTACT US

Splice Machine integrates Hadoop and Apache Spark in Version 2.0

splice machine integrates hadoop and apache spark in version 2

Today's insurmountable data volumes have given rise to a variety of new database options. Each of them has their own particular strengths and features.

Splice Machine 2.0 is the latest version of its RDBMS which integrates the open-source Apache Spark engine into its existing Hadoop-based architecture, creating a flexible hybrid SQL database that lets businesses perform transactional and analytical workloads at the same time.

"Most in-memory systems require you to store all data in memory," said Monte Zweben, Splice Machine's CEO, in an interview last month. “Such technologies can become prohibitively expensive as data volumes grow. We're doing just compute in memory -- you can store data elsewhere," he said.

At one-fourth the cost, the new, flexible hybrid database enables businesses to perform simultaneous OLAP and OLTP workloads and increase performance over traditional RDBMS, such as Oracle & MySQL, by 10-20X.

“Splice Machine 2.0 uses in-memory computation to bring forth analytical business-intelligence results faster but uses Hadoop's HBase database to durably store and access data at scale. Benefits include lower cost and higher speed”, Zweben said.
"Our endeavor is to use in-memory to create an integrated hybrid technology," he said. "We'll have transactions hitting our database while simultaneously doing the BI without either impeding the other."

With separate processes and resource management for Hadoop and Spark, the Splice Machine RDBMS can ensure that large, complex analytical-processing queries do not overwhelm time-sensitive transactional ones. For example, users can set custom priority levels for analytical queries to ensure that important reports are not blocked behind a massive batch process that consumes all cluster resources.

“The result is performance between 10 and 20 times better than what's offered by traditional relational database management systems, at as little as one-fourth the cost”, the company said.

“Splice Machine 2.0 is particularly well-suited for use in applications including digital marketing, operational data lakes, data warehouse offloads and the Internet of Things”, it added. 

MAGAZINES