Meta and partners build Velox open source execution engine

2022-09-03 06:43:44 By : Ms. Grace Xu

Meta, along with a group of its partners, on Wednesday revealed details about its new open source Velox technology that aims to provide a unified execution layer for data management and queries.

Among the vendors working alongside the Facebook parent company are commercial Presto platform provider Ahana, Intel and Voltron Data, which recently raised $110M in funding.

Velox is an effort to accelerate data management and data queries by replacing existing execution layers within different technologies, including Apache Spark and Presto. The execution layer is the component that includes the code that handles the movement and processing of data.

While Meta is starting to integrate Velox inside of Meta's platforms, the technology isn't ready for broader production use.

A key goal of the Velox technology is to provide a common standard for execution of data management and processing.

Most data modernization initiatives, including this one, aim for consolidation and simplicity, noted Kevin Petrie, an analyst at Eckerson Research.

But the reality is that most enterprise data environments grow more heterogenous by the day. Multiplying workloads often have specialized requirements, and that complexity extends to data processing engines, Petrie said.

"Enterprises need to simplify how they build, deploy, integrate, reuse and adapt their processing engines," he said. "Velox seeks to help by offering an execution engine that unifies common engine components. This has the potential to make engines more modular, interoperable and reusable, thereby simplifying data environments."

Velox is not a tool for data analysts, said Philip Bell, developer advocate at Meta. Rather, Velox is a modular library to be used by those who build and maintain large-scale data processing and storage platforms and is a replacement for the compute layer in those platforms, he said.

"Velox is a self-contained execution engine that replaces existing layers within data systems," he said.

An execution engine handles some of the computation-intensive work inside of a data computation engine. Velox can replace multiple execution engines found in various data computation engines to streamline data workflows, according to Bell.

Meta uses several different data computation engines for different tasks. Specifically, Bell noted that Meta is looking to replace the execution layer of Presto and Spark with Velox within Meta.

"We are actively integrating Velox into several production systems at Meta," he said. "There are several milestones ahead of us before Velox is production ready more broadly and we hope many will join us in achieving those goals."

Ahana is among the contributors to the Velox open source effort led by Meta. Steven Mih, co-founder and CEO of Ahana, explained that Velox provides a code acceleration library written in the C++ programming language that can plug into different query engines, including Presto.

Today Presto uses the Java programming language to power the execution layer that communicates with different data sources to process and query data. The goal with Velox's new execution engine is to accelerate the processing with an optimized C++ code base.

While he declined to say when Ahana will put Velox into production code, Mih noted that he expects there to be a positive impact on performance when it's ready. Mih also said he is hopeful the Velox code will move to a third-party open source organization, such as the Linux Foundation or the Apache Software Foundation, at some point in the future.

"I'm a big believer in having third-party governance for open source versus it being controlled by any one company or especially one vendor," Mih said.

Coming after the addition of natural language processing capabilities, the new embedded analytics tool strengthens the BI ...

The startup aims to augment human decision-making with automation, and its new tools find and explain changes in data, analyze ...

Self-service BI tools benefit organizations in four major ways, including improved decision-making, organizational efficiency, ...

Many organizations struggle to manage their vast collection of AWS accounts, but Control Tower can help. The service automates ...

There are several important variables within the Amazon EKS pricing model. Dig into the numbers to ensure you deploy the service ...

AWS users face a choice when deploying Kubernetes: run it themselves on EC2 or let Amazon do the heavy lifting with EKS. See ...

A content strategy that includes headless CMS, useful product information and visuals can draw the customer's focus to your ...

With seemingly endless options to choose from, picking the right paperless office software tool starts with understanding what ...

DAM systems help marketing teams manage the multimedia content they work with every day. With this quiz, test your knowledge of ...

With its Cerner acquisition, Oracle sets its sights on creating a national, anonymized patient database -- a road filled with ...

Oracle plans to acquire Cerner in a deal valued at about $30B. The second-largest EHR vendor in the U.S. could inject new life ...

The Supreme Court ruled 6-2 that Java APIs used in Android phones are not subject to American copyright law, ending a ...

Over its 50-year history, SAP rode business and technology trends to the top of the ERP industry, but it now is at a crossroads ...

Third-party support providers make a pitch that they can provide greater flexibility at a lower cost, but customers should think ...

SAP's Thomas Saueressig explains the future of multi-tenant cloud ERP for SAP customers and why it will take some large companies...

All Rights Reserved, Copyright 2005 - 2022, TechTarget Privacy Policy Cookie Preferences Do Not Sell My Personal Info