Our analysis and end-user discussions continue to demonstrate that a new modern data stack is emerging along with sophisticated data-oriented “personas,” such as data analysts and data scientists.
At its Data + AI Summit, Databricks today made the requisite number of announcements one would expect from a company’s flagship developer event. Among those are the launch of Delta Lake 2.0, the next ...
Databricks, the commercial entity created by the developers of the open-source Apache Spark project, announced $33M in Series B funding today and the launch of a new cloud product, its first one as ...
Spark Declarative Pipelines provides an easier way to define and execute data pipelines for both batch and streaming ETL workloads across any Apache Spark-supported data source, including cloud ...
You probably did not hear it here first. Spark has been making waves in big data for a while now, and 2017 has not disappointed anyone who has bet on its meteoric rise. That was a pretty safe bet ...
Two years in the making, Apache Spark 2.0 will officially debut in a few weeks from Databricks Inc., which just released a technical preview so Big Data developers could get their hands on the "shiny ...
Apache Spark is a project designed to accelerate Hadoop and other big data applications through the use of an in-memory, clustered data engine. The Apache Software Foundation describes the Spark project this ...