News

Unveiled last June, the Apache Spark cloud-hosted platform from Databricks has now opened its doors for business.
Together, these Spark 3.0 enhancements deliver an overall 2x boost to Spark SQL’s performance relative to Spark 2.4. But according to Databricks, on 60 out of 102 queries, the speedups ranged from 2x ...
First created as part of a research project at UC Berkeley AMPLab, Spark is an open source project in the big data space, built for sophisticated analytics, speed, and ease of use. It unifies critical ...
Databricks adds new SQL Analytics Workspace and Endpoint features, consolidating its acquisition of Redash and bolstering its "data lakehouse" marketing push.
The technical preview of Spark 2.0 is available in the company’s cloud-based Big Data platform, Databricks Community Edition.
Hydrolix, the company transforming the economics of log data with its streaming data lake platform, is unveiling a new Apache Spark connector that democratizes the power of Databricks to customers' ...
Databricks has announced the general availability of Apache Spark 1.4, including SparkR, a new R API for data scientists. Version 1.4 of the open-source Big Data processing and streaming engine ...
Spark also has an API set that allows it to interface with applications written in Python, Scala, Java, SQL, and R in a consistent manner. So Spark it is a natural thing to add to a commercial Hadoop ...
The Apache Spark community has improved support for Python to such a great degree over the past few years that Python is now a “first-class” language, and no longer a “clunky” add-on as it once was, ...