Tag Archives: ZoomData

Spark Summit 2014 Roundup

Key highlights from the 2014 Spark Summit: Spark is the single most active project in the Hadoop ecosystem Among Hadoop distributors, Cloudera and MapR are clear leaders with Spark SAP now offers a certified Spark distribution and integration with HANA Datastax has delivered a Cassandra connector for Spark Databricks plans to offer a cloud service for Spark Spark SQL will absorb

Read more

Apache Spark for Big Analytics (Updated for Spark Summit and Release 1.0.1)

Updated and bumped July 10, 2014. For a powerpoint version on Slideshare, go here. Introduction Apache Spark is an open source distributed computing framework for advanced analytics in Hadoop.  Originally developed as a research project at UC Berkeley’s AMPLab, the project achieved incubator status in Apache in June 2013 and top-level status in February 2014.  According to one analyst, Apache Spark is among the five

Read more
Recent Entries »