Tag Archives: IBM Infosphere Big Insights

Big Analytics Roundup (April 20, 2015)

Top news this week: a couple of Spark maintenance releases, some interesting new Apache projects, an announcement from Hortonworks, and some interesting content from Databricks and Teradata. Also in the news this week, North Bridge and Black Duck Software release their ninth annual Future of Open Source survey. Meanwhile, Hortonworks, IBM and Pivotal announce ODP harmonization, round up endorsements from


Spark Summit 2014 Roundup

Key highlights from the 2014 Spark Summit: Spark is the single most active project in the Hadoop ecosystem; among Hadoop distributors, Cloudera and MapR are clear leaders with Spark; SAP now offers a certified Spark distribution and integration with HANA; DataStax has delivered a Cassandra connector for Spark; Databricks plans to offer a cloud service for Spark; Spark SQL will absorb


Apache Spark for Big Analytics (Updated for Spark Summit and Release 1.0.1)

Updated and bumped July 10, 2014. For a PowerPoint version on SlideShare, go here.

Introduction

Apache Spark is an open source distributed computing framework for advanced analytics in Hadoop. Originally developed as a research project at UC Berkeley’s AMPLab, the project achieved incubator status in Apache in June 2013 and top-level status in February 2014. According to one analyst, Apache Spark is among the five
