Tag Archives: TypeSafe

Big Analytics Roundup (June 22, 2015)

Last week’s Spark Summit is the big news driver for this roundup: On the Databricks blog, Scott Walent recaps the summit here Anmol Rajpurohit writes KDnuggets’ play-by-play for Day One and Day Two My preliminary report is here; full report when slides are available from the sessions. Spark will be one of several technologies featured at the inaugural In-Memory Computing

Read more

Spark Updates

Here is a quick roundup of some recent Apache Spark news. (1) Databricks and Typesafe released results from a survey of 2,136 individuals (mostly developers).  Some key findings: 13% of respondents run Spark in production, 20% plan to use Spark in 2015 Most say they expect to use the 82% Spark core to replace MapReduce 88% say they use the Scala API

Read more

Apache Spark for Big Analytics (Updated for Spark Summit and Release 1.0.1)

Updated and bumped July 10, 2014. For a powerpoint version on Slideshare, go here. Introduction Apache Spark is an open source distributed computing framework for advanced analytics in Hadoop.  Originally developed as a research project at UC Berkeley’s AMPLab, the project achieved incubator status in Apache in June 2013 and top-level status in February 2014.  According to one analyst, Apache Spark is among the five

Read more