Tag Archives: Amazon EMR

Big Analytics Roundup (April 4, 2016)

Strata + Hadoop World sparks a number of commercial announcements: AtScale has a new release, Microsoft previews R Server on HDInsight, and IBM puts Spark on a mainframe, FWIW. We also have a nice harvest of explainers and perspectives. Slides from Strata are available here. The folks at Domino Data ask: Is XGBoost 10X faster than H2O? We’ll never know the answer, since they…


Big Analytics Roundup (November 23, 2015)

Eleven stories this week, including a new Flink release, new developments for Splice Machine, and a very big Spark HPC cluster in Warsaw. InfoWorld publishes a well-written practical guide to Deep Learning. Here are a couple of interesting articles on Spark: MapR’s Jim Scott offers a nice overview of Spark RDDs. Ian Pointer summarizes five things he hates about Spark.
