Tag Archives: Adatao

Big Analytics Roundup (May 18, 2015)

Light news: announcements from Dato, Google, Oracle and Pentaho, plus other cool stuff. On the PWC technology blog, Alan Morrison and Bo Parker interview Martin Van Ryswyk and Marko Rodriguez of Datastax about graph analytics.  PWC’s headline writer gets it wrong; the article is about graph engines and not graph databases.  Special-purpose graph databases, like special-purpose columnar databases, are a

Read more

Big Analytics Roundup (May 11, 2015)

Lots of news this week, to compensate for last week’s lame haul. In an excellent post on O’Reilly Radar, Ben Lorica surveys the landscape of workbooks, notebooks and workflow tools, which he categorizes by user persona. On GitHub, a collection of links for streaming analytics (h/t O’Reilly Data). In a “twofer”, VentureBeat plugs its GrowthBeat Summit and a report on

Read more

Spark Summit 2014 Roundup

Key highlights from the 2014 Spark Summit: Spark is the single most active project in the Hadoop ecosystem Among Hadoop distributors, Cloudera and MapR are clear leaders with Spark SAP now offers a certified Spark distribution and integration with HANA Datastax has delivered a Cassandra connector for Spark Databricks plans to offer a cloud service for Spark Spark SQL will absorb

Read more

Apache Spark for Big Analytics (Updated for Spark Summit and Release 1.0.1)

Updated and bumped July 10, 2014. For a powerpoint version on Slideshare, go here. Introduction Apache Spark is an open source distributed computing framework for advanced analytics in Hadoop.  Originally developed as a research project at UC Berkeley’s AMPLab, the project achieved incubator status in Apache in June 2013 and top-level status in February 2014.  According to one analyst, Apache Spark is among the five

Read more