Tag Archives: Apache Cassandra

Big Analytics Roundup (April 18, 2016)

In hard news this week, Storm hits a milestone with Release 1.0, Google releases TensorFlow 0.8 with distributed computing support, and DataStax announces DataStax Enterprise Graph. And, following on NVIDIA’s DGX-1 announcement last week there are a number of items on Deep Learning featured below. Deep Learning — Adrian Colyer summarizes a paper that summarizes 900 other papers on Deep Learning. —

Read more

Big Analytics Roundup (September 21, 2015)

Top story of the week: release of AtScale’s Hadoop Maturity Survey, which triggered a flurry of analysis.  Meanwhile, the Economist ventures into the world of open source software and venture capital, embarrassing itself in the process; and IBM announces plans to use Spark in its search for extraterrestrial intelligence, a project that would be more useful if pointed toward IBM headquarters. AtScale

Read more

Big Analytics Roundup (May 25, 2015)

This week features new releases from Drill and Hive, plus announcements from DataStax and MemSQL. Andrew Brust summarizes the SQL options presented by Drill, Hive and Spark, noting that Drill’s “SQL everywhere” approach and DBMS vendors’ federated engines make the term “SQL on Hadoop” obsolete. Gartner surveys its panel of 284 people who rely on Gartner and concludes that Hadoop

Read more

Apache Spark for Big Analytics (Updated for Spark Summit and Release 1.0.1)

Updated and bumped July 10, 2014. For a powerpoint version on Slideshare, go here. Introduction Apache Spark is an open source distributed computing framework for advanced analytics in Hadoop.  Originally developed as a research project at UC Berkeley’s AMPLab, the project achieved incubator status in Apache in June 2013 and top-level status in February 2014.  According to one analyst, Apache Spark is among the five

Read more