Tag Archives: Apache NiFi

Big Analytics Roundup (April 4, 2016)

Strata + Hadoop World sparks a number of commercial announcements: AtScale has a new release, Microsoft previews R Server on HDInsight, and IBM puts Spark on a mainframe, FWIW. We also have a nice harvest of explainers and perspectives. Slides from Strata are available here. The folks at Domino Data ask: Is XGBoost 10X faster than H2O? We’ll never know the answer, since they

Read more

Big Analytics Roundup (March 21, 2016)

Minimal hard news this week, but some interesting survey results, analysis, articles, explainers and perspectives. — On his personal blog, Will Kurt describes Bayesian reasoning in the Twilight Zone. I tried to learn Bayesian reasoning a few years ago, but it conflicted with my prior beliefs. — Stack Overflow shares results from its 2016 Developer Survey. (h/t Thomas Ott) Key bits:

Read more

Big Analytics Roundup (January 25, 2016)

This week, we have a new release of Spark-TS, Google’s proposal to create an Apache incubator project for Cloud Dataflow, Forrester’s assessment of Hadoop distributions, a couple of funding stories and a nice crop of explainers. Just a reminder that Spark Summit East is coming up February 16-18. I’ll be delivering a talk in the Executive track on Spark and the

Read more

Big Analytics Roundup (September 21, 2015)

Top story of the week: release of AtScale’s Hadoop Maturity Survey, which triggered a flurry of analysis. Meanwhile, the Economist ventures into the world of open source software and venture capital, embarrassing itself in the process; and IBM announces plans to use Spark in its search for extraterrestrial intelligence, a project that would be more useful if pointed toward IBM headquarters. AtScale

Read more