Tag Archives: Data Artisans

Big Analytics Roundup (September 26, 2016)

Note to readers: Recently, I’ve noticed that news about events that occur on Tuesdays seems stale by the time I publish on Monday. Beginning this week, I’m shifting to a new publication model, posting analysis of events as they happen instead of a weekly roundup. You could say I’m switching from batch updates to real-time updates, which should please Nathan Marz.

Read more

Big Analytics Roundup (November 16, 2015)

Just three main stories this week: possible trouble for a pair of analytic startups; Google releases TensorFlow to open source; and H2O delivers new capabilities at its annual meeting. In other news, the Spark team announces Release 1.5.2, a maintenance release; and the Mahout guy announces Release 0.11.1, with bug fixes and performance improvements. (h/t Hadoop Weekly) Two items of

Read more

Big Analytics Roundup (October 26, 2015)

Fourteen stories this week, beginning with an announcement from IBM.  This week, IBM celebrates 14 straight quarters of declining revenue at its IBM Insight conference, appropriately enough at the Mandalay Bay in Vegas, where the restaurants are overhyped and overpriced. Meanwhile, the first Spark Summit Europe meets in Amsterdam, in the far more interesting setting of the Beurs van Berlage.  There

Read more

Big Analytics Roundup (October 19, 2015)

Ten stories this week. Don’t miss story #10, which recaps an analysis of collaboration and influence in the U.S. Congress using open source graph engines and a rich database of legislation. (1) Rexer: R Continues to Lead Rexer Analytics has released preliminary results from its 2015 survey of working analysts; Bob Muenchen reports. One interesting snippet: reported tool use, as

Read more

Big Analytics Roundup (September 28, 2015)

Strata+Hadoop World NYC is upon us.  Andrew Brust opines that there will be three themes at Strata this year: (1) Spark “versus” Hadoop; (2) streaming goes mainstream; (3) data governance matters.  My take: “Spark versus Hadoop” is controversy for the sake of people who like controversy.  Spark works with Hadoop, and Spark works with other platforms, or by itself.  Use

Read more

Big Analytics Roundup (August 31, 2015)

Top stories for the penultimate week of summer: an excellent SQL-on-Hadoop benchmark; a couple of stories about Gelly, Flink’s graph engine; Apache Ignite goes top-level; a preview of Spark 1.5; and new stuff from RStudio. Also, on Slideshare, evil mad scientist Paco Nathan presents on “Uber for Education.” SQL on Hadoop I missed this story in June, but better late

Read more

Big Analytics Roundup (August 17, 2015)

Catching up from vacation last week. Top stories: results of a SQL-on-Hadoop evaluation at Pearson; Google launches Dataflow (giving Flink a boost); while IBM shoehorns Spark onto a mainframe, Vertica gets the jump on IBM PureData with native Spark integration. Kaggle announces two new competitions: Springleaf Financial, an Indiana credit union founded in 1920, has rebranded to target millennials. They

Read more

Big Analytics Roundup (August 3, 2015)

This week: IBM pours new wine into old bottles; priorities for the newly formed R Consortium; insight into Spark Streaming and Spark ML pipelines; and the usual snark. The Linux Foundation’s Apache Big Data conference to be held in Budapest in September has already posted slides featuring Spark, Ignite, S2Graph, Kylin and WSO2. Greta Roberts of Talent Analytics wants you

Read more

Big Analytics Roundup (July 27, 2015)

Top stories this week: Palantir’s valuation grows, Continuum Analytics gets a bump, Cloudera announces a Python interface for Impala, and we have a winner in KDD Cup 2015. Nate Desmond chronicles Palantir’s $15 Billion growth story just as the company hits $20 Billion. Conversion Logic wins the KDD Cup 2015, which L.A. Biz characterizes as the “Nerd Olympics”. Here’s a picture

Read more