Screen Shot 2016-02-13 at 10.34.28 AM

This is a revised and expanded version of a story that first appeared in the weekly roundup for February 15. Gartner publishes its 2016 Magic Quadrant for Advanced Analytics Platforms.   You can get a free copy here from RapidMiner (registration required.)  The report is a muddle that mixes up products in different categories that […]

maxresdefault (4)

The Apache Spark team announces the production release of Spark 2.0.0.  Release notes are here. Read below for details of the new features, together with explanations culled from Spark Summit and elsewhere. Measured by the number of contributors, Apache Spark remains the most active open source project in the Big Data ecosystem. The Spark team guarantees […]

Chincoteague_ponies_by_Bonnie_Gruenberg5

We have some more summer reading this week; plus, Splice Machine announces availability of its open source Community Edition, and Google launches two new machine learning APIs. There are so many Spark stories I’ve created a special section for them. Plus we have the usual explainers, perspectives, and news. Quant headhunter Linda Burtch repeats her survey […]

FRU-Large-Beach-Chairs

We have lots of fresh material to read on the beach this week — most notably, the “read of the week” below, which might be better labeled as the “read of the year.”  We have another streaming engine to kick around, a slew of earnings releases in the coming week, and some new releases from GraphLab […]

Databricks_2015_survey

Databricks is running a short survey to understand the needs of Apache Spark users. If you haven’t taken the survey yet, do so today. For results of the 2015 survey, look here. Last year’s survey produced a number of interesting findings; here’s what I wrote back in September when Databricks released its report: ===== Databricks released results […]

SD_buffalo_roundup_5 (1)

Light news this week. We have results from an interesting survey on fast data, an excellent paper from Facebook and a nice crop of explainers. From one dumb name to another.  Dato loses trademark dispute, rebrands as Turi. They should have googled it first. Wikibon’s George Gilbert opines on the state of Big Data performance […]

DSCN9003

Quite a few open source announcements this week. One of the most interesting is Apache Bahir, which includes a number of bits spun out from Apache Spark. It’s another indicator of the size and strength of Spark, in case anyone needs a reminder. In other news, Altiscale and H2O.ai concurrently develop time travel: both vendors […]

IMG_0562

We have announcements from BlueData, Databricks, and DataStax this week, plus a nice crop of explainers. Also, a bit of catch-up, something from May that I missed: Bob Hayes publishes an interesting summary of his recent survey of data scientists. Includes an infographic and slides. Thiemo Fetzer asks: did the weather affect the Brexit vote? Spoiler: […]