Tag Archives: Bluedata

Big Analytics Roundup (August 29, 2016)

TechCrunch reports results of a new study, which says that you really don’t need a co-founder after all. Next, they’ll be telling us we don’t need to floss. Python and R Matt Asay argues that Python is a gateway language that leads data scientists to R. (h/t Oliver Vagner). That’s oversimplified and mostly incorrect. The breadth of R’s analytics functionality tends

Read more

Big Analytics Roundup (June 27, 2016)

We have announcements from BlueData, Databricks, and DataStax this week, plus a nice crop of explainers. Also, a bit of catch-up, something from May that I missed: Bob Hayes publishes an interesting summary of his recent survey of data scientists. Includes an infographic and slides. Thiemo Fetzer asks: did the weather affect the Brexit vote? Spoiler: he says no. Presented without

Read more

Big Analytics Roundup (March 21, 2016)

Minimal hard news this week, but some interesting survey results, analysis, articles, explainers and perspectives. — On his personal blog, Will Kurt describes Bayesian reasoning in the Twilight Zone. I tried to learn Bayesian reasoning a few years ago, but it conflicted with my prior beliefs. — Stack Overflow shares results from its 2016 Developer Survey. (h/t Thomas Ott) Key bits:

Read more

Big Analytics Roundup (February 15, 2016)

We have a nice harvest of explainers this week, plus eight hard news stories: Gartner Updates Advanced Analytics MQ DataRobot Lands Cash MapR and Hortonworks Report Robust Revenue StreamAnalytix Adds Spark Support BlueData Announces Something Looker Delivers SQL on Hadoop New RStudio Release New Anaconda Release R has a new logo. On the Databricks blog, Tim Hunter and Joseph Bradley announce

Read more

Big Analytics Roundup (September 28, 2015)

Strata+Hadoop World NYC is upon us.  Andrew Brust opines that there will be three themes at Strata this year: (1) Spark “versus” Hadoop; (2) streaming goes mainstream; (3) data governance matters.  My take: “Spark versus Hadoop” is controversy for the sake of people who like controversy.  Spark works with Hadoop, and Spark works with other platforms, or by itself.  Use

Read more

Big Analytics Roundup (June 22, 2015)

Last week’s Spark Summit is the big news driver for this roundup: On the Databricks blog, Scott Walent recaps the summit here Anmol Rajpurohit writes KDnuggets’ play-by-play for Day One and Day Two My preliminary report is here; full report when slides are available from the sessions. Spark will be one of several technologies featured at the inaugural In-Memory Computing

Read more

Big Analytics Roundup (June 1, 2015)

The Open Data Science Conference launched successfully in Boston this past weekend, attracting more than 1,200 attendees.  Sponsors included Booz Allen, Continuum Analytics, DataRobot, McGraw Hill Education and RStudio, among others.  Organizers plan additional events this year in Boston and San Francisco. Mary Meeker releases her latest Internet Trends Report. In Forbes, Louis Columbus rounds up analyst coverage of the Big Analytics

Read more

Big Analytics Roundup (May 11, 2015)

Lots of news this week, to compensate for last week’s lame haul. In an excellent post on O’Reilly Radar, Ben Lorica surveys the landscape of workbooks, notebooks and workflow tools, which he categorizes by user persona. On GitHub, a collection of links for streaming analytics (h/t O’Reilly Data). In a “twofer”, VentureBeat plugs its GrowthBeat Summit and a report on

Read more

Spark 1.1 Update

For an overview of Spark, see the Apache Spark Page. On September 11, the Spark team announced release of Spark 1.1.   This latest version of Spark includes a number of significant enhancements: As announced at the Spark Summit, Shark is now converged with Spark SQL.  Databricks has migrated its Shark workloads to Spark, and reports 2X-5X performance improvement. The

Read more