Tag Archives: Spark SQL

Big Analytics Roundup (September 26, 2016)

Note to readers: Recently, I’ve noticed that news about events that occur on Tuesdays seems stale by the time I publish on Monday. Beginning this week, I’m shifting to a new publication model, posting analysis of events as they happen instead of a weekly roundup. You could say I’m switching from batch updates to real-time updates, which should please Nathan Marz.

Read more

Big Analytics Roundup (September 19, 2016)

Many thanks to Australia’s Dez Blanchfield for his contributions to this roundup. We set out to create a special “Australia/APAC” edition; however, most of the stories have a global interest: chips are chips and deep learning is deep learning wherever you live. We did find this story, profiling a Tasmanian oyster farm that uses Microsoft’s IoT hub. Well, that’s embarrassing. MapR’s

Read more

Spark 2.0 Released

The Apache Spark team announces the production release of Spark 2.0.0.  Release notes are here. Read below for details of the new features, together with explanations culled from Spark Summit and elsewhere. Measured by the number of contributors, Apache Spark remains the most active open source project in the Big Data ecosystem. The Spark team guarantees API stability for all production

Read more

Big Analytics Roundup (February 29, 2016)

Happy Leap Day.  Tachyon’s rebranding as Alluxio, release of CaffeOnSpark and GA for Google Cloud Dataproc lead the hard news this week.  The Alluxio announcement has inspired big thinkers to share big thoughts.  And, we have a nice crop of explainers.  Scroll down to the bottom for another SQL on Hadoop benchmark. Explainers — In SearchDataManagement, Jack Vaughn explains Spark

Read more
« Older Entries