Tag Archives: Hive on Tez

Big Analytics Roundup (February 29, 2016)

Happy Leap Day.  Tachyon’s rebranding as Alluxio, release of CaffeOnSpark and GA for Google Cloud Dataproc lead the hard news this week.  The Alluxio announcement has inspired big thinkers to share big thoughts.  And, we have a nice crop of explainers.  Scroll down to the bottom for another SQL on Hadoop benchmark. Explainers — In SearchDataManagement, Jack Vaughn explains Spark

Read more

2015 in Big Analytics

Looking back at 2015, a few stories stand out: Steady progress for Spark, punctuated by two big announcements. Solid growth in cloud-based machine learning, led by Microsoft. Expanding options for SQL and OLAP on Hadoop. In 2015, the most widely read post on this blog was Spark is Too Big to Fail, published in April.  I wrote this post in

Read more

Big Analytics Roundup (August 31, 2015)

Top stories for the penultimate week of summer: an excellent SQL-on-Hadoop benchmark; a couple of stories about Gelly, Flink’s graph engine; Apache Ignite goes top-level; a preview of Spark 1.5; and new stuff from RStudio. Also, on Slideshare, evil mad scientist Paco Nathan presents on “Uber for Education.” SQL on Hadoop I missed this story in June, but better late

Read more