Tag Archives: Apache Drill

Big Analytics Roundup (March 7, 2016)

Hortonworks wins the internet this week beating the drum for its partnership with Hewlett-Packard Enterprise.  The story is down under “Commercial Announcements,” just above the story about Hortonworks’ shareholder lawsuit. Google releases a distributed version of TensorFlow, and HDP releases a new version of Dataflow.  We are reaching peak flow. IBM demonstrates its core values. Folks who fret about cloud security

Read more

Big Analytics Roundup (January 25, 2016)

This week, we have a new release of Spark-TS, Google’s proposal to create an Apache incubator project for Cloud Dataflow, Forrester’s assessment of Hadoop distributions, a couple of funding stories and a nice crop of explainers. Just a reminder that Spark Summit East is coming up February 16-18.  I’ll be delivering a talk in the Executive track on Spark and the

Read more

2015 in Big Analytics

Looking back at 2015, a few stories stand out: Steady progress for Spark, punctuated by two big announcements. Solid growth in cloud-based machine learning, led by Microsoft. Expanding options for SQL and OLAP on Hadoop. In 2015, the most widely read post on this blog was Spark is Too Big to Fail, published in April.  I wrote this post in

Read more

Big Analytics Roundup (December 21, 2015)

With the holidays approaching, we still have some hard news; plus, some explainers and end of 2015 roundups.  I’ll post my own roundup of 2015 later this week. On the BlueData blog, Anant Chintamaneni delivers an excellent overview of Hadoop virtualization, and the trend toward decoupling compute and storage. (h/t Hadoop Weekly) Quick Hits In InfoWorld, H2O.ai’s Sri Ambati delivers

Read more

Big Analytics Roundup (December 14, 2015)

Quite a bit of hard news this week — nine stories, including software releases from Hortonworks and Confluent, a milestone for Apache Kylin, and three funding stories.  Plus, a number of items “above the news.”  Let’s get to it. Risk Management on Spark The growing number of applications that run on Spark show the platform is maturing.  ThinkReactive, a consultancy

Read more

Big Analytics Roundup (November 23, 2015)

Eleven stories this week, including a new Flink release, new developments for Splice Machine, and a very big Spark HPC cluster in Warsaw. InfoWorld publishes a well-written practical guide to Deep Learning. Here are a couple of interesting articles on Spark: MapR’s Jim Scott offers a nice overview of Spark RDDs. Ian Pointer summarizes five things he hates about Spark.

Read more

Big Analytics Roundup (November 9, 2015)

My roundup of the Spark Summit Europe is here. Two important events this week: H2O World starts today and runs through Wednesday at the Computer History Museum in Mountain View CA.   Yotam Levy summarizes here and here. Open Data Science Conference meets November 14-15 at the Marriott Waterfront in SFO Five backgrounders and explainers: At HUG London, Apache’s Ufuk Celebi

Read more

Big Analytics Roundup (October 19, 2015)

Ten stories this week.  Don’t miss story #10, which recaps an analysis of collaboration and influence in the U.S.Congress using open source graph engines and a rich database of legislation. (1) Rexer: R Continues to Lead Rexer Analytics has released preliminary results from its 2015 survey of working analysts; Bob Muenchin reports.  One interesting snippet — reported tool use, as

Read more
« Older Entries Recent Entries »