Tag Archives: Apache Hadoop

Big Analytics Roundup (August 29, 2016)

TechCrunch reports results of a new study, which says that you really don’t need a co-founder after all. Next, they’ll be telling us we don’t need to floss. Python and R Matt Asay argues that Python is a gateway language that leads data scientists to R. (h/t Oliver Vagner). That’s oversimplified and mostly incorrect. The breadth of R’s analytics functionality tends

Read more

Big Analytics Roundup (March 21, 2016)

Minimal hard news this week, but some interesting survey results, analysis, articles, explainers and perspectives. — On his personal blog, Will Kurt describes Bayesian reasoning in the Twilight Zone. I tried to learn Bayesian reasoning a few years ago, but it conflicted with my prior beliefs. — Stack Overflow shares results from its 2016 Developer Survey. (h/t Thomas Ott) Key bits:

Read more

Big Analytics Roundup (February 1, 2016)

There are three hard news stories this week: MSFT releases its toolkit for deep learning, MSFT releases its hybrid cloud platform, and the Zeppelin team releases its latest version.  So MSFT wins the internet. In Computerworld, Doug Cutting celebrates Hadoop’s tenth birthday. InfoWorld distributes its 2016 Technology of the Year awards.  Among the 32 awards: Docker, Kubernetes, Apache Mesos, Apache Ambari, Apache Kafka, Amazon Aurora, Apache Spark,

Read more

2016 Big Analytics Predictions Roundup

Before publishing my own predictions for 2016 later this week, I thought it would be fun to round up published predictions on analytics and Big Data.  Looking through this list, I see a few patterns: — Streaming is hot.  Analysts do not seem to understand distinctions between streaming data, streaming analytics and real-time decisioning. — “Data Science” continues to be a

Read more

2015 in Big Analytics

Looking back at 2015, a few stories stand out: Steady progress for Spark, punctuated by two big announcements. Solid growth in cloud-based machine learning, led by Microsoft. Expanding options for SQL and OLAP on Hadoop. In 2015, the most widely read post on this blog was Spark is Too Big to Fail, published in April.  I wrote this post in

Read more

Big Analytics Roundup (December 21, 2015)

With the holidays approaching, we still have some hard news; plus, some explainers and end of 2015 roundups.  I’ll post my own roundup of 2015 later this week. On the BlueData blog, Anant Chintamaneni delivers an excellent overview of Hadoop virtualization, and the trend toward decoupling compute and storage. (h/t Hadoop Weekly) Quick Hits In InfoWorld, H2O.ai’s Sri Ambati delivers

Read more
« Older Entries