Tag Archives: Apache Hive

Big Analytics Roundup (September 6, 2016)

Jim Kyung-Soo Liew and Tamas Budavari of Johns Hopkins ask whether Tweet sentiments still predict the stock market. Short version: they do, but the market has arbitraged away any advantage from trading on the information. So there you have it: the stock market is efficient with respect to fundamental information, technical information, and Tweets. Enterra’s Stephen DeAngelis celebrates the “Algorithmic

Read more

Big Analytics Roundup (July 25, 2016)

We have some more summer reading this week; plus, Splice Machine announces availability of its open source Community Edition, and Google launches two new machine learning APIs. There are so many Spark stories I’ve created a special section for them. Plus we have the usual explainers, perspectives, and news. Quant headhunter Linda Burtch repeats her survey of working analysts in her

Read more

Big Analytics Roundup (April 18, 2016)

In hard news this week, Storm hits a milestone with Release 1.0, Google releases TensorFlow 0.8 with distributed computing support, and DataStax announces DataStax Enterprise Graph. And, following on NVIDIA’s DGX-1 announcement last week there are a number of items on Deep Learning featured below. Deep Learning — Adrian Colyer summarizes a paper that summarizes 900 other papers on Deep Learning. —

Read more

Big Analytics Roundup (March 7, 2016)

Hortonworks wins the internet this week, beating the drum for its partnership with Hewlett-Packard Enterprise. The story is down under “Commercial Announcements,” just above the story about Hortonworks’ shareholder lawsuit. Google releases a distributed version of TensorFlow, and HDP releases a new version of Dataflow. We are reaching peak flow. IBM demonstrates its core values. Folks who fret about cloud security

Read more

2015 in Big Analytics

Looking back at 2015, a few stories stand out: Steady progress for Spark, punctuated by two big announcements. Solid growth in cloud-based machine learning, led by Microsoft. Expanding options for SQL and OLAP on Hadoop. In 2015, the most widely read post on this blog was Spark is Too Big to Fail, published in April.  I wrote this post in

Read more

Big Analytics Roundup (December 7, 2015)

Cloudera’s expanded Spark support leads the news this week, together with a Data Science Virtual Machine from Microsoft.  Neural network devotees will be pleased to see that Keras now runs on TensorFlow. On the Databricks blog, H2O.ai’s Michal Malohlava describes Sparking Water, a Spark package that enables data scientists to build machine learning pipelines that integrate Spark and H2O functions.

Read more

Big Analytics Roundup (July 27, 2015)

Top stories this week: Palantir’s valuation grows, Continuum Analytics gets a bump, Cloudera announces a Python interface for Impala, and we have a winner in KDD Cup 2015. Nate Desmond chronicles Palantir’s $15 Billion growth story just as the company hits $20 Billion. Conversion Logic wins the KDD Cup 2015, which L.A. Biz characterizes as the “Nerd Olympics”. Here’s a picture

Read more