Tag Archives: Spark Streaming

Big Analytics Roundup (September 6, 2016)

Jim Kyung-Soo Liew and Tamas Budavari of Johns Hopkins ask whether Tweet sentiments still predict the stock market. Short Version: they do, but the market has arbitraged away any advantage from trading on the information. So there you have it: the stock market is efficient with respect to fundamental information, technical information, and Tweets. Enterra’s Stephen DeAngelis celebrates the “Algorithmic

Read more

Big Analytics Roundup (August 1, 2016)

There are two big stories this week: Apache Spark 2.0 and Apache Mesos 1.0. There’s also a new release from Kylin, and a nice crop of explainers. IEEE Spectrum publishes its third annual ranking of top programming languages, based on twelve metrics drawn from Google Search, Google Trends, Twitter, GitHub, Stack Overflow, Reddit, Hacker News, CareerBuilder, Dice, and the IEEE

Read more

Spark 2.0 Released

The Apache Spark team announces the production release of Spark 2.0.0.  Release notes are here. Read below for details of the new features, together with explanations culled from Spark Summit and elsewhere. Measured by the number of contributors, Apache Spark remains the most active open source project in the Big Data ecosystem. The Spark team guarantees API stability for all production

Read more

Big Analytics Roundup (May 31, 2016)

Google’s TPU announcement on May 18 continues to reverberate in the tech press. In Forbes, HPC expert Karl Freund dissects Google’s announcement, suggesting that Google is indulging in a bit of hocus-pocus to promote its managed services.  Freund believes that TPUs are actually used for inference and not for model training; in other words, they replace CPUs rather than GPUs. Read

Read more

Big Analytics Roundup (March 21, 2016)

Minimal hard news this week, but some interesting survey results, analysis, articles, explainers and perspectives. — On his personal blog, Will Kurt describes Bayesian reasoning in the Twilight Zone. I tried to learn Bayesian reasoning a few years ago, but it conflicted with my prior beliefs. — Stack Overflow shares results from its 2016 Developer Survey. (h/t Thomas Ott) Key bits:

Read more

Big Analytics Roundup (February 29, 2016)

Happy Leap Day.  Tachyon’s rebranding as Alluxio, release of CaffeOnSpark and GA for Google Cloud Dataproc lead the hard news this week.  The Alluxio announcement has inspired big thinkers to share big thoughts.  And, we have a nice crop of explainers.  Scroll down to the bottom for another SQL on Hadoop benchmark. Explainers — In SearchDataManagement, Jack Vaughn explains Spark

Read more
« Older Entries