Tag Archives: Spark Streaming

Big Analytics Roundup (September 6, 2016)

Jim Kyung-Soo Liew and Tamas Budavari of Johns Hopkins ask whether Tweet sentiment still predicts the stock market. Short version: it does, but the market has arbitraged away any advantage from trading on the information. So there you have it: the stock market is efficient with respect to fundamental information, technical information, and Tweets. Enterra’s Stephen DeAngelis celebrates the “Algorithmic

Big Analytics Roundup (August 1, 2016)

There are two big stories this week: Apache Spark 2.0 and Apache Mesos 1.0. There’s also a new release from Kylin, and a nice crop of explainers. IEEE Spectrum publishes its third annual ranking of top programming languages, based on twelve metrics drawn from Google Search, Google Trends, Twitter, GitHub, Stack Overflow, Reddit, Hacker News, CareerBuilder, Dice, and the IEEE

Spark 2.0 Released

The Apache Spark team announces the production release of Spark 2.0.0. Release notes are here. Read below for details of the new features, together with explanations culled from Spark Summit and elsewhere. Measured by the number of contributors, Apache Spark remains the most active open source project in the Big Data ecosystem. The Spark team guarantees API stability for all production

Big Analytics Roundup (May 31, 2016)

Google’s TPU announcement on May 18 continues to reverberate in the tech press. In Forbes, HPC expert Karl Freund dissects Google’s announcement, suggesting that Google is indulging in a bit of hocus-pocus to promote its managed services. Freund believes that TPUs are actually used for inference and not for model training; in other words, they replace CPUs rather than GPUs. Read
