Tag Archives: Spark

Spark 2.0 Released

The Apache Spark team announces the production release of Spark 2.0.0.  Release notes are here. Read below for details of the new features, together with explanations culled from Spark Summit and elsewhere. Measured by the number of contributors, Apache Spark remains the most active open source project in the Big Data ecosystem. The Spark team guarantees API stability for all production

Spark 1.5 Released

On September 9, the Spark team announced availability of Release 1.5.  (Release notes here.)  230 developers contributed more than 1,400 commits, the largest release to date.  Spark continues to expand its contributor base, the best measure of health for an open source project. On the Databricks blog, Reynold Xin and Patrick Wendell summarize the key new bits:  Some highlights: Project Tungsten, a

Big Analytics Roundup (August 31, 2015)

Top stories for the penultimate week of summer: an excellent SQL-on-Hadoop benchmark; a couple of stories about Gelly, Flink’s graph engine; Apache Ignite goes top-level; a preview of Spark 1.5; and new stuff from RStudio. Also, on Slideshare, evil mad scientist Paco Nathan presents on “Uber for Education.” SQL on Hadoop I missed this story in June, but better late

Big Analytics Roundup (August 17, 2015)

Catching up from vacation last week.  Top stories: results of a SQL-on-Hadoop evaluation at Pearson; Google launches Dataflow (giving Flink a boost); while IBM shoehorns Spark onto a mainframe, Vertica gets the jump on IBM PureData with native Spark integration. Kaggle announces two new competitions: Springleaf Financial, an Indiana credit union founded in 1920, has rebranded to target millennials. They

Big Analytics Roundup (August 3, 2015)

This week: IBM pours new wine into old bottles; priorities for the newly formed R Consortium; insight into Spark Streaming and Spark ML pipelines; and the usual snark. The Linux Foundation’s Apache Big Data conference to be held in Budapest in September has already posted slides featuring Spark, Ignite, S2Graph, Kylin and WSO2. Greta Roberts of Talent Analytics wants you

Big Analytics Roundup (July 27, 2015)

Top stories this week:  Palantir’s valuation grows, Continuum Analytics gets a bump, Cloudera announces a Python interface for Impala, and we have a winner in KDD Cup 2015. Nate Desmond chronicles Palantir’s $15 Billion growth story just as the company hits $20 Billion. Conversion Logic wins the KDD Cup 2015, which L.A. Biz characterizes as the “Nerd Olympics”. Here’s a picture