Tag Archives: Dataiku

The Year in Machine Learning (Part Four)

This is the fourth installment in a four-part review of 2016 in machine learning and deep learning. — Part One covered Top Trends in the field, including concerns about bias, interpretability, deep learning’s explosive growth, the democratization of supercomputing, and the emergence of cloud machine learning platforms. — Part Two surveyed significant developments in Open Source machine learning projects, such as R, Python, Spark, Flink,

Read more

Machine Learning Roundup 10/14/2016

Machine learning (ML) and deep learning (DL) content from the past 24 hours. Note to readers: Big Analytics will rebrand as ML/DL on Monday, October 17. Fundamentals — Cynthia Harvey explains the difference between AI, ML, and DL. Issues — Arun Krishnan asks: can algorithms reinforce our biases? — In Nature, Kate Crawford and Ryan Calo note rising use of AI, summarize

Read more

Big Analytics Roundup (August 8, 2016)

So, Apple acquires Turi for $200 million. Hopefully, Apple did not pay for brand equity. Bridget Botelho argues that businesses must either disrupt or be disrupted, and outlines the role of machine learning. Someone should write a book about that. Conference Announcements — Flink Forward announces the schedule for its second annual event, to be held September 12-14 in Berlin. —

Read more

Big Analytics Roundup (April 25, 2016)

Mesosphere wins the internet this week with its announcement that it has open sourced DC/OS, its datacenter virtualization project built around Apache Mesos. While not an “analytics” project per se, DC/OS has the potential to transform how organizations provision and deploy their analytics platforms. In a nutshell, Apache Mesos distributes workloads across physical IT resources. DC/OS adds a container orchestration platform; installation,

Read more

Big Analytics Roundup (April 4, 2016)

Strata + Hadoop World sparks a number of commercial announcements: AtScale has a new release, Microsoft previews R Server on HDInsight, and IBM puts Spark on a mainframe, FWIW. We also have a nice harvest of explainers and perspectives. Slides from Strata available here. The folks at Domino Data ask: Is XGBoost 10X faster than H2O? We’ll never know the answer, since they

Read more

Big Analytics Roundup (March 21, 2016)

Minimal hard news this week, but some interesting survey results, analysis, articles, explainers and perspectives. — On his personal blog, Will Kurt describes Bayesian reasoning in the Twilight Zone. I tried to learn Bayesian reasoning a few years ago, but it conflicted with my prior beliefs. — Stack Overflow shares results from its 2016 Developer Survey. (h/t Thomas Ott) Key bits:

Read more

Big Analytics Roundup (February 29, 2016)

Happy Leap Day.  Tachyon’s rebranding as Alluxio, release of CaffeOnSpark and GA for Google Cloud Dataproc lead the hard news this week.  The Alluxio announcement has inspired big thinkers to share big thoughts.  And, we have a nice crop of explainers.  Scroll down to the bottom for another SQL on Hadoop benchmark. Explainers — In SearchDataManagement, Jack Vaughn explains Spark

Read more

Gartner’s 2016 MQ for Advanced Analytics Platforms

This is a revised and expanded version of a story that first appeared in the weekly roundup for February 15. Gartner publishes its 2016 Magic Quadrant for Advanced Analytics Platforms.   You can get a free copy here from RapidMiner (registration required.)  The report is a muddle that mixes up products in different categories that don’t compete with one another,

Read more
« Older Entries Recent Entries »