H2O.ai Releases Steam

With minimal fanfare, H2O.ai releases the Steam AI engine, which it describes as “…an end-to-end platform that streamlines the entire process of building and deploying smart applications.”  Overall, the software looks like a reasonable product extension for data science teams committed to H2O for machine learning.

H2O.ai’s product strategy for Steam has evolved over time; as late as mid-August, the company planned to release Steam under a commercial license. In September, the company settled on an open source release under an AGPL license and laid off about ten inside sellers and account executives. The layoffs make sense in that context — there’s not much point in having sellers when there is no product to sell.


Setting aside broad claims about Steam as an “AI engine”, the software adds cluster management, model management and model deployment capabilities to the H2O stack. Here is a demo of the software.

For cluster management, Steam can start and stop H2O clusters under YARN, or connect to predefined clusters. This capability decouples H2O software administration from Hadoop provisioning; however, it’s a stretch to characterize this as support for elastic computing. True elastic machine learning doesn’t simply decouple the software from provisioning, it manages the provisioning as well, scaling out according to the demands of the job. An H2O user who needs more computing power for a particular job will still have to contact an administrator.

While H2O.ai accurately states that Steam works on all major cloud platforms, it does so under an IaaS/PaaS model. In other words, a user or administrator manually procures needed compute instances. That is quite a contrast to managed services like Qubole (which scales out and back automatically), or Databricks (which offers self-service provisioning in an integrated notebook.)

The model management capability enables an H2O user to save models, manually build a leaderboard and compare model performance. For model deployment, Steam offers the user a capability to deploy models to services accessible either through an API or a REST interface. These are useful capabilities for organizations that plan to rely exclusively on H2O for machine learning.

For the record, however, I am skeptical of any model management and deployment facility that is tightly coupled to a single model training platform. Data scientists use diverse tools for machine learning; no single tool or platform meets all needs. An enterprise model management and deployment should manage all of an organization’s models regardless of the tools for model training. That’s not a criticism of H2O.ai — you can’t blame vendors for moving forward with their own deployment tools — it’s a caution to clients to avoid single-platform solutions to the model management and deployment problem.


  • Hi Thomas, good round up. Curious, what model management platforms have you seen and/or used that you think look good? Agree that cross-vendor model management is ideal.

  • Hey Thomas, great summary. Curious, what model management platforms have you seen/used that look good and are vendor-agnostic?

    • Great question. I’m researching that currently. At the moment, I’m not sure that there are ANY commercial platforms that are platform agnostic, so organizations build their own.

      • I’d be interested to hear if you find anything, but agree there isn’t a great deal I can see at this time. I do think that the concept has a lot of potential: H2O Steam looks like a reasonable starting point (albeit vendor-specific). Thanks!

  • There are a number of cross-vendors solution. We are evaluating some products like Data Robot, Domino, YHat, Alpine, Sense, etc.

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s

This site uses Akismet to reduce spam. Learn how your comment data is processed.