Productionizing Apache Spark™ MLlib Models for Real-time Prediction Serving

Logo
Presented by

Joseph Bradley and Sue Ann Hong

About this talk

Data science and machine learning tools traditionally focus on training models. When companies begin to employ machine learning in actual production workflows, they encounter new sources of friction such as sharing models across teams, deploying identical models on different systems, and maintaining featurization logic. In this webinar, we discuss how Databricks provides a smooth path for productionizing Apache Spark MLlib models and featurization pipelines. Databricks Model Scoring provides a simple API for exporting MLlib models and pipelines. These exported models can be deployed in many production settings, including: * External real-time low-latency prediction serving systems, without Spark dependencies, * Apache Spark Structured Streaming jobs, and * Apache Spark batch jobs. In this webinar, we overview our solution’s functionality, describe its architecture, and demonstrate how to use it to deploy MLlib models to production.
Related topics:

More from this channel

Upcoming talks (0)
On-demand talks (92)
Subscribers (39062)
No matter at what stage of your data journey you’re in, this channel will help you get a better understanding of the fundamental concepts of the Databricks Lakehouse platform and the problems we’re helping to solve for data teams.