DataRobot
Last Updated:
Analyst Coverage: Daniel Howard
DataRobot was founded in 2012, and its headquarters are located in Boston, MA. It has received $225 million in total funding, employs more than 500 people (of which approximately 175 are data scientists) and has a variety of business partners, including cloud vendors (Microsoft Azure, Google Cloud and AWS), data platforms (Hortonworks, Cloudera and Snowflake), self-service analytics vendors (Qlik, Alteryx, Looker and Tableau) and data science workbenches (Amazon SageMaker, Domino and Databricks).
Its core mission, supported by its namesake product, is to enable the AI-driven enterprise, in which predictive AI and analytics are usefully applied to every business process within an organisation.
DataRobot
Last Updated: 9th December 2019
Mutable Award: Gold 2018
DataRobot is an automated machine learning platform targeted at the enterprise. Its aim is to make it much faster and easier to create data science models and, in doing so, both afford greater efficiency to existing data scientists, and enable business users and business analysts to create data science and machine learning models themselves. To wit, a comparison between the processes for creating a model manually and in DataRobot is shown in Figure 1; it is very much in DataRobot’s favour.
DataRobot also provides a form of democratisation of your models, by making them available throughout the enterprise in a way that is easily shared and understood. This promotes communication and collaboration on data science, and helps prevent silos from forming around your data scientists.
DataRobot can be deployed on-premises as well as in-cloud via the top three cloud providers (AWS, Azure and Google Cloud). It can also be run on a Hadoop cluster and thereby supports big data and data lakes.
Customer Quotes
“We’ve already experienced tremendous cost and time savings with DataRobot.”
Steward Healthcare
“What DataRobot was able to accomplish in the first hour was more thorough and accurate than models we had built over the prior month.”
Crest Financial
“DataRobot justifies its place by providing value and returning significant ROI immediately.”
TD Ameritrade
DataRobot expediates the model creation process via extensive automation. The idea is that for most models the vast majority of work has already been done, in the form of existing algorithms and acknowledged best practices, and there is no reason to repeat this work. Hence in DataRobot this has all been automated, and consequently creating a model is a simple and straightforward process that requires neither programming nor mathematical skills. As can be seen in Figure 2, DataRobot supports predictive models as well as time series modelling.
The process for creating a new model begins with data ingestion, accomplished by dragging and dropping relevant datasets into the platform or importing them from an external database. If desired, you can also perform some preliminary, exploratory analytics at this stage, assisted by DataRobot via a selection of visualisations. You then need to specify what you would like to predict (for example, the value of a particular column) via201 text entry. There are advanced options available, but they are strictly optional, and most users are not expected to want or need them. Once you have provided this information, the platform will automatically create a number of models based on your data, each according to a particular ‘blueprint’, a pre-built collection of algorithms and best practices designed by experienced data scientists. These models are then ranked for accuracy, and you are invited to choose the one that is most appropriate (this will often, but not necessarily always, be the most accurate).
At this point, your model has been created. However, this is not the end of the story. Each model within the platform is equipped with a variety of visualisation options, including old standards such as line graphs and bar charts as well as more esoteric options such as word clouds. Each model also contains a workflow that describes exactly how the model was created. The steps in this workflow might include data transforms, model training, and so on, and each step is fully documented within DataRobot.
Models can be deployed manually or automatically, the latter via the use of an API. Model predictions can be displayed online and enterprise-wide, and each prediction comes with a full explanation that elaborates on the factors and reasoning that went into making it. This makes it easy for your business users to get access to predictive models and to understand the predictions they make, as well as why they make them. DataRobot has also recently released the capability to manage and monitor deployed models, allowing your users to efficiently manage their inventory of models in production and get indications on whether some of them are deteriorating and need to be updated.
DataRobot’s approach to AI is important for several reasons. First of all, many organisations are facing a gulf between the extensive AI-driven digital transformations they would like to implement, and the relatively small number of data scientists they have to actually implement them. There’s no denying that AI and machine learning are hot topics in the world of data, and a great many companies have started initiatives to leverage them within their businesses. However, data scientists themselves are few and far between, and appropriately expensive. By empowering anyone with sufficient domain knowledge to do meaningful work on data models, DataRobot dramatically expands the workforce available for implementing AI, allowing AI-based digital transformation to be achieved much more quickly. Moreover, even for an experienced data scientist, creating and deploying a model using DataRobot is far faster and more efficient than doing so by hand.
Moreover, the value of data science is maximised when it is deployed throughout the business in support of numerous existing business processes. Therefore, organisations would do well to democratise their data science and allow all business users to gain access to it, allowing them to leverage it for whichever purposes best suit their corner of the business. This is exactly what DataRobot provides, allowing all users to readily access and understand their organisation’s data models, and breaking down any silos that may have formed around its existing data science.
In short, by improving the efficiency of your data scientists, and by introducing a new pool of manpower to model creation, DataRobot dramatically increases the speed at which you can do data science. By democratising the result, it maximises the impact of said data science on your organisation.
The Bottom Line
The amount of automation DataRobot provides, and the efficiency and productivity gains thereof, simply cannot be overstated. Compared to creating models by hand, it’s like night and day. Needless to say, we highly recommend it.
Mutable Award: Gold 2018
Commentary
Coming soon.
Solutions
Research
Coming soon.