Machine Learning (ML) Definition

Users of Process Director v5.0 and higher have access to the Machine Learning, or ML, definition object. The ML Definition enables you to use Process Director's Artificial Intelligence capabilities to review a dataset, and make predictions based on the state of that dataset.

Process Director has long used Machine Learning/Artificial Intelligence (ML/AI) to analyze how Timelines work in the real world, and make predictions about when tasks will run in the current instance, based on the ML/AI analysis. For instance, this AI capability is how Process Director can predict when a task will be late. With the ML Definition, you can use the same capability to make predictions on any desired data, using a number of different statistical and analytic functions. The ML Definition object is globally available in Process Director, just like a Business Value, and can analyze data from both inside of and/or external to Process Director.

Keep in mind that this help topic isn't designed to teach you what statistical models are, or provide a lesson on how ML/AI works. It is, rather, intended to assist you in familiarizing yourself with the Process Director object itself. Just like with a SQL Business Value, where many users won't have the experience that enables them to construct SQL statements, the ML Definition assumes some basic familiarity with statistical functions, e.g., regression analysis, SVM, etc., to use effectively.

To create an ML Definition, simply select Machine Learning Definition from the Create New... dropdown menu located in the upper right corner of the Content List.

Properties Tab #

The Properties tab contains the basic configuration and publication options for the object.

Data Set Tab #

The Data Set tab enables you to choose the dataset that will be used for the ML Analysis. You can select any of the following data sources, and each selected data source will change the user interface to reflect the type of dataset you choose.

Transformation Tab #

Once your dataset has been selected from the Data Set tab, you may find it necessary to apply some changes to your data, or to ignore part of the data that you think isn't relevant to the decision or prediction that you'd like the ML Definition to make. This process of altering or ignoring some data in the dataset is called transformation, and conducting those transformations is the purpose of the Transformation tab.

To add a Transformation, click the Add Transformation button, which will make a transformation row appear on the page. Click the button again to add additional rows.

The transformation row initially has two properties to set:

Training Tab #

Once you have selected and transformed your dataset, Process Director needs to train itself on the data to apply the type of analysis or prediction you want to apply. The Training tab is where you conduct this training.

The Training tab is divided into two sections. The top section is where you configure the Prediction Feature. The purpose of ML/AI is to analyze data and make predictions based on that analysis, much like the Process Timeline, based on past instances of a Timeline definition, can predict whether a future Activity is likely to be late.

The Input Features section enables you to select the fields from your dataset that you'd like to analyze to create the prediction. Different fields will have different levels of effectiveness in the analysis. It may be difficult for you to know which fields will provide the best predictive result. You can do sample training on a field or collection of fields to enable Process Director to help you find the most effective fields to analyze by clicking the Train button.

For each available field, a graphical representation of the field's data is displayed. You can select a field to train on by checking the box next to the field, then fort each selected field, choose the type of data analysis you wish to perform during the training. For numerical columns, you can perform Categorical, Numerical, or Exponential analyses, while, for text fields, you can conduct Categorical or "Bag of Words" analyses.

Visualize Tab #

The visualize table enables you to select from your data columns and your predicted column to visualize the data set in graphical form. Once you have selected your data, click the Visualize button to see the data representation.

Schedule Tab #

You can manually publish your ML definition, using the current data, by selecting Publish from the actions menu in the upper right corner of the ML Definition. This will make the ML Definition available, but only the currently existing data will be used for all future analyses/predictions. In order to update the and retrain the ML definition on a continuing basis, so that new data is included in the ML Definition, we need to go to the Schedule tab to configure how often we want to retrain and republish the ML Definition.

The first item to configure is to turn on scheduled training and publishing. Change the Dropdown value from No Automatic Training & Publishing to Train & Publish on a Schedule. When you do so, scheduling controls will appear that enable you to specify the training and publishing schedule.