This topic describes the Population Stability Index (PSI), which measures the difference between the expected distribution and the actual distribution of data to evaluate the stability of a model.
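As a rough illustration of what "difference between two distributions" means here, the following minimal sketch computes PSI with the commonly used textbook formula, the sum over bins of (actual - expected) * ln(actual / expected). The function name `psi`, its arguments, and the `eps` floor for empty bins are assumptions made for this sketch; the source does not confirm that the psi feature class uses exactly this computation.

```python
import math


def psi(expected_props, actual_props, eps=1e-6):
    """Textbook PSI from two lists of per-bin proportions.

    Hypothetical helper for illustration only. eps replaces empty bins
    so that the logarithm stays defined (an assumption of this sketch).
    """
    if len(expected_props) != len(actual_props):
        raise ValueError("both distributions must use the same bins")
    total = 0.0
    for e, a in zip(expected_props, actual_props):
        e = max(e, eps)
        a = max(a, eps)
        # Each bin contributes a non-negative term.
        total += (a - e) * math.log(a / e)
    return total


# Identical distributions give 0; the value grows as they drift apart.
same = psi([0.5, 0.5], [0.5, 0.5])
shifted = psi([0.5, 0.5], [0.4, 0.6])
```

The formula is symmetric in its two arguments, so in this sketch it does not matter which distribution is treated as expected and which as actual.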
Scenarios
PSI is commonly used in risk control, where stability is an important factor in determining the effectiveness of a model. Most risk control models remain in use for more than a year before they are replaced. If a model is not stable over that period, the quality of the decisions based on it may be negatively affected.
Syntax
CREATE FEATURE feature_name WITH ( feature_class = '', x_cols = '', parameters=())
Parameter description:
Parameter | Description |
---|---|
feature_name | The name of the feature. |
feature_class | The type of the feature. Set the value to psi. |
x_cols | The list of independent variables. Separate multiple variables with commas (,). |
parameters | Custom parameters for creating the feature, such as actual_table, predict_table, bins_num, bins_method, and categorical_feature, which are used in the example in this topic. |
Example
/*polar4ai*/CREATE FEATURE psi_001 WITH (
  feature_class = 'psi',
  x_cols = 'Airline,Flight,AirportFrom,AirportTo,DayOfWeek,Time,Length',
  parameters = (
    actual_table = 'airlines_train_1000',
    predict_table = 'airlines_test_1000',
    bins_num = 10,
    bins_method = 'quantile',
    categorical_feature = 'Airline,Flight,AirportFrom,AirportTo,DayOfWeek'
  )
);
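The statement above asks for 10 quantile bins and marks several columns as categorical. The following sketch shows one plausible reading of those settings in plain Python: numeric columns are cut at quantiles of the baseline data, and categorical columns use one bin per category. All function names here are hypothetical, and the binning details (tie handling, the `eps` floor, which table acts as the baseline) are assumptions for illustration, not the documented behavior of the psi feature class.

```python
import math
from bisect import bisect_right
from collections import Counter


def quantile_edges(values, bins_num):
    """Interior cut points that split the baseline into roughly equal-sized bins."""
    s = sorted(values)
    n = len(s)
    edges = []
    for k in range(1, bins_num):
        edge = s[min(n - 1, (k * n) // bins_num)]
        if not edges or edge > edges[-1]:  # skip duplicate cut points
            edges.append(edge)
    return edges


def numeric_props(values, edges):
    """Share of values falling into each bin defined by the cut points."""
    counts = [0] * (len(edges) + 1)
    for v in values:
        counts[bisect_right(edges, v)] += 1
    return [c / len(values) for c in counts]


def categorical_props(values, categories):
    """Share of values per category, in a fixed category order."""
    counts = Counter(values)
    return [counts[c] / len(values) for c in categories]


def psi(p, q, eps=1e-6):
    """Sum over bins of (q - p) * ln(q / p), with empty bins floored at eps."""
    total = 0.0
    for a, b in zip(p, q):
        a, b = max(a, eps), max(b, eps)
        total += (b - a) * math.log(b / a)
    return total


def psi_numeric(base, new, bins_num=10):
    edges = quantile_edges(base, bins_num)  # bins come from the baseline only
    return psi(numeric_props(base, edges), numeric_props(new, edges))


def psi_categorical(base, new):
    cats = sorted(set(base) | set(new))  # one bin per observed category
    return psi(categorical_props(base, cats), categorical_props(new, cats))
```

In this sketch, a numeric column such as Length would go through `psi_numeric` and a categorical column such as Airline through `psi_categorical`, each compared between the two tables named in the statement.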