Regression - IsotonicRegressionPredict(stream) - 《Alink v1.1.2 Document》

Description
Parameters
Script Example
- Code
- Results
  - Model
  - Prediction

Description

Isotonic Regression. Implement parallelized pool adjacent violators algorithm. Support single feature input or vector input(extractor one index of the vector).

Parameters

Name	Description	Type	Required？	Default Value
predictionCol	Column name of prediction.	String	✓

Script Example

Code

data = np.array([[0.35, 1],\
[0.6, 1],\
[0.55, 1],\
[0.5, 1],\
[0.18, 0],\
[0.1, 1],\
[0.8, 1],\
[0.45, 0],\
[0.4, 1],\
[0.7, 0],\
[0.02, 1],\
[0.3, 0],\
[0.27, 1],\
[0.2, 0],\
[0.9, 1]])
df = pd.DataFrame({"feature" : data[:,0], "label" : data[:,1]})
data = dataframeToOperator(df, schemaStr="label double, feature double",op_type="batch")
dataStream = dataframeToOperator(df, schemaStr="label double, feature double",op_type="stream")
trainOp = IsotonicRegTrainBatchOp()\
            .setFeatureCol("feature")\
            .setLabelCol("label")
model = trainOp.linkFrom(data)
predictOp = IsotonicRegPredictStreamOp(model).setPredictionCol("result")
res = predictOp.linkFrom(dataStream)
res.print()

Results

Model

model_id	model_info
0	{“vectorCol”:”\”col2\””,”featureIndex”:”0”,”featureCol”:null}
1048576	[0.02,0.3,0.35,0.45,0.5,0.7]
2097152	[0.5,0.5,0.6666666865348816,0.6666666865348816,0.75,0.75]

Prediction

col1	col2	col3	pred
1.0	0.9	1.0	0.75
0.0	0.7	1.0	0.75
1.0	0.35	1.0	0.6666666865348816
1.0	0.02	1.0	0.5
1.0	0.27	1.0	0.5
1.0	0.5	1.0	0.75
0.0	0.18	1.0	0.5
0.0	0.45	1.0	0.6666666865348816
1.0	0.8	1.0	0.75
1.0	0.6	1.0	0.75
1.0	0.4	1.0	0.6666666865348816
0.0	0.3	1.0	0.5
1.0	0.55	1.0	0.75
0.0	0.2	1.0	0.5
1.0	0.1	1.0	0.5