Description
Remove duplicated records.
Parameters
Name | Description | Type | Required? | Default Value |
---|---|---|---|---|
Script Example
Code
URL = "http://alink-dataset.cn-hangzhou.oss.aliyun-inc.com/csv/iris.csv"
SCHEMA_STR = "sepal_length double, sepal_width double, petal_length double, petal_width double, category string";
data = CsvSourceBatchOp().setFilePath(URL).setSchemaStr(SCHEMA_STR)
data = data.select('category').link(DistinctBatchOp())
data.print()
Result
category
0 Iris-setosa
1 Iris-versicolor
2 Iris-virginica