Spark ml 和 PMML 导出

Spark ml and PMML export

我知道可以将模型导出为 PMMLSpark-MLlib,但是 Spark-ML 呢?

是否可以将 LinearRegressionModelorg.apache.spark.ml.regression 转换为 LinearRegressionModelorg.apache.spark.mllib.regression 以调用 toPMML() 方法?

您可以使用 JPMML-SparkML 库将 Spark ML 管道转换为 PMML:

StructType schema = dataFrame.schema()
PipelineModel pipelineModel = pipeline.fit(dataFrame);
org.dmg.pmml.PMML pmml = org.jpmml.sparkml.ConverterUtil.toPMML(schema, pipelineModel);
JAXBUtil.marshalPMML(pmml, new StreamResult(System.out));