sklearn.pipeline 支持分支吗?

Does sklearn.pipeline support branching?

假设我有两条管道:

pipln1 = Pipeline([("trsfm1",transformer_1),
                   ("trsfm2",transformer_2),
                   ("estmtr1",estimator_1)])

pipln2 = Pipeline([("trsfm1",transformer_1),
                   ("trsfm3",transformer_3),
                   ("estmtr2",estimator_2)])

两个线性流水线共享相同的步骤,trsfm1

是否可以避免计算两次trsfm1

你会喜欢

vect = transformer_1() #Or whatever is meant to do!

vX = vect.fit_transform(Xtrain) 或您应用 transformer_1

的任何适当方式

然后

pipln1 = Pipeline([("trsfm2",transformer_2),
                   ("estmtr1",estimator_1)])

pipln2 = Pipeline([("trsfm3",transformer_3),
                   ("estmtr2",estimator_2)])

然后在 vX

上应用两个 Pipeline