In Structured Streaming checkpointing, why are offsets not committed after foreachBatch?

Question

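import org.apache.spark.sql.DataFrame
import org.apache.spark.sql.streaming.Trigger

df
  .writeStream
  .trigger(Trigger.Once())
  .option(checkpointKey, checkpointVal)
  .foreachBatch { (batchDF: DataFrame, batchId: Long) => /* batch logic goes here */ }
  .start()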

This is the sample code I ran. I observed that Structured Streaming creates the offsets file right at the start of the batch: checkpoints/offsets/3

Why doesn't it wait for foreachBatch to complete and only then write the offsets to the checkpoint directory?

Answer 1

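Each micro-batch behaves like a transaction. The offsets file acts as a write-ahead log: before the batch runs, Spark records the range of offsets the batch is going to process. Only after the batch, including foreachBatch, has finished does it write a matching entry for that batch id under the commits directory of the checkpoint. If the query fails in between, the restart finds an offsets entry without a commit and re-runs that batch over exactly the same offset range, so the offsets file on its own does not mean the data was processed.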
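One way to check this on an existing checkpoint is to compare the batch ids under offsets with those under commits. The following is a minimal sketch, assuming the checkpoint sits on a local filesystem and uses the usual offsets/<batchId> and commits/<batchId> layout. CheckpointInspector and its method names are hypothetical helpers written for this illustration, not part of the Spark API.

```scala
import java.io.File

// Hypothetical helper (not a Spark API): inspects a checkpoint directory
// on the local filesystem.
object CheckpointInspector {

  // Batch ids found in a checkpoint sub-directory such as "offsets" or "commits".
  // Only purely numeric file names are counted, so ".crc" and temp files are skipped.
  def batchIds(checkpointDir: File, subDir: String): Set[Long] = {
    // listFiles() returns null when the directory is missing, hence the Option.
    val files = Option(new File(checkpointDir, subDir).listFiles())
      .getOrElse(Array.empty[File])
    files
      .map(_.getName)
      .filter(name => name.nonEmpty && name.forall(c => c >= '0' && c <= '9'))
      .map(_.toLong)
      .toSet
  }

  // Batches whose offsets were logged but that have no commit entry yet,
  // meaning they were planned but are not known to have finished.
  def plannedButNotCommitted(checkpointDir: File): Seq[Long] =
    (batchIds(checkpointDir, "offsets") -- batchIds(checkpointDir, "commits")).toSeq.sorted
}
```

Called as CheckpointInspector.plannedButNotCommitted(new File("checkpoints")) while foreachBatch for batch 3 is still running, or after it crashed, this would be expected to report batch 3. Once the batch completes and commits/3 appears, the result should be empty.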
