Does Apache Spark have geo-awareness?

I am trying to choose a topology for an Apache Spark cluster that spans different sites. Does Spark have any geo-awareness of its own?

For example, suppose a cluster has workers in Oregon and Penang.

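Now suppose an application is submitted that loads data from Oregon, processes it, and saves it back to Oregon. Will the workers in Oregon be given priority (if they are free)? I have not yet found any documentation on this topic.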

As described here: https://jaceklaskowski.gitbooks.io/mastering-apache-spark/content/spark-data-locality.html

"Spark relies on data locality, aka data placement or proximity to data source, that makes Spark jobs sensitive to where the data is located. It is therefore important to have Spark running on Hadoop YARN cluster if the data comes from HDFS."

The data system may itself be geo-aware, e.g. Cassandra. See "Does Spark use data locality?": http://www.slideshare.net/RussellSpitzer/spark-cassandralocality
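To make the locality point concrete: Spark's scheduler works in terms of process, node, and rack locality rather than geographic sites, and the `spark.locality.wait*` settings control how long it waits for a more local slot before falling back to a less local one. Below is a minimal sketch of passing those settings to `spark-submit` via `--conf`. The property names are real Spark settings, but the values are illustrative assumptions (not tuned recommendations), and `to_submit_args` is a hypothetical helper written for this example.

```python
# Locality-related Spark properties. The keys are standard Spark settings;
# the values are assumptions chosen only for illustration.
locality_conf = {
    # Base wait before giving up on a more local slot (Spark's default is 3s).
    "spark.locality.wait": "10s",
    # Per-level overrides; each defaults to spark.locality.wait when unset.
    "spark.locality.wait.node": "10s",
    "spark.locality.wait.rack": "30s",
}


def to_submit_args(conf):
    """Hypothetical helper: turn a dict of properties into spark-submit flags."""
    args = []
    for key, value in conf.items():
        args += ["--conf", f"{key}={value}"]
    return args


if __name__ == "__main__":
    # Prints the flags to append to a spark-submit command line.
    print(" ".join(to_submit_args(locality_conf)))
```

Longer waits make the scheduler hold out for executors close to the data, at the cost of idle time when no local slot frees up. Whether "close" lines up with a site such as Oregon depends on what the underlying storage and cluster manager report as host and rack locations, so treat this as a knob to experiment with rather than a geo-awareness feature.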