pyspark - Chaining a .orderBy to a .read method

Suppose you have code like this:

df = sqlContext.read.parquet('s3://somebucket/some_parquet_file')

How would you chain an orderBy onto that object? Currently the sort is a separate statement:

df = df.orderBy(df.some_col)

The goal is to write it as a single chained expression, something like this:

df = sqlContext.read.parquet('s3://somebucket/some_parquet_file').orderBy(?.some_col)

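You can specify the column name as a string, or pass a list of strings to sort by several columns, so no reference to the DataFrame variable is needed: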

df = sqlContext.read.parquet('s3://somebucket/some_parquet_file').orderBy("some_col")