pyspark - 将 .orderBy 链接到 .read 方法
pyspark - Chaining a .orderBy to a .read method
假设您有如下代码:
df = sqlContext.read.parquet('s3://somebucket/some_parquet_file')
如何将订单链接到该对象?
df = df.orderBy(df.some_col)
让它变成这样:
df = sqlContext.read.parquet('s3://somebucket/some_parquet_file').orderBy(?.some_col)
您可以将列名称指定为 string or a list of strings:
df = sqlContext.read.parquet('s3://somebucket/some_parquet_file').orderBy("some_col")
假设您有如下代码:
df = sqlContext.read.parquet('s3://somebucket/some_parquet_file')
如何将订单链接到该对象?
df = df.orderBy(df.some_col)
让它变成这样:
df = sqlContext.read.parquet('s3://somebucket/some_parquet_file').orderBy(?.some_col)
您可以将列名称指定为 string or a list of strings:
df = sqlContext.read.parquet('s3://somebucket/some_parquet_file').orderBy("some_col")