How to create a DataFrame of tuples out of a DataFrame of lists in PySpark?
Here is my DataFrame:
my_df.show()
+----------+
| features|
+----------+
| [0,'a'] |
| [1,'b'] |
| [0,'c'] |
| [1,'d'] |
| [2,'e'] |
| [0,'f'] |
+----------+
How can I convert it into a DataFrame of tuples (with the single column 'features')?
I tried:
my_df.map(lambda x: (x[0],x[1]))
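One possible direction, offered as a sketch rather than a confirmed answer: a Spark DataFrame has no Python tuple type, and the closest equivalent is a struct column. The code below assumes 'features' is an array column with exactly two elements. The names `list_to_tuple` and `to_tuple_dataframe` are made up for illustration, and the struct field names `_1` and `_2` are an arbitrary choice. Note also that in Spark 2.0 and later `DataFrame` has no `map` method, so a row-wise lambda would have to go through `my_df.rdd`, where each `x` is a `Row` and the list is `x[0]` (or `x['features']`).

```python
def list_to_tuple(features):
    """Turn one row's two-element list, e.g. [0, 'a'], into the tuple (0, 'a')."""
    return (features[0], features[1])


def to_tuple_dataframe(my_df):
    """Return a DataFrame whose single 'features' column is a two-field struct.

    Assumes my_df has an array column named 'features' with two elements.
    pyspark is imported here so list_to_tuple can be used without Spark.
    """
    from pyspark.sql import functions as F

    # Pick the two array elements by index and name the struct fields.
    first = F.col("features")[0].alias("_1")
    second = F.col("features")[1].alias("_2")

    # Pack both elements into one struct column, keeping the column name.
    return my_df.select(F.struct(first, second).alias("features"))
```

With a running Spark session, `to_tuple_dataframe(my_df).printSchema()` should show 'features' as a struct instead of an array. `list_to_tuple` is the plain-Python version of the same step, usable inside `my_df.rdd.map(...)` if an RDD of tuples is what you need.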