Python: Opening a set of dictionaries as a dataframe
This is most likely a fairly basic question, but
I have the following raw data file:
[{"column1":"value1","column2":"value2","column3":value3,},{"column1":"value4","column2":"value5","column3":value6},{"column1":"value7","column2":"value8","column3":value9}]
I need to load it as a dataframe of the following form:
column1 column2 column3
value1 value2 value3
value4 value5 value6
value7 value8 value9
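Since the file is large (1.2 million values across 3 columns), what is the fastest and most convenient way to open it?
Any suggestions on what I should look into would be much appreciated!
Thanks!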
You can use pandas' from_dict function.
import pandas as pd
dat = [{"column1":"value1","column2":"value2","column3":"value3",},{"column1":"value4","column2":"value5","column3":"value6"},{"column1":"value7","column2":"value8","column3":"value9"}]
print(pd.DataFrame.from_dict(dat))
Result:
column1 column2 column3
0 value1 value2 value3
1 value4 value5 value6
2 value7 value8 value9
From the pandas documentation (https://pandas.pydata.org/pandas-docs/version/0.21/generated/pandas.DataFrame.html):
d = {'col1': [1, 2], 'col2': [3, 4]}
df = pd.DataFrame(data=d)
df
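Since the question is about opening a file rather than an in-memory list, here is a minimal sketch of one way to do that. It assumes the file holds valid JSON (every value quoted, no trailing comma, unlike the sample in the question); the file name data.json and the helper load_records are hypothetical names used only for illustration.

```python
import json
import os
import tempfile

import pandas as pd

# Hypothetical sample standing in for the large raw file. The values are
# quoted here so that the content is valid JSON (an assumption).
records = [
    {"column1": "value1", "column2": "value2", "column3": "value3"},
    {"column1": "value4", "column2": "value5", "column3": "value6"},
    {"column1": "value7", "column2": "value8", "column3": "value9"},
]


def load_records(path):
    """Parse a JSON array of objects and build one DataFrame row per object."""
    with open(path, encoding="utf-8") as fh:
        data = json.load(fh)  # list of dicts
    return pd.DataFrame(data)


# Write the sample to a temporary file, then read it back as a dataframe.
with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "data.json")  # hypothetical file name
    with open(path, "w", encoding="utf-8") as fh:
        json.dump(records, fh)
    df = load_records(path)

print(df)
```

If the file really is valid JSON, pd.read_json(path) may be a shorter alternative; which of the two is faster on 1.2 million values is something to measure rather than assume.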