无法将字符串转换为浮点数 - Python Dash

Can't convert string to float - Python Dash

我试图在 Dash 回调中将字符串转换为浮点数,但是当我 运行 我的代码出现在我的 Dash 应用程序错误中:lati = float(lati[-1 ]) ValueError: could not convert string to float: 'float64) 不过我在终端中没有收到这个错误。

首先我需要做的是提取给定的纬度(和经度)数。因此我需要它将它转换为字符串并拆分它,因为我找不到更好的方法来使用 pandas.

从 csv 文件中获取此数字

输出:

# converting to string:
12    41.6796
Name: latitude, dtype: float64

# splitting:
['12', '', '', '', '41.6796']

# converting to float:
41.6796

这是实际代码:

@app.callback(Output('text-output', 'children'),
    [Input('submit-val', 'n_clicks')],
    [State('search-input', 'value')])
def updateText(n_clicks, searchVar):
    df = pd.read_csv("powerplant.csv")
    df = df[df.name == searchVar]

    # converting to string
    lati = str(df['latitude'])
    longi = str(df['longitude'])

    # splitting it
    lati = lati.split('\n', 1)
    lati = lati[0].split(' ', 4)
    longi = longi.split('\n', 1)
    longi = longi[0].split(' ', 4)

    #converting to float
    lati = float(lati[-1])
    longi = float(longi[-1]) 

我实际上在其他脚本中测试了这段代码,它工作得很好。有什么更好的方法可以提取纬度和经度数字?

数据可以从https://datasets.wri.org/dataset/globalpowerplantdatabase;下载,这里是摘录。

country,country_long,name,gppd_idnr,capacity_mw,latitude,longitude,primary_fuel,other_fuel1,other_fuel2,other_fuel3,commissioning_year,owner,source,url,geolocation_source,wepp_id,year_of_capacity_data,generation_gwh_2013,generation_gwh_2014,generation_gwh_2015,generation_gwh_2016,generation_gwh_2017,estimated_generation_gwh
AFG,Afghanistan,Kajaki Hydroelectric Power Plant Afghanistan,GEODB0040538,33.0,32.3220,65.1190,Hydro,,,,,,GEODB,http://globalenergyobservatory.org,GEODB,1009793,2017,,,,,,
AFG,Afghanistan,Mahipar Hydroelectric Power Plant Afghanistan,GEODB0040541,66.0,34.5560,69.4787,Hydro,,,,,,GEODB,http://globalenergyobservatory.org,GEODB,1009795,2017,,,,,,
ALB,Albania,Shkopet,WRI1002173,24.0,41.6796,19.8305,Hydro,,,,1963.0,,Energy Charter Secretariat,http://www.energycharter.org/fileadmin/DocumentsMedia/IDEER/IDEER-Albania_2013_en.pdf,GEODB,1021238,,,,,,,79.22851153039832
ALB,Albania,Ulez,WRI1002174,25.0,41.6796,19.8936,Hydro,,,,1958.0,,Energy Charter Secretariat,http://www.energycharter.org/fileadmin/DocumentsMedia/IDEER/IDEER-Albania_2013_en.pdf,GEODB,1021241,,,,,,,82.52969951083159

您正在查看的是一个 pandas.Series 对象,其中包含单行数据,并且您正试图将其 __repr__ 拆分以获得值。没有这个必要。我不熟悉 plotly 的 Python 版本,但我看到你有一个回调,所以我把它包装成一个函数(我不确定是否存在名称不能的情况找到):

import pandas as pd

def get_by_name(name):
    df = pd.read_csv('powerplants.csv')
    df = df[df['name'] == name]
    if not df.empty:
        return df[['latitude', 'longitude']].values.tolist()[0]
    return None, None


lat, lon = get_by_name('Kajaki Hydroelectric Power Plant Afghanistan')

问题在于您访问数据框中的值的方式。 Pandas 允许您访问数据而无需解析字符串表示形式。

您可以一次访问 .loc 的行和列 如果你知道你将有一个单一的价值,你可以调用 squeeze 方法

>>> import pandas as pd
>>> from io import StringIO
>>> # data shortened for brievity
>>> df = pd.read_csv(StringIO("""country,country_long,name,gppd_idnr,capacity_mw,latitude,longitude
... AFG,Afghanistan,Kajaki Hydroelectric Power Plant Afghanistan,GEODB0040538,33.0,32.3220,65.1190
... AFG,Afghanistan,Mahipar Hydroelectric Power Plant Afghanistan,GEODB0040541,66.0,34.5560,69.4787
... ALB,Albania,Shkopet,WRI1002173,24.0,41.6796,19.8305
... ALB,Albania,Ulez,WRI1002174,25.0,41.6796,19.8936"""))
>>> searchVar = "Ulez"
>>> df.loc[df["name"] == searchVar, "latitude"] # here you have a pd.Series
3    41.6796
Name: latitude, dtype: float64
>>> df.loc[df["name"] == searchVar, "latitude"].squeeze() # here you have a scalar
41.6796
>>> df.loc[df["name"] == searchVar, "longitude"].squeeze()
19.8936

如果由于某种原因你有几行同名,你会得到一个系列而不是一个标量。但也许在这种情况下,您想要的是失败,而不是传递不明确的数据。