How to get a tag with an attribute containing a nbsp ("non-breaking space")?
Hello, I am trying to get a tag whose attribute value contains a non-breaking space (nbsp), but when I do this:
a_url=soup.find_all('a', {"aria-label":"Siguiente "})
a_url
it returns an empty list.
How can I get the actual value?
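If you are looking for the href of the "next page" button, you can get it with the following selector (prerequisite: it is the only matching element on the page). The `*=` operator matches any attribute value that contains the given substring, so the non-breaking space does not have to be typed at all: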
soup.select_one('a[aria-label*="Siguiente"]')['href']
Example
from bs4 import BeautifulSoup
soup = BeautifulSoup('''<a href="/jobs?q=ingeniero&start=30&pp=gQAtAAAAAAAAAAAAAAABtmO50gBfAQEBChquI8zsEUMb97LmWiIyJ6B9BupjjNHe0wHVJkxir7vk5faUnfGbH8SIKViz3xGntfsggaFcG0AVf914ketkZJK-TUcyKlIrQmiKVG-Mkh5cMa0vUE4tVGeMixwAAA" aria-label="Siguiente »" data-pp="gQAtAAAAAAAAAAAAAAABtmO50gBfAQEBChquI8zsEUMb97LmWiIyJ6B9BupjjNHe0wHVJkxir7vk5faUnfGbH8SIKViz3xGntfsggaFcG0AVf914ketkZJK-TUcyKlIrQmiKVG-Mkh5cMa0vUE4tVGeMixwAAA" onmousedown="addPPUrlParam && addPPUrlParam(this);" rel="nofollow"><span class="pn"><span class="np"><svg width="24" height="24" fill="none"><path d="M10 6L8.59 7.41 13.17 12l-4.58 4.59L10 18l6-6-6-6z" fill="#2D2D2D"></path></svg></span></span></a>''', "lxml")
soup.select_one('a[aria-label*="Siguiente"]')['href']
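If an exact match on the attribute is preferred, the non-breaking space can be written explicitly as `\xa0` (U+00A0). The following is only a minimal sketch: the shortened HTML snippet is hypothetical, it assumes the character between "Siguiente" and "»" really is U+00A0, and it uses the built-in `html.parser` instead of `lxml` so that it runs without extra dependencies.

```python
import re
from bs4 import BeautifulSoup

# Hypothetical, shortened markup; \xa0 is the non-breaking space (assumption).
html = '<a href="/jobs?start=30" aria-label="Siguiente\xa0»">next</a>'
soup = BeautifulSoup(html, "html.parser")

# Exact match: spell the non-breaking space as \xa0 in the search string.
exact = soup.find_all("a", {"aria-label": "Siguiente\xa0»"})

# A regular space (U+0020) is a different character, so nothing matches.
plain = soup.find_all("a", {"aria-label": "Siguiente »"})

# Prefix match with a regular expression, ignoring whatever follows.
by_regex = soup.find_all("a", attrs={"aria-label": re.compile(r"^Siguiente")})

# Prefix match with a CSS attribute selector (^= means "starts with").
by_css = soup.select_one('a[aria-label^="Siguiente"]')

print(exact[0]["href"], len(plain), by_regex[0]["href"], by_css["href"])
```

The prefix variants are usually the more robust choice, since they keep working if the site changes the trailing characters of the label.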