如何获取带有 nbsp ("non breaking space") 属性的标签?

How to get tag with attribute containing a nbsp ("non breaking space")?

您好,我正在尝试获取一个包含不间断值的标签-space (nbsp) 但是当我这样做时:

a_url=soup.find_all('a', {"aria-label":"Siguiente "})

a_url return 空列表

我怎样才能得到真正的价值=?

如果您正在搜索“下一页按钮”的 href,您可以通过这些 selector 获得它(先决条件 - 它是唯一的 @ website):

soup.select_one('a[aria-label*="Siguiente"]')['href'] 

例子

from bs4 import BeautifulSoup

soup = BeautifulSoup('''<a href="/jobs?q=ingeniero&amp;start=30&amp;pp=gQAtAAAAAAAAAAAAAAABtmO50gBfAQEBChquI8zsEUMb97LmWiIyJ6B9BupjjNHe0wHVJkxir7vk5faUnfGbH8SIKViz3xGntfsggaFcG0AVf914ketkZJK-TUcyKlIrQmiKVG-Mkh5cMa0vUE4tVGeMixwAAA" aria-label="Siguiente&nbsp;»" data-pp="gQAtAAAAAAAAAAAAAAABtmO50gBfAQEBChquI8zsEUMb97LmWiIyJ6B9BupjjNHe0wHVJkxir7vk5faUnfGbH8SIKViz3xGntfsggaFcG0AVf914ketkZJK-TUcyKlIrQmiKVG-Mkh5cMa0vUE4tVGeMixwAAA" onmousedown="addPPUrlParam &amp;&amp; addPPUrlParam(this);" rel="nofollow"><span class="pn"><span class="np"><svg width="24" height="24" fill="none"><path d="M10 6L8.59 7.41 13.17 12l-4.58 4.59L10 18l6-6-6-6z" fill="#2D2D2D"></path></svg></span></span></a>''', "lxml")

soup.select_one('a[aria-label*="Siguiente"]')['href']