How do I get the elements in a group? Python
I need to get the href of each element in the list. How can I do that with bs4?
<div class="group">
<a href="link1" target="_blank" rel="rel" class="class">
<h1>"test"</h1>
</a>
<a href="link2" target="_blank" rel="rel" class="class">
<h1>"test2"</h1>
</a>
<a href="link3" target="_blank" rel="rel" class="class">
<h1>"test3"</h1>
</a> </div>
Select the <div> with class group and iterate over its <a> elements, for example with CSS selectors and a list comprehension:
[x['href'] for x in soup.select('div.group a')]
Example
from bs4 import BeautifulSoup
html = '''
<div class="group">
<a href="link1" target="_blank" rel="rel" class="class">
<h1>"test"</h1>
</a>
<a href="link2" target="_blank" rel="rel" class="class">
<h1>"test2"</h1>
</a>
<a href="link3" target="_blank" rel="rel" class="class">
<h1>"test3"</h1>
</a>
</div>
'''
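# Name the parser explicitly; otherwise bs4 picks one and emits a warning.
soup = BeautifulSoup(html, 'html.parser')
# Collect the href of every <a> nested inside <div class="group">.
print([x['href'] for x in soup.select('div.group a')])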
Output
['link1', 'link2', 'link3']
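Note that x['href'] raises a KeyError if an <a> in the group has no href attribute. The following is a minimal sketch of two ways to guard against that. The extra anchor without an href is a made-up addition to the question's markup, included purely for illustration.

```python
from bs4 import BeautifulSoup

# Hypothetical variant of the question's HTML: the middle <a> has no href.
html = '''
<div class="group">
<a href="link1" target="_blank" rel="rel" class="class">
<h1>"test"</h1>
</a>
<a target="_blank" rel="rel" class="class">
<h1>"no href here"</h1>
</a>
<a href="link2" target="_blank" rel="rel" class="class">
<h1>"test2"</h1>
</a>
</div>
'''

soup = BeautifulSoup(html, 'html.parser')

# Option 1: the attribute selector a[href] matches only anchors that have an href.
hrefs_css = [a['href'] for a in soup.select('div.group a[href]')]

# Option 2: the same result with find / find_all; href=True filters on presence.
group = soup.find('div', class_='group')
hrefs_find = [a['href'] for a in group.find_all('a', href=True)] if group else []

print(hrefs_css)
print(hrefs_find)
```

Both lists contain only the anchors that actually carry a link. The second form also returns an empty list when the page has no div with class group at all.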