如何获取组中的元素? Python

how get elements in group? Python

我需要获取列表中每个元素的 href,我该怎么做?在 bs4

<div class="group">

<a href="link1" target="_blank" rel="rel" class="class">
    <h1>"test"</h1>
</a>
    <a href="link2" target="_blank" rel="rel" class="class">
        <h1>"test2"</h1>
    </a>
<a href="link3" target="_blank" rel="rel" class="class">
    <h1>"test3"</h1>
</a>            </div>

Select <div> 和 class 组并迭代其 <a> 例如 css selectorslist comprehension:

[x['href'] for x in soup.select('div.group a')]

示例

from bs4 import BeautifulSoup

html = '''
<div class="group">
<a href="link1" target="_blank" rel="rel" class="class">
    <h1>"test"</h1>
</a>
    <a href="link2" target="_blank" rel="rel" class="class">
        <h1>"test2"</h1>
    </a>
<a href="link3" target="_blank" rel="rel" class="class">
    <h1>"test3"</h1>
</a>
</div>
'''
soup = BeautifulSoup(html)

[x['href'] for x in soup.select('div.group a')]

输出

['link1', 'link2', 'link3']