如何使用 Beautifulsoup 废弃锚标签的文本?
How do I scrap the text of anchor tag using Beautifulsoup?
<span class="companyName">
<a data-tn-element="companyName" class="turnstileLink companyOverviewLink" target="_blank" href="/cmp/Lush-Cosmetics" rel="noopener">LUSH Cosmetics</a>
</span>
我这样做了 -
item.find('span', class_='companyName').find('a').text
但它给了我错误 -
item.find('span', class_='companyName').find('a').text
AttributeError: 'NoneType' 对象没有属性 'text'
这应该可以解决问题
item.find('span',{'class':'companyName'}).find('a').text
工作代码
from bs4 import BeautifulSoup
content = '''
<div class="job_seen_beacon">
<span class="companyName">
<a data-tn-element="companyName" class="turnstileLink companyOverviewLink" target="_blank" href="/cmp/Lush-Cosmetics" rel="noopener">LUSH Cosmetics</a>
</span></div>
'''
soup = BeautifulSoup(content,features="lxml")
anchorText = soup.find('span',{'class':'companyName'}).find('a').text
print(anchorText)
<span class="companyName">
<a data-tn-element="companyName" class="turnstileLink companyOverviewLink" target="_blank" href="/cmp/Lush-Cosmetics" rel="noopener">LUSH Cosmetics</a>
</span>
我这样做了 - item.find('span', class_='companyName').find('a').text
但它给了我错误 - item.find('span', class_='companyName').find('a').text AttributeError: 'NoneType' 对象没有属性 'text'
这应该可以解决问题
item.find('span',{'class':'companyName'}).find('a').text
工作代码
from bs4 import BeautifulSoup
content = '''
<div class="job_seen_beacon">
<span class="companyName">
<a data-tn-element="companyName" class="turnstileLink companyOverviewLink" target="_blank" href="/cmp/Lush-Cosmetics" rel="noopener">LUSH Cosmetics</a>
</span></div>
'''
soup = BeautifulSoup(content,features="lxml")
anchorText = soup.find('span',{'class':'companyName'}).find('a').text
print(anchorText)