如何从 BS4 输出中制作列表
How to make a list out of a BS4 output
我现在有这个代码:
from bs4 import BeautifulSoup
import requests
get = requests.get("https://solmfers-minting-site.netlify.app/")
soup = BeautifulSoup(get.text, 'html.parser')
for i in soup.find_all('script'):
print(i.get('src'))
我需要以某种方式将输出转换为列表并从中删除 None
值,因为它输出如下:
jquery.js
nicepage.js
None
None
/static/js/2.c20455e8.chunk.js
/static/js/main.87864e1d.chunk.js
只需将提取的值附加到列表中即可。
result = []
for i in soup.find_all('script'):
elem = i.get('src')
if elem is not None:
result.append(elem)
或者使用列表理解:
result = [x['src'] for x in soup.find_all('script') if x.get('src') is not None]
您接近您的目标,但 select 您的元素更具体,并在迭代 ResultSet
:
时将 src
附加到列表
data = []
for i in soup.find_all('script', src=True):
data.append(i.get('src'))
替代 css selectors
:
for i in soup.select('script[src]'):
data.append(i.get('src'))
正如 list comprehension
中提到的那样:
[i.get('src') for i in soup.select('script[src]')]
输出
['jquery.js', 'nicepage.js', '/static/js/2.c20455e8.chunk.js', '/static/js/main.87864e1d.chunk.js']
我现在有这个代码:
from bs4 import BeautifulSoup
import requests
get = requests.get("https://solmfers-minting-site.netlify.app/")
soup = BeautifulSoup(get.text, 'html.parser')
for i in soup.find_all('script'):
print(i.get('src'))
我需要以某种方式将输出转换为列表并从中删除 None
值,因为它输出如下:
jquery.js
nicepage.js
None
None
/static/js/2.c20455e8.chunk.js
/static/js/main.87864e1d.chunk.js
只需将提取的值附加到列表中即可。
result = []
for i in soup.find_all('script'):
elem = i.get('src')
if elem is not None:
result.append(elem)
或者使用列表理解:
result = [x['src'] for x in soup.find_all('script') if x.get('src') is not None]
您接近您的目标,但 select 您的元素更具体,并在迭代 ResultSet
:
src
附加到列表
data = []
for i in soup.find_all('script', src=True):
data.append(i.get('src'))
替代 css selectors
:
for i in soup.select('script[src]'):
data.append(i.get('src'))
正如 list comprehension
中提到的那样:
[i.get('src') for i in soup.select('script[src]')]
输出
['jquery.js', 'nicepage.js', '/static/js/2.c20455e8.chunk.js', '/static/js/main.87864e1d.chunk.js']