Python: 如何监控所有可选子产品的可用性电子商务产品?
Python: how to monitor availability ecommerce products for all sub products selectable?
我想抓取以下产品https://www.decathlon.it/p/disco-ghisa-bodybuilding-28mm/_/R-p-7278?mc=1042303&c=NERO
但对于产品,我们可以 select 不同的重量(从 0.5 到 20 公斤)。我创建了以下代码,但它只给我第一个重量 (0,5kg) 而不是另一个。
import requests
import re
import time
urls = ['p/disco-ghisa-bodybuilding-28mm/_/R-p-7278.html']
user_agent = {'User-agent': 'Mozilla/5.0'}
def main(site):
with requests.Session() as req:
for url in urls:
r = req.get(site.format(url), headers=user_agent)
match = re.search('availability.+org\/(.*?)"', r.text)
print("url: {:<70}, status: {}".format(r.url, match.group(1)))
while True:
main("https://www.decathlon.it/{}")
time.sleep(1)
但是我想弄清楚下面的输出
weight 0,5kg outofstock, 1kg outofstock and so on.
您可能应该检查 BeautifulSoup python 库和来自此 link or use Selenium just to change option from dropdown menu what you can learn more about here https://www.guru99.com/select-option-dropdown-selenium-webdriver.html
的讨论
我想抓取以下产品https://www.decathlon.it/p/disco-ghisa-bodybuilding-28mm/_/R-p-7278?mc=1042303&c=NERO
但对于产品,我们可以 select 不同的重量(从 0.5 到 20 公斤)。我创建了以下代码,但它只给我第一个重量 (0,5kg) 而不是另一个。
import requests
import re
import time
urls = ['p/disco-ghisa-bodybuilding-28mm/_/R-p-7278.html']
user_agent = {'User-agent': 'Mozilla/5.0'}
def main(site):
with requests.Session() as req:
for url in urls:
r = req.get(site.format(url), headers=user_agent)
match = re.search('availability.+org\/(.*?)"', r.text)
print("url: {:<70}, status: {}".format(r.url, match.group(1)))
while True:
main("https://www.decathlon.it/{}")
time.sleep(1)
但是我想弄清楚下面的输出
weight 0,5kg outofstock, 1kg outofstock and so on.
您可能应该检查 BeautifulSoup python 库和来自此 link