How to extract data with selenium webdriver

Hello, I am trying to extract the odds from this web page: https://www.netbet.fr/derniere-minute?filter=13

Here is my Python script:

#!/usr/bin/python3
# -*- coding: utf-8 -*-

from selenium import webdriver
from selenium.webdriver.chrome.options import Options
from selenium.webdriver.support.ui import WebDriverWait
from selenium.webdriver.common.by import By
from selenium.webdriver.support import expected_conditions as EC
import os

options = Options()
options.add_argument("--headless")  # the options.headless setter was deprecated and later removed in Selenium 4
options.add_argument("window-size=1400,800")
options.add_argument("--no-sandbox")
options.add_argument("--disable-gpu")
options.add_argument("start-maximized")
options.add_argument("enable-automation")
options.add_argument("--disable-infobars")
options.add_argument("--disable-dev-shm-usage")

driver = webdriver.Chrome(options=options)

driver.get('https://www.netbet.fr/derniere-minute?filter=13')

odds = [my_elem.text for my_elem in WebDriverWait(driver, 10).until(EC.visibility_of_all_elements_located((By.XPATH, '//div[contains(@class, "nb-odds_amount")]')))]

print(odds, '\n')

driver.close()
driver.quit()

The output is:

Traceback (most recent call last):
  File "./azerty.py", line 31, in <module>
    odds = [my_elem.text for my_elem in WebDriverWait(driver, 10).until(EC.visibility_of_all_elements_located((By.XPATH, '//div[contains(@class, "nb-odds_amount")]')))]
  File "/usr/local/lib/python3.8/dist-packages/selenium/webdriver/support/wait.py", line 80, in until
    raise TimeoutException(message, screen, stacktrace)
selenium.common.exceptions.TimeoutException: Message: 

This script works perfectly with other web pages, but not with this one. Any help would be appreciated, thanks.

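Some of the matched elements are hidden, and that is the problem. visibility_of_all_elements_located only succeeds once every element matching the locator is visible. Because some of them stay hidden, the condition never becomes true and the wait gives up after the 10-second timeout with a TimeoutException. Wait for presence instead of visibility by using presence_of_all_elements_located: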

odds = [my_elem.text for my_elem in WebDriverWait(driver, 10).until(EC.presence_of_all_elements_located((By.XPATH, '//div[contains(@class, "nb-odds_amount")]')))]
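
One thing to watch for with the presence-based wait: Selenium's .text returns an empty string for elements that are not displayed, so the resulting list may contain blank entries for the hidden odds. Below is a minimal sketch of one way to filter those out, assuming you only want the odds a user can actually see. collect_visible_text is a hypothetical helper name used for illustration, not part of Selenium; it relies only on the is_displayed() method and .text attribute that WebElement provides.

```python
def collect_visible_text(elements):
    """Return the stripped text of each displayed element, skipping hidden or blank ones.

    `elements` is any iterable of Selenium WebElement objects (or objects
    exposing the same is_displayed() method and .text attribute).
    """
    texts = []
    for elem in elements:
        if not elem.is_displayed():
            continue  # hidden elements report an empty .text anyway
        text = elem.text.strip()
        if text:  # drop entries that are empty after stripping whitespace
            texts.append(text)
    return texts
```

With the driver from the question, you would pass it the list returned by the presence_of_all_elements_located wait, for example odds = collect_visible_text(elems). If you need the hidden odds as well, reading elem.get_attribute("textContent") instead of elem.text should return the text regardless of visibility.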