您将如何使用 Python 解析此 HTML table？

Question

我正在尝试在 Python 2.7.

中创建抓取脚本

请求没问题，但我很难用 Beautiful soup 解析这个 table。我试了很多，在论坛上搜索了很多，但对我来说没什么用，我第一次这样做。

这是代码：

 import requests, os 
 from bs4 import BeautifulSoup  

 url='http://fse.vdkruijssen.eu/ferrylist.php' params={'selectplane':'Cessna 208 Caravan','submit':''}
 response=requests.post(url, data=params) 

 soup = BeautifulSoup(response.text, "html5lib")
 table=soup.find('table')
 print table

但这不会返回任何 table。我正在尝试至少检索第一列和最后一列。

Answer 1

soup = BeautifulSoup(response.text, "lxml")

将解析器更改为 lxml

Beautiful Soup 支持 Python 标准库中包含的 HTML 解析器，但它也支持许多 third-party Python 解析器。一个是 lxml 解析器。根据您的设置，您可以使用以下命令之一安装 lxml：

$ apt-get install python-lxml

$ easy_install lxml

$ pip install lxml

默认情况下，BS4 使用 lxml 解析器。

您将如何使用 Python 解析此 HTML table？

How would you parse this HTML table using Python?

beautifulsoup

html-parsing

python-2.7