从（相对）父文件夹导入 BeautifulSoup，并在其旁边导入 html 解析器

Question

这是我的文件夹树：

script/
    main.py
    dependencies/
        bs4/
            ...
        requests/
            ...

以下是我从 main.py:

导入模块的方式

import dependencies.requests as requests
import dependencies.bs4 as bs4

脚本似乎工作得很好，但我不知道如何将 html 解析器添加到 dependencies\ 中以与 BeautifulSoup 一起工作。所以目前在 main.py 我发出以下 GET 请求：

response = requests.get(url)

然后尝试解析它：

parsed_html = bs4.BeautifulSoup(response.content, "lxml")

然后我得到以下异常：

File "C:\Users\usr\Desktop\script\dependencies\bs4\__init__.py", line 165, in __init__
% ",".join(features))

dependencies.bs4.FeatureNotFound: Couldn't find a tree builder with the features you requested: lxml. Do you need to install a parser library?

如有任何帮助，我们将不胜感激。

另外，默认 "html.parser" 抛出相同的异常。

Answer 1

您应该将 PYTHONPATH 扩展到安装 lxml 或 html.parser 的文件夹。这样，"python" 就会知道在哪个特定位置搜索您的包裹。

在任何一种情况下，您都在创建自己的具有依赖关系的文件夹结构，因为有一些工具可以做到这一点，例如 virtualenv。

从（相对）父文件夹导入 BeautifulSoup，并在其旁边导入 html 解析器

Importing BeautifulSoup from a (relative) parent folder, and a html parser alongside it

python

lxml

beautifulsoup

html-parsing

python-3.x