使用 aiohttp 转储请求 headers

Dumping the request headers with aiohttp

我想显示请求的所有 HTTP headers(我添加的和自动生成的)。我尝试使用痕迹 (https://aiohttp.readthedocs.io/en/stable/tracing_reference.html#aiohttp-client-tracing-reference) :

#!/usr/bin/env python3                                                                                                      

import aiohttp
import asyncio

async def on_request_start(session, trace_config_ctx, params):
    print("Starting %s request for %s. I will send: %s" % (params.method, params.url, params.headers))

async def on_request_end(session, trace_config_ctx, params):
    print("Ending %s request for %s. I sent: %s" % (params.method, params.url, params.headers))

async def fetch(session, url):
    async with session.get(url) as response:
        return response

async def main():
    trace_config = aiohttp.TraceConfig()
    trace_config.on_request_start.append(on_request_start)
    trace_config.on_request_end.append(on_request_end)
    async with aiohttp.ClientSession(trace_configs=[trace_config]) as session:
        r = await fetch(session, 'http://whosebug.com')
        print(r)

loop = asyncio.get_event_loop()
loop.run_until_complete(main())

使用这段代码,我得到了方法和 URL 但是 headers 的字典总是空的:

% ./test-debug.py
Starting GET request for http://whosebug.com. I will send: <CIMultiDict()>
Ending GET request for https://whosebug.com/. I sent: <CIMultiDict()>

我错过了什么?

Python 3.7.2

% pip show aiohttp
Name: aiohttp
Version: 3.5.4
Summary: Async http client/server framework (asyncio)
Home-page: https://github.com/aio-libs/aiohttp
Author: Nikolay Kim
Author-email: fafhrd91@gmail.com
License: Apache 2
Location: /usr/lib/python3.7/site-packages
Requires: async-timeout, attrs, multidict, yarl, chardet
Required-by: 

我也一样:

$ ./test-debug.py
Starting GET request for http://whosebug.com. I will send: <CIMultiDict()>
Ending GET request for https://whosebug.com/. I sent: <CIMultiDict()>
<ClientResponse(https://whosebug.com/) [200 OK]>
<CIMultiDictProxy('Cache-Control': 'private', 'Content-Type': 'text/html; charset=utf-8', 'Content-Encoding': 'gzip', 'X-Frame-Options': 'SAMEORIGIN', 'X-Request-Guid': 'c89dd68d-cb88-43c1-b08d-f2a07bf81043', 'Strict-Transport-Security': 'max-age=15552000', 'Content-Security-Policy': 'upgrade-insecure-requests', 'Content-Length': '52698', 'Accept-Ranges': 'bytes', 'Date': 'Tue, 15 Jan 2019 08:06:32 GMT', 'Via': '1.1 varnish', 'Connection': 'keep-alive', 'X-Served-By': 'cache-cdg20748-CDG', 'X-Cache': 'MISS', 'X-Cache-Hits': '0', 'X-Timer': 'S1547539592.382231,VS0,VE120', 'Vary': 'Accept-Encoding,Fastly-SSL', 'X-DNS-Prefetch-Control': 'off')>

$ python --version
Python 3.7.1

$ python -c "import aiohttp; print(aiohttp.__version__)"
3.4.4

如果我明确地向 ClientSession 添加 header,

    async with aiohttp.ClientSession(trace_configs=[trace_config], headers={"Host": "whosebug.com"}) as session: 

我在跟踪中看到了它:

$ ./test-debug.py
Starting GET request for http://whosebug.com. I will send: <CIMultiDict('Host': 'whosebug.com')>
Ending GET request for http://whosebug.com. I sent: <CIMultiDict('Host': 'whosebug.com')>

仔细阅读库源代码后,request_start为时过早,它甚至在请求object创建之前就被调用了,所以它永远看不到完整的请求及其headers;定时器启动后循环发送东西。

但是在 request_end 中您可以访问完整的响应 object,它与请求 object 相关联,因此所有 headers.

有了这个变化:

async def on_request_end(session, trace_config_ctx, params):
    print("Ending %s request for %s. I sent: %s" % (params.method, params.url, params.headers))
    print('Sent headers: %s' % params.response.request_info.headers)

我得到:

Sent headers: <CIMultiDictProxy('Host': 'whosebug.com', 'Accept': '*/*', 'Accept-Encoding': 'gzip, deflate', 'User-Agent': 'Python/3.7 aiohttp/3.5.4', 'Cookie': 'prov=f4fad342-c1f7-bcc2-5d25-0e30ae5cdbf6')>

您可能还需要查看 params.response.history 以防重定向。它是 ClientResponse object 的序列,因此您应该能够对它们中的每一个调用 request_info.headers