Puppeteer: Grabbing HTML from a page that doesn't refresh after an input tag button is clicked
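I am trying to grab some HTML after clicking an input tag button. I click the button with page.evaluate() because page.click() does not seem to work for an input tag button. I have tried visual debugging with headless: false in the puppeteer launch options to verify that the browser does reach that state after the button is clicked. I am not sure why page.content() returns the HTML from before the button click rather than the HTML after the event has occurred.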
const puppeteer = require('puppeteer');
const url = 'http://www.yvr.ca/en/passengers/flights/departing-flights';
const fs = require('fs');
const tomorrowSelector = '#flights-toggle-tomorrow'
puppeteer.launch().then(async browser => {
const page = await browser.newPage();
await page.goto(url);
await page.evaluate((selector)=>document.querySelector(selector).click(),tomorrowSelector);
let html = await page.content();
await fs.writeFile('index.html', html, function(err){
if (err) console.log(err);
console.log("Successfully Written to File.");
});
await browser.close();
});
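You can click the label of the radio button instead. You also need to wait for some sign that the state has changed (an XHR/fetch response or a new selector, for example). The following code works for me, but you can use any other condition, or simply wait a few seconds.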
const fs = require('fs');
const puppeteer = require('puppeteer');
const url = 'http://www.yvr.ca/en/passengers/flights/departing-flights';
const tomorrowLabelSelector = 'label[for=flights-toggle-tomorrow]';
const tomorrowLabelSelectorChecked = '.yvr-form__toggle:checked + label[for=flights-toggle-tomorrow]';
puppeteer.launch({ headless: false }).then(async (browser) => {
const page = await browser.newPage();
await page.goto(url);
await Promise.all([
page.click(tomorrowLabelSelector),
page.waitForSelector(tomorrowLabelSelectorChecked),
]);
const html = await page.content();
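try {
// The callback form of fs.writeFile returns undefined, so awaiting it does nothing;
// the promise API actually waits for the write and reports success only when it succeeds.
await fs.promises.writeFile('index.html', html);
console.log('Successfully Written to File.');
} catch (err) {
console.log(err);
}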
// await browser.close();
});
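The other condition mentioned above, waiting for an XHR/fetch response, could look roughly like the sketch below. It is only an illustration: the helper name `clickAndGrabHtml` and the predicate passed to it are hypothetical, and which request the page actually makes when the toggle is clicked is an assumption you would need to check in the browser's network tab. `page.waitForResponse()`, `page.click()` and `page.content()` are regular Puppeteer `Page` methods.

```javascript
// Sketch: click an element, wait for a matching network response, then read the HTML.
// `page` is expected to be a Puppeteer Page; `isTargetResponse` is a caller-supplied
// predicate (hypothetical here) that receives each response and returns true for the
// one that signals the state change.
async function clickAndGrabHtml(page, clickSelector, isTargetResponse) {
  // Start waiting before clicking so a fast response is not missed.
  const [response] = await Promise.all([
    page.waitForResponse(isTargetResponse),
    page.click(clickSelector),
  ]);
  // Note: the page may still need a moment to render the response into the DOM,
  // so an extra page.waitForSelector() may be required in practice.
  const html = await page.content();
  return { status: response.status(), html };
}
```

With the selector from the answer it might be called as `await clickAndGrabHtml(page, 'label[for=flights-toggle-tomorrow]', (res) => res.url().includes('flights'))`, where the `'flights'` URL fragment is a guess, not something confirmed for this site.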