puppeteer 爬取一个做了反爬的网站 已经返回'content-disposition': 'attachment; filename=xx.zip 但还是无法下载文件

查看 31|回复 1
作者:zhouyin   
```javascript
{
  'cache-control': 'no-store, no-cache, must-revalidate',
  connection: 'Keep-Alive',
  'content-disposition': 'attachment; filename="xxxx.zip"',
  'content-length': '21147482',
  'content-type': 'application/zip',
  date: 'Sun, 02 Feb 2025 04:09:28 GMT',
  expires: 'Thu, 19 Nov 1981 08:52:00 GMT',
  'keep-alive': 'timeout=5, max=98',
  pragma: 'no-cache',
  server: 'Apache/2.4.52 (Ubuntu)',
  'set-cookie': 'PHPSESSID=94fvcskn745lsq6tb751gnrpov; expires=Sun, 02-Feb-2025 04:39:28 GMT; Max-Age=1800; path=/',
  'x-php-sess-user': 'dddddd'
}
```
本来能下载的 前几天网站好像升级了

puppeteer, 下载, 反爬

zhouyin
OP
  
在 win11 上随 node 启动的 chrome 看图
https://imgur.la/image/imgapp.xF2jF
puppeteer 是下面这样启动的
```javascript
const puppeteer = require('puppeteer-extra');
const StealthPlugin = require('puppeteer-extra-plugin-stealth')
puppeteer.use(StealthPlugin())
const browser = await puppeteer.launch({
    headless: false,
    args: ['--no-sandbox', '--disable-setuid-sandbox',   '-–disable-gpu',
        '-–disable-dev-shm-usage', '-–no-first-run', '-–no-zygote',
        '-–single-process'],
});
```
package.json 版本:
```javascript
"dependencies": {
    "chrome-har": "^0.7.1",
    "chrome-launcher": "^0.10.5",
    "puppeteer": "^24.1.1",
    "puppeteer-extra": "^3.3.6",
    "puppeteer-extra-plugin-stealth": "^2.11.2"
  }
```
您需要登录后才可以回帖 登录 | 立即注册

返回顶部