ygp-gdzwfw-gov-cn/examples/list-request.sh
秦秋旭 5acb847bc1 优化爬虫:增量爬取、API参数优化、Excel兼容
主要变更:
1. 重命名 ygp_crawler.py -> main.py
2. API参数优化:
   - tradingProcess 固定传 "513,2C52,3C52" 精准筛选中标结果
   - pageSize 固定为 50 提高抓取效率
   - 通过 publishStartTime/publishEndTime 传入时间范围
3. 默认查询最近3个月(原为当天)
4. 增量爬取改为默认开启(移除 -i 参数)
5. CSV文件添加 UTF-8 BOM,Excel可直接打开
6. 更新 README.md 文档
7. 添加前端 JS 代码参考文件到 assets/ 目录

使用方法:
- 增量更新:python main.py
- 全量查询:rm results.csv && python main.py

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-05 00:37:07 +08:00

20 lines
1.3 KiB
Bash

curl 'https://ygp.gdzwfw.gov.cn/ggzy-portal/search/v2/items' \
-H 'Connection: keep-alive' \
-b '_horizon_uid=ec39e18f-7968-4277-afeb-0bea42d5de45; _horizon_sid=0f61bd04-5719-467b-8f3a-4c2fb2208da4' \
-H 'Origin: https://ygp.gdzwfw.gov.cn' \
-H 'Referer: https://ygp.gdzwfw.gov.cn/' \
-H 'Sec-Fetch-Dest: empty' \
-H 'Sec-Fetch-Mode: cors' \
-H 'Sec-Fetch-Site: same-origin' \
-H 'User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/144.0.0.0 Safari/537.36' \
-H 'accept: application/json, text/plain, */*' \
-H 'accept-language: zh-CN,zh;q=0.9' \
-H 'content-type: application/json' \
-H 'sec-ch-ua: "Not(A:Brand";v="8", "Chromium";v="144", "Google Chrome";v="144"' \
-H 'sec-ch-ua-mobile: ?0' \
-H 'sec-ch-ua-platform: "macOS"' \
-H 'x-dgi-req-app: ggzy-portal' \
-H 'x-dgi-req-nonce: fCuDiZcgnqabdAfT' \
-H 'x-dgi-req-signature: 73c3b89179feda9eeb62397c326c440be997a8aa9eab5387820b8642f2c72b79' \
-H 'x-dgi-req-timestamp: 1770220343291' \
--data-raw '{"type":"trading-type","openConvert":false,"keyword":"","siteCode":"44","secondType":"A","tradingProcess":"513,2C52,3C52","thirdType":"[]","projectType":"","publishStartTime":"20251106000000","publishEndTime":"20260204235959","pageNo":1,"pageSize":50}'