
Update README.md · cwjokaka/ok_ip_proxy_pool@f031168 · GitHub
Update README.md
cwjokaka authored Sep 27, 2019
1 parent b4e5c06 commit f031168
Showing 1 changed file with 16 additions and 6 deletions.
22 changes: 16 additions & 6 deletions README.md
@@ -3,7 +3,7 @@



## Runtime environment:
## Runtime environment

- python 3.7

@@ -12,9 +12,9 @@
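## Features

- Asynchronous proxy crawling & validation 🚀
- Uses SQLite; no extra database environment required 🛴

- Currently supported proxies: 免费代理/全网/66/西刺/快代理/云代理/IP海
- Proxy availability is tracked with a weight score (+1 when a proxy passes validation, -1 otherwise) 🎭
- Uses SQLite; no database installation required 🛴
- Currently supported free proxy sources: 免费代理/全网/66/西刺/快代理/云代理/IP海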
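
The weight rule in the feature list (+1 on a passed validation, -1 otherwise) can be illustrated with a minimal sketch. The function name `update_weight` and the starting weight of 0 are assumptions made for this example; the README does not say how the project stores or initializes weights.

```python
def update_weight(weight, passed):
    """Return the new weight: +1 after a passed validation, -1 after a failed one."""
    return weight + 1 if passed else weight - 1


# Hypothetical validation history for one proxy: pass, pass, fail.
# The starting weight of 0 is an assumption, not taken from the project.
weight = 0
for passed in (True, True, False):
    weight = update_weight(weight, passed)

print(weight)  # 1
```

A proxy that keeps failing drifts toward a low weight, so sorting by weight is one plausible way to pick the more reliable proxies first.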



@@ -55,9 +55,9 @@ SPIDER = {

# Validator configuration
VALIDATOR = {
    'test_url': 'http://www.baidu.com',
    'test_url': 'http://www.baidu.com', # URL used for validation
    'request_timeout': 4, # validation request timeout
    'validate_interval': 30
    'validate_interval': 30 # validation interval (seconds)
}
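
As a rough illustration of how these settings could drive a check, the sketch below treats a proxy as valid when a request to `test_url` completes within `request_timeout`. It assumes the timeout is in seconds and uses a caller-supplied `fetch(url, proxy)` coroutine, which is hypothetical and stands in for whatever async HTTP client the project actually uses; `check_proxy` and `fake_fetch` are likewise names invented for this example.

```python
import asyncio

# Values copied from the VALIDATOR block in the README.
VALIDATOR = {
    'test_url': 'http://www.baidu.com',  # URL used for validation
    'request_timeout': 4,                # timeout, assumed to be in seconds
    'validate_interval': 30,             # validation interval (seconds)
}


async def check_proxy(proxy, fetch, config=VALIDATOR):
    """Return True if `fetch` reaches test_url through `proxy` in time.

    `fetch(url, proxy)` is a hypothetical coroutine supplied by the caller.
    """
    try:
        await asyncio.wait_for(
            fetch(config['test_url'], proxy),
            timeout=config['request_timeout'],
        )
        return True
    except (asyncio.TimeoutError, OSError):
        # Timeouts and connection failures both count as a failed validation.
        return False


async def fake_fetch(url, proxy):
    """Demo fetch: 'good' proxies answer at once, others fail to connect."""
    await asyncio.sleep(0)
    if not proxy.startswith('good'):
        raise ConnectionError(f'cannot reach {url} via {proxy}')
    return 200
```

Calling `asyncio.run(check_proxy('good:8080', fake_fetch))` returns `True`, while a proxy name that does not start with `good` returns `False`. A scheduler could then re-run the check every `validate_interval` seconds.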

# Database configuration
@@ -103,6 +103,16 @@ HEADERS = {



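## Extending the proxy spiders
To add a custom proxy spider, follow these steps:

1. Open src/spider/spiders.py
2. Add your own spider class that inherits from AbsSpider and implements its do_crawl method. Note: requests must be made **asynchronously**
3. Decorate the class with @spider_register
4. Add the class name to SPIDER['list'] in the configuration file setting.py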
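
The four steps above might look roughly like the sketch below. To keep it self-contained, `AbsSpider` and `spider_register` are re-created here as simplified stand-ins; the real ones live in the project and their signatures may differ. The `do_crawl` return type, the registry dict, the class name `MySpider`, and the `crawl_all` helper are all assumptions for illustration.

```python
import asyncio
from abc import ABC, abstractmethod

# Stand-in registry and decorator; the project's real @spider_register may differ.
SPIDER_REGISTRY = {}


def spider_register(cls):
    """Record the spider class under its class name and return it unchanged."""
    SPIDER_REGISTRY[cls.__name__] = cls
    return cls


class AbsSpider(ABC):
    """Stand-in base class; the real AbsSpider is defined in the project."""

    @abstractmethod
    async def do_crawl(self):
        """Return a list of proxies (return type assumed for this sketch)."""


# Steps 2 and 3: subclass AbsSpider, implement do_crawl, decorate the class.
@spider_register
class MySpider(AbsSpider):
    async def do_crawl(self):
        # Placeholder for an asynchronous HTTP request to a proxy site.
        await asyncio.sleep(0)
        return ['127.0.0.1:8080']


# Step 4: list the class name, as one would in SPIDER['list'] in setting.py.
SPIDER = {'list': ['MySpider']}


async def crawl_all():
    """Run every spider named in SPIDER['list'] and collect the proxies."""
    results = []
    for name in SPIDER['list']:
        spider = SPIDER_REGISTRY[name]()
        results.extend(await spider.do_crawl())
    return results
```

Running `asyncio.run(crawl_all())` returns the proxies from every listed spider. Looking spiders up by class name is one simple way a name in the config could be mapped to a class.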



## LAST

Forks, Stars, and Issues are all welcome 😘
