python: requests, a powerful tool for web crawling

Published: 2019-08-19 09:22:29

    requests is not a module that ships with Python; it is a third-party library and must be installed (e.g. with pip install requests) before use.

    How to use the requests library

    Enough talk; let's jump straight into some code and take a quick look at what it does:

    import requests

    # use a Session so cookies persist across requests;
    # don't shadow the module name with the session object
    session = requests.Session()
    headers = {
        'User-Agent':'Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:57.0) Gecko/20100101 Firefox/57.0'
    }
    url = "http://httpbin.org/get"
    response = session.get(url, headers=headers, timeout=None)
    print(response.text)                     # body decoded to text
    print(response.cookies)                  # cookies set by the server
    print(response.content)                  # raw bytes
    print(response.content.decode("utf-8"))  # bytes decoded by hand
    print(response.json())                   # body parsed as JSON
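    Query-string parameters do not have to be glued onto the URL by hand; requests encodes a params dict for you. A minimal sketch using requests.Request and prepare() so nothing is actually sent over the network (httpbin.org is kept from the example above):

```python
import requests

# build a GET request with query parameters but don't send it;
# prepare() shows exactly what URL requests would hit
req = requests.Request("GET", "http://httpbin.org/get",
                       params={"q": "python", "page": 1})
prepared = req.prepare()
print(prepared.url)  # → http://httpbin.org/get?q=python&page=1
```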

    A basic POST request:

    data = {
        "name":"zhaofan",
        "age":23
    }
    response = requests.post("http://httpbin.org/post",data=data)
    print(response.text)
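    data= sends a form-encoded body; requests can also send JSON via the json= keyword. A sketch that only prepares the requests (nothing is sent), so you can inspect the body and Content-Type each variant produces:

```python
import requests

# form-encoded body, as in the POST example above
form = requests.Request("POST", "http://httpbin.org/post",
                        data={"name": "zhaofan", "age": 23}).prepare()
print(form.body)                     # name=zhaofan&age=23
print(form.headers["Content-Type"])  # application/x-www-form-urlencoded

# JSON body via the json= keyword instead of data=
jsn = requests.Request("POST", "http://httpbin.org/post",
                       json={"name": "zhaofan", "age": 23}).prepare()
print(jsn.headers["Content-Type"])   # application/json
```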

    Requesting a site whose certificate is invalid:

    import requests
    import urllib3

    # suppress the InsecureRequestWarning that verify=False triggers
    urllib3.disable_warnings()
    response = requests.get("https://www.12306.cn", verify=False)
    print(response.status_code)
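    verify does not have to be passed on every call: it can be set once on a Session, and it also accepts the path to a CA bundle instead of a boolean. A small sketch (the bundle path in the comment is a hypothetical placeholder):

```python
import requests

session = requests.Session()
session.verify = False  # disable certificate checks for every request on this session
print(session.verify)   # → False

# alternatively, point verify at a custom CA bundle (hypothetical path):
# session.verify = "/path/to/ca-bundle.pem"
```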
    

    Proxy settings:

    import requests
    
    proxies= {
        "http":"http://127.0.0.1:9999",
        "https":"http://127.0.0.1:8888"
    }
    response  = requests.get("https://www.baidu.com",proxies=proxies)
    print(response.text)
    
    If the proxy requires a username and password, just change the dictionary to:

    proxies = {
        "http":"http://user:password@127.0.0.1:9999"
    }

    If your proxy speaks SOCKS, you first need to pip install "requests[socks]", then:

    proxies = {
        "http":"socks5://127.0.0.1:9999",
        "https":"socks5://127.0.0.1:8888"
    }
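    Proxies can likewise be attached to a Session once instead of being passed to every call. A minimal sketch reusing the addresses from the snippet above (no request is actually sent):

```python
import requests

session = requests.Session()
session.proxies.update({
    "http": "http://127.0.0.1:9999",
    "https": "http://127.0.0.1:8888",
})
# every request made through this session now goes through the proxies
print(session.proxies["http"])  # → http://127.0.0.1:9999
```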
    

    Timeout settings

    The timeout parameter sets how long to wait for the server:

    # wait indefinitely; the request never times out
    timeout=None
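    In practice you rarely want timeout=None. timeout accepts a single number, or a (connect, read) tuple to bound the two phases separately; when a limit is exceeded, requests raises an exception. A sketch that forces a failure against a non-routable address (10.255.255.1 is chosen here purely as an illustration):

```python
import requests

try:
    # (connect timeout, read timeout) in seconds
    requests.get("http://10.255.255.1/", timeout=(0.5, 1.0))
    timed_out = False
except requests.exceptions.RequestException:
    # ConnectTimeout, ConnectionError, etc. all derive from RequestException
    timed_out = True

print(timed_out)
```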
    

    Exception handling:

    import requests
    
    from requests.exceptions import ReadTimeout,ConnectionError,RequestException
    
    try:
        response = requests.get("http://httpbin.org/get", timeout=0.1)
        print(response.status_code)
    except ReadTimeout:
        # the server accepted the connection but was too slow to respond
        print("timeout")
    except ConnectionError:
        # DNS failure, refused connection, and similar network problems
        print("connection error")
    except RequestException:
        # catch-all base class for any other requests error
        print("error")
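    Beyond the transport-level exceptions above, a response with an HTTP error status (4xx/5xx) does not raise by itself; response.raise_for_status() turns it into an HTTPError. A sketch using a hand-built Response object so no network is needed (constructing Response directly is purely illustrative; normally it comes back from requests.get):

```python
import requests
from requests.exceptions import HTTPError

# build a bare Response and give it an error status (illustrative only)
response = requests.models.Response()
response.status_code = 404

try:
    response.raise_for_status()  # raises HTTPError for 4xx/5xx status codes
    raised = False
except HTTPError:
    raised = True

print(raised)  # → True
```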
    
