【python2.7】urllib2抓取网页基础

作者: tonyemail_st | 来源:发表于2017-11-02 21:12 被阅读0次

python爬虫经典案例，看完这一篇就够了
【python2.7】urllib2抓取网页基础
python爬虫(四)_urllib2库的基本使用
静态网站爬图片
8 个常用的Python爬虫技巧总结！
Python：爬虫技巧总结！
urllib2的使用（三）
爬虫原理与数据抓取之四: urllib2库的基本使用
爬虫003
python 爬取搜狐新闻

参考官方文档：https://docs.python.org/2/library/urllib2.html

urllib2模块只能在python2.7中使用，它在python3中被拆分成urllib.request和urllib.error
urllib2模块定义的方法与类主要涉及：

basic and digest authentication
redirections
cookies
These are provided by objects called handlers and openers.

写法1

import urllib2

response = urllib2.urlopen("http://www.baidu.com")
#print response.read()
file = open("baidu.html", 'w')
file.write(response.read())
file.close()

写法2

import  urllib2
req = urllib2.request("http://www.baidu.com")
fp = urllib2.urlopen(req)
file = open("baidu.html", 'w')
file.write(fp.read())
file.close()

python爬虫经典案例，看完这一篇就够了
urllib2 urllib2是Python中用来抓取网页的库，urllib2 是 Python2.7 自带的模块...
【python2.7】urllib2抓取网页基础
参考官方文档：https://docs.python.org/2/library/urllib2.html url...
python爬虫(四)_urllib2库的基本使用
本篇我们将开始学习如何进行网页抓取，更多内容请参考:python学习指南 urllib2库的基本使用所谓网页抓取...
静态网站爬图片
依赖于 Python2.7,urllib,urllib2,re1.简单的静态网页解析： 2.简单的动态网页解析：P...
8 个常用的Python爬虫技巧总结！
1、基本抓取网页 get方法 import urllib2 url"http://www.baidu.com" r...
Python：爬虫技巧总结！
一些常用的爬虫技巧归纳与以下几点： 1、基本抓取网页 get方法 import urllib2 url "http...
urllib2的使用（三）
urllib2的基本使用所谓网页抓取，就是把URL地址中指定的网络资源从网络流中读取出来，保存到本地。在Pyt...
爬虫原理与数据抓取之四: urllib2库的基本使用
urllib2库的基本使用所谓网页抓取，就是把URL地址中指定的网络资源从网络流中读取出来，保存到本地。在Py...
爬虫003
urllib2的使用所谓网页抓取，就是把URL地址中指定的网络资源从网络中读取出来，保存到本地或者数据库。在Py...
python 爬取搜狐新闻
python2.7,通过urllib2和BeautifulSoup爬取新闻文中还包括一些BeautifulSou...

网友评论

本文标题：【python2.7】urllib2抓取网页基础

本文链接：https://www.haomeiwen.com/subject/sgiapxtx.html

延伸阅读

深度阅读

您也可以注册成为美文阅读网的作者，发表您的原创作品、分享您的心情！

【python2.7】urllib2抓取网页基础

相关文章

python爬虫经典案例，看完这一篇就够了

【python2.7】urllib2抓取网页基础

python爬虫(四)_urllib2库的基本使用

静态网站爬图片

8 个常用的Python爬虫技巧总结！

Python：爬虫技巧总结！

urllib2的使用（三）

爬虫原理与数据抓取之四: urllib2库的基本使用

爬虫003

python 爬取搜狐新闻

网友评论

延伸阅读

深度阅读

栏目导航

热点阅读