Python爬虫入门-爬取新浪新闻

作者: 杏月阿六 | 来源:发表于2017-09-17 21:55 被阅读78次

Python爬虫入门-爬取新浪新闻
python爬虫
各类链接
一个不那么典型的Python爬虫
50行Python爬取猫眼电影TOP100榜单信息
爬虫很难？最适合新人上手的3个Python项目,即学即用！
3 个适合新人上手的Python项目
Python网络爬虫（八） - 利用有道词典实现一个简单翻译程序
Python网络爬虫（七）- 深度爬虫CrawlSpider
Python网络爬虫（二）- urllib爬虫案例

运行环境：Python3.6.0

所需的包：

from bs4 import BeautifulSoup
import requests

response = requests.get("http://news.sina.com.cn/china/")
response.encoding = "utf-8"
soup = BeautifulSoup(response.text, "lxml")
headers = soup.select("div.news-item > h2")
links = soup.select("div.news-item > h2 > a")
times = soup.select("div.time")

for header, link, time in zip(headers, links, times):
    with open("sina_news.txt", "a") as f:
        f.write(header.get_text() + "\n" +
                time.get_text() + "\n" +
                link.get("href") +
                "\n---------------------\n")

爬取结果：

Python爬虫入门-爬取新浪新闻.JPG

Python爬虫入门-爬取新浪新闻
运行环境：Python3.6.0 所需的包：爬取结果：
python爬虫
一、新闻爬虫实战（爬取新浪新闻首页所有新闻内容）思路：1、爬取新闻首页2、得到各新闻链接3、爬取新闻链接4、寻找有...
各类链接
爬虫使用python-aiohttp爬取今日头条【Python】爬虫爬取各大网站新闻 Scrapy 模拟登录新...
一个不那么典型的Python爬虫
PYTHON爬虫入门&视频网站BILIBILI用户爬取爬虫详解前言 Python使用版本：2.7 得到数据挖掘的...
50行Python爬取猫眼电影TOP100榜单信息
今天，手把手教你入门 Python 爬虫，爬取猫眼电影 TOP100 榜信息。对于 Python 初学者来说，爬...
爬虫很难？最适合新人上手的3个Python项目,即学即用！
今天给大家分享三个极实用的Python爬虫案例。 1、爬取网站美图爬取图片是最常见的爬虫入门项目，不复杂却能很好...
3 个适合新人上手的Python项目
今天给大家分享三个极实用的Python爬虫案例。 1、爬取网站美图爬取图片是最常见的爬虫入门项目，不复杂却能很好...
Python网络爬虫（八） - 利用有道词典实现一个简单翻译程序
目录： Python网络爬虫（一）- 入门基础Python网络爬虫（二）- urllib爬虫案例Python网络爬...
Python网络爬虫（七）- 深度爬虫CrawlSpider
目录： Python网络爬虫（一）- 入门基础Python网络爬虫（二）- urllib爬虫案例Python网络爬...
Python网络爬虫（二）- urllib爬虫案例
目录： Python网络爬虫（一）- 入门基础Python网络爬虫（二）- urllib爬虫案例Python网络爬...