技术教育社区
www.teccses.org

Python网络爬虫

封面

作者:耿兴隆

页数:220

出版社:电子工业出版社

出版日期:2023

ISBN:9787121438103

电子书格式:pdf/epub/txt

内容简介

本书介绍如何结合Python进行网络爬虫程序的开发,从Python语言的基本特性入手,详细介绍了Python网络爬虫开发的各个方面,涉及HTTP、HTML、JavaScript、正则表达式、自然语言处理、数据科学等不同领域的内容。全书共10章,包括Python基础知识、网站分析、网页解析、Python文件读写、Python与数据库、AJAX技术、模拟登录、文本与数据分析、网站测试、Scrapy爬虫框架、爬虫性能等多个主题。本书可作为高等职业院校计算机类专业的专业课教材,也可供计算机相关从业人员选用参考。

作者简介

耿兴隆,Autodesk中国认证考试中心首席专家,全面负责Autodesk中国官方认证考试大纲制定、题库建设、技术咨询和师资力量培训工作。其创作的很多教材成为国内具有引导性的旗帜作品,在国内相关专业方向图书创作领域具有举足轻重的地位。

目录

目录

项目一 Python 基础认知 ····················································································.1

任务一 Python 概述 ·······································································································.1

一、Python 简介 ······································································································.1

二、安装Python ······································································································.2

三、安装PyCharm ···································································································.6

四、Python 语法规范 ·······························································································.11

任务二 Python 命令的组成 ·····························································································.13

一、基本符号 ·········································································································.14

二、常量与变量 ······································································································.16

三、数据类型 ·········································································································.19

四、功能符号 ·········································································································.24

任务三 程序结构 ·········································································································.26

一、表达式语句 ······································································································.26

二、顺序结构 ·········································································································.27

三、选择结构 ·········································································································.28

四、循环结构 ·········································································································.30

五、条件表达式 ······································································································.31

六、程序的流程控制 ································································································.32

项目实战 ·····················································································································.33

实战 输出百度网址 ································································································.33

项目二 网络爬虫基础认知 ················································································.35

任务一 网络爬虫概述 ···································································································.35

一、网络爬虫的基本原理 ··························································································.36

二、网络爬虫系统框架 ·····························································································.37

三、爬行策略 ·········································································································.37

四、网络爬虫的分类 ································································································.38

五、开源网络爬虫框架/项目 ······················································································.39

任务二 HTTP ·············································································································.41

一、HTTP 的工作原理 ·····························································································.41

二、Urllib 模块库 ···································································································.42

三、URL 定义 ·······································································································.43

四、URL 编码设置 ·································································································.47

任务三 网页请求过程 ···································································································.50

一、发送请求报文 ··································································································.51

二、返回响应 ········································································································.52

三、HTTP 消息 ··········································································

下载地址

立即下载

(解压密码:www.teccses.org)

Article Title:《Python网络爬虫》
Article link:https://www.teccses.org/1452395.html