新書推薦:
![索恩丛书·时尚女王与法国大革命](http://103.6.6.66/upload/mall/productImages/24/19/9787522828145.jpg)
《
索恩丛书·时尚女王与法国大革命
》
售價:NT$
510.0
![秦汉帝陵制度研究](http://103.6.6.66/upload/mall/productImages/24/18/9787573210951.jpg)
《
秦汉帝陵制度研究
》
售價:NT$
510.0
![共赢——商业生态与平台战略](http://103.6.6.66/upload/mall/productImages/24/20/9787300326184.jpg)
《
共赢——商业生态与平台战略
》
售價:NT$
359.0
![如何配置全球资产](http://103.6.6.66/upload/mall/productImages/24/19/9787521763324.jpg)
《
如何配置全球资产
》
售價:NT$
442.0
![暗黑历史书系·欧洲的国王和女王(疯狂、疾病和丑闻,欧洲王室历史上的黑暗篇章)](http://103.6.6.66/upload/mall/productImages/24/20/9787218172743.jpg)
《
暗黑历史书系·欧洲的国王和女王(疯狂、疾病和丑闻,欧洲王室历史上的黑暗篇章)
》
售價:NT$
510.0
![海外中国研究·纠纷与秩序:徽州文书中的明朝(海外中国研究丛书精选版第四辑)](http://103.6.6.66/upload/mall/productImages/24/20/9787214290571.jpg)
《
海外中国研究·纠纷与秩序:徽州文书中的明朝(海外中国研究丛书精选版第四辑)
》
售價:NT$
510.0
![海外中国研究·寻找六边形:中国农村的市场和社会结构(海外中国研究丛书精选版第四辑)](http://103.6.6.66/upload/mall/productImages/24/20/9787214290687.jpg)
《
海外中国研究·寻找六边形:中国农村的市场和社会结构(海外中国研究丛书精选版第四辑)
》
售價:NT$
354.0
![创业股权融资地图:安全引入投资人 何青阳](http://103.6.6.66/upload/mall/productImages/24/20/9787111749790.jpg)
《
创业股权融资地图:安全引入投资人 何青阳
》
售價:NT$
463.0
|
內容簡介: |
本书介绍如何结合Python进行网络爬虫程序的开发,从Python语言的基本特性入手,详细介绍了Python网络爬虫开发的各个方面,涉及HTTP、HTML、JavaScript、正则表达式、自然语言处理、数据科学等不同领域的内容。全书共10章,包括Python基础知识、网站分析、网页解析、Python文件读写、Python与数据库、AJAX技术、模拟登录、文本与数据分析、网站测试、Scrapy爬虫框架、爬虫性能等多个主题。本书可作为高等职业院校计算机类专业的专业课教材,也可供计算机相关从业人员选用参考。
|
關於作者: |
耿兴隆,Autodesk中国认证考试中心首席专家,全面负责Autodesk中国官方认证考试大纲制定、题库建设、技术咨询和师资力量培训工作。其创作的很多教材成为国内具有引导性的旗帜作品,在国内相关专业方向图书创作领域具有举足轻重的地位。
|
目錄:
|
目录项目一 Python 基础认知 ····················································································.1任务一 Python 概述 ·······································································································.1一、Python 简介 ······································································································.1二、安装Python ······································································································.2三、安装PyCharm ···································································································.6四、Python 语法规范 ·······························································································.11任务二 Python 命令的组成 ·····························································································.13一、基本符号 ·········································································································.14二、常量与变量 ······································································································.16三、数据类型 ·········································································································.19四、功能符号 ·········································································································.24任务三 程序结构 ·········································································································.26一、表达式语句 ······································································································.26二、顺序结构 ·········································································································.27三、选择结构 ·········································································································.28四、循环结构 ·········································································································.30五、条件表达式 ······································································································.31六、程序的流程控制 ································································································.32项目实战 ·····················································································································.33实战 输出百度网址 ································································································.33项目二 网络爬虫基础认知 ················································································.35任务一 网络爬虫概述 ···································································································.35一、网络爬虫的基本原理 ··························································································.36二、网络爬虫系统框架 ·····························································································.37三、爬行策略 ·········································································································.37四、网络爬虫的分类 ································································································.38五、开源网络爬虫框架/项目 ······················································································.39任务二 HTTP ·············································································································.41一、HTTP 的工作原理 ·····························································································.41二、Urllib 模块库 ···································································································.42三、URL 定义 ·······································································································.43四、URL 编码设置 ·································································································.47任务三 网页请求过程 ···································································································.50一、发送请求报文 ··································································································.51二、返回响应 ········································································································.52三、HTTP 消息 ··········································································
|
|