Java知识分享网 - 轻松学习从此开始!    

Java知识分享网

Java1234官方群25:java1234官方群17
Java1234官方群25:838462530
        
SpringBoot+SpringSecurity+Vue+ElementPlus权限系统实战课程 震撼发布        

最新Java全栈就业实战课程(免费)

springcloud分布式电商秒杀实战课程

IDEA永久激活

66套java实战课程无套路领取

锋哥开始收Java学员啦!

Python学习路线图

锋哥开始收Java学员啦!
当前位置: 主页 > Java文档 > Java基础相关 >

Elasticsearch数据分析与实战应用 PDF 下载


分享到:
时间:2024-03-29 09:46来源:http://www.java1234.com 作者:转载  侵权举报
Elasticsearch数据分析与实战应用 PDF 下载 下载地址: 版权归出版社和原作者所有,链接已删除,请购买正版 用户下载说明: 电子版仅供预览,下载后24小时内务必删除,支持正版,喜欢
失效链接处理
Elasticsearch数据分析与实战应用 PDF 下载

下载地址:
版权归出版社和原作者所有,链接已删除,请购买正版

用户下载说明:

电子版仅供预览,下载后24小时内务必删除,支持正版,喜欢的请购买正版书籍:
https://product.dangdang.com/29292697.html
 

相关截图:




资料简介:

Elasticsearch数据分析与实战应用    使用一套金融数据以及Elasticsearch搜索技术和聚合框架,用来进行数据分析,是一本技术全面、案例丰富、注重实操的入门书。书中不仅细致地讲解Elasticsearch基础知识和核心接口,还进一步讲述数据建模和实际应用,例如金融舆情分析及机器学习等技术应用。后逐步详细讲解了如何以编程方式构建RESTful服务接口,实践所学技术。由于书中有很多金融数据分析例子,读者也可了解相应应用场景以及学会其解决方案。书中示例采用一套开源的金融数据和文本分析插件进行编写。本书非常适合搜索工程师、数据分析师、数据库工程师阅读使用,还适合Elasticsearch初学者以及想要进阶提升为中高级技术的读者使用



资料目录:

第 1 章 Elasticsearch 概述
1.1 准备环境 ..........................................................................................................................1
1.2 运行 Elasticsearch 7.5.1 ...................................................................................................2
1.2.1 Elasticsearch 基本配置 .......................................................................................2
1.2.2 重要的系统配置 .................................................................................................4
1.3 与 Elasticsearch 7.5.1 进行对话 ......................................................................................4
1.4 了解 Elasticsearch 的架构 ...............................................................................................5
1.5 熟悉关键概念 ..................................................................................................................7
1.5.1 Elasticsearch 映射概念 .......................................................................................7
1.5.2 比较 SQL 与 Elasticsearch 的相关概念 ............................................................7
1.5.3 分析器(Analyzer) ..........................................................................................8
1.5.4 标准分析器(Standard Analyzer) ...................................................................8
1.6 接口用法约定说明 ..........................................................................................................8

第 2 章 获取本书实验数据
2.1 了解实验数据集 ............................................................................................................11
2.1.1 公募基金简介 ...................................................................................................11
2.1.2 公募基金类型介绍 ...........................................................................................11
2.1.3 公募基金数据归类 ...........................................................................................12
2.2 通过 Python SDK 获取多维数据 ..................................................................................15
2.2.1 注册及安装 Tushare .........................................................................................15
2.2.2 Tushare 数据接口的调用方法 .........................................................................16

第 3 章 索引管理
3.1 基础索引管理 ................................................................................................................19
3.1.1 基本索引操作 ...................................................................................................19
3.1.2 设置索引 ...........................................................................................................22
3.1.3 索引模板 ...........................................................................................................25
3.1.4 索引别名 ...........................................................................................................26
3.1.5 索引别名应用场景 ...........................................................................................27
3.2 进阶索引管理 ................................................................................................................29
3.2.1 索引存储原理 ...................................................................................................29
3.2.2 索引的底层信息接口 .......................................................................................30
3.2.3 优化索引 ...........................................................................................................33

第 4 章 文档管理
4.1 文档管理 ........................................................................................................................37
4.1.1 了解 Elasticsearch 文档管理原理 ....................................................................37
4.1.2 认识单文档接口功能 .......................................................................................38
4.1.3 认识批量多文档接口功能 ...............................................................................43
4.2 文档结构 ........................................................................................................................47
4.2.1 认识映射类型 ...................................................................................................47
4.2.2 映射的元字段简介 ...........................................................................................52
4.2.3 映射数据类型简介 ...........................................................................................52
4.2.4 映射数据类型的参数简介 ...............................................................................54
4.2.5 更新显式映射内容并刷新文档索引 ...............................................................56

第 5 章 分析文本内容
5.1 分析器的结构 ................................................................................................................59
5.1.1 分析接口 ...........................................................................................................59
5.1.2 字符过滤器 .......................................................................................................60
5.1.3 分词器 ...............................................................................................................61
5.1.4 词汇单元过滤器 ...............................................................................................64
5.2 利用内置分析器进行分析 ............................................................................................68
5.3 利用自定义分析器进行分析 ........................................................................................69
5.4 归一化处理器 ................................................................................................................70

第 6 章 文本分析插件
6.1 Elasticsearch 插件是什么 ..............................................................................................71
安装插件 .......................................................................................................................72
6.2 使用 ICU 分析插件 .......................................................................................................72
使用 icu_analyzer 分析器示例 .....................................................................................73
6.3 使用 Smart Chinese 分析插件 .......................................................................................74
6.3.1 安装 Smart Chinese 分析插件 ..........................................................................74
6.3.2 使用 smartcn 分析器示例 ................................................................................74
6.4 使用 IK 分析插件 ..........................................................................................................75
6.4.1 安装 IK 分析插件 .............................................................................................75
6.4.2 使用 ik_smart 分析器示例 ...............................................................................76
6.5 使用 HanLP 分析插件 ...................................................................................................76
6.5.1 安装 elasticsearch-analysis-hanlp 分析插件 ....................................................77
6.5.2 使用 hanlp 分析器示例 ....................................................................................77
6.5.3 使用 hanlp 自定义词典热更新 ........................................................................78
6.5.4 使用 hanlp 分词器自定义分析器 ....................................................................79
6.5.5 简评 hanlp 分词器的对称性 ............................................................................81
6.6 使用 Aliws 分析插件 .....................................................................................................81

第 7 章 搜索数据和查询表达式
7.1 索引样本文件 ................................................................................................................83
7.2 基础搜索接口 ................................................................................................................85
7.2.1 通过 URI 进行搜索 ..........................................................................................85
7.2.2 通过请求主体(request body)进行搜索 ......................................................89
7.3 进阶搜索 ........................................................................................................................98
认识查询表达式 ...........................................................................................................98
7.4 其他相关功能 ..............................................................................................................114
7.4.1 搜索多重目标接口 .........................................................................................114
7.4.2 搜索结果试算接口 .........................................................................................115
7.4.3 评分说明接口 .................................................................................................117
7.4.4 字段功能接口 .................................................................................................117
7.4.5 搜索查询评估接口 .........................................................................................118
7.4.6 性能分析设置参数 .........................................................................................119
7.4.7 查询建议器 .....................................................................................................120

第 8 章 数据建模
8.1 数据建模及方法 ..........................................................................................................125
8.1.1 使用非规范化方法 .........................................................................................125
8.1.2 使用对象数据类型方法 .................................................................................128
8.1.3 使用嵌套数据类型方法 .................................................................................132
8.1.4 使用父子类关联数据类型方法 .....................................................................134
8.1.5 父子类关联数据类型查询方法 .....................................................................137
8.2 实际应用场景操作 ......................................................................................................140

第 9 章 聚合框架
9.1 基金净值和基金持仓样本文档 ..................................................................................142
9.2 聚合查询语法 ..............................................................................................................144
9.3 矩阵统计聚合 ..............................................................................................................145
9.4 度量指标聚合 ..............................................................................................................146
9.4.1 最大值聚合 .....................................................................................................146
9.4.2 最小值聚合 .....................................................................................................147
9.4.3 总和聚合 .........................................................................................................147
9.4.4 值计数聚合 .....................................................................................................148
9.4.5 平均值聚合 .....................................................................................................148
9.4.6 加权平均值聚合 .............................................................................................149
9.4.7 基数聚合 .........................................................................................................149
9.4.8 统计聚合 .........................................................................................................149
9.4.9 扩展统计聚合 .................................................................................................150
9.4.10 中位数绝对偏差聚合 ...................................................................................151
9.4.11 百分位聚合 ...................................................................................................151
9.4.12 百分位等级聚合 ...........................................................................................152
9.4.13 地理重心聚合 ...............................................................................................152
9.4.14 地理边界聚合 ...............................................................................................153
9.4.15 最热点聚合 ...................................................................................................153
9.4.16 脚本式度量指标聚合 ...................................................................................154
9.5 存储桶聚合 ..................................................................................................................155
9.5.1 范围聚合 .........................................................................................................156
9.5.2 IP 范围聚合 ....................................................................................................157
9.5.3 日期范围聚合 .................................................................................................158
9.5.4 直方图聚合 .....................................................................................................159
9.5.5 日期直方图聚合 .............................................................................................160
9.5.6 自动间隔日期直方图聚合 .............................................................................161
9.5.7 词条聚合 .........................................................................................................161
9.5.8 稀有词条聚合 .................................................................................................162
9.5.9 显著词条聚合 .................................................................................................163
9.5.10 显著文本聚合 ...............................................................................................164
9.5.11 采样器聚合 ...................................................................................................164
9.5.12 多元化采样器聚合 .......................................................................................165
9.5.13 过滤器聚合 ...................................................................................................166
9.5.14 多过滤器聚合 ...............................................................................................166
9.5.15 地理距离聚合 ...............................................................................................167
9.5.16 地理哈希网格聚合 .......................................................................................167
9.5.17 地理瓦片网格聚合 .......................................................................................169
9.5.18 缺失字段聚合 ...............................................................................................169
9.5.19 全局聚合 .......................................................................................................170
9.5.20 邻接矩阵聚合 ...............................................................................................170
9.5.21 复合聚合 .......................................................................................................171
9.5.22 子文档聚合 ...................................................................................................173
9.5.23 嵌套聚合 .......................................................................................................174
9.5.24 父文档聚合 ...................................................................................................175
9.5.25 反向嵌套聚合 ...............................................................................................176
9.6 管道聚合 ......................................................................................................................178
9.6.1 桶平均值聚合 .................................................................................................179
9.6.2 桶最小值聚合 .................................................................................................180
9.6.3 桶最大值聚合 .................................................................................................181
9.6.4 桶百分位聚合 .................................................................................................182
9.6.5 桶统计聚合 .....................................................................................................182
9.6.6 桶扩展统计聚合 .............................................................................................183
9.6.7 桶总和聚合 .....................................................................................................184
9.6.8 桶脚本聚合 .....................................................................................................185
9.6.9 桶选择器聚合 .................................................................................................186
9.6.10 桶排序聚合 ...................................................................................................187
9.6.11 累计基数聚合 ...............................................................................................188
9.6.12 累计总和聚合 ...............................................................................................189
9.6.13 导数聚合 .......................................................................................................189
9.6.14 移动函数聚合 ...............................................................................................191
9.6.15 串行差分聚合 ...............................................................................................194
9.7 后置过滤器 ..................................................................................................................195

第 10 章 摄取节点管道处理接口
10.1 摄取节点接口 ............................................................................................................197
10.1.1 创建或更新接口 ...........................................................................................197
10.1.2 读取接口 .......................................................................................................198
10.1.3 模拟接口 .......................................................................................................198
10.1.4 删除接口 .......................................................................................................199
10.2 摄取管道处理器 ........................................................................................................199
10.2.1 附加处理器 ...................................................................................................200
10.2.2 删除处理器 ...................................................................................................200
10.2.3 重命名处理器 ...............................................................................................201
10.2.4 小写处理器 ...................................................................................................202
10.2.5 大写处理器 ...................................................................................................202
10.2.6 拆分处理器 ...................................................................................................203
10.2.7 连接处理器 ...................................................................................................204
10.2.8 修剪处理器 ...................................................................................................205
10.2.9 设置处理器 ...................................................................................................205
10.2.10 日期处理器 .................................................................................................206
10.2.11 脚本处理器 .................................................................................................207
10.2.12 丢弃处理器 .................................................................................................207
10.2.13 管道委托处理器 .........................................................................................207
10.2.14 故障处理器 .................................................................................................208
10.2.15 字节处理器 .................................................................................................209
10.2.16 转换处理器 .................................................................................................209
10.2.17 循环处理器 .................................................................................................210
10.2.18 geoip 处理器 ...............................................................................................211
10.2.19 Grok 处理器 ................................................................................................211
10.2.20 分解处理器 .................................................................................................212
10.2.21 Gsub 处理器 ................................................................................................213
10.2.22 HTML Strip 处理器 ....................................................................................214
10.2.23 URL 解码处理器 ........................................................................................215
10.2.24 JSON 处理器 ..............................................................................................215
10.2.25 键值对处理器 .............................................................................................216
10.2.26 用户代理处理器 .........................................................................................218
10.2.27 排序处理器 .................................................................................................219
10.2.28 点扩展器处理器 .........................................................................................219
10.2.29 丰富处理器 .................................................................................................220
10.2.30 日期索引名称处理器 .................................................................................221
10.3 处理管道中的故障 ....................................................................................................222

第 11 章 使用 Elasticsearch 进行探索性数据分析
11.1 数据处理 ....................................................................................................................224
11.1.1 日线行情显式映射 .......................................................................................224
11.1.2 创建 ohlc_avg_price_pipeline 摄取节点管道 ..............................................225
11.1.3 批量处理索引文档 .......................................................................................225
11.1.4 公募基金交易行情文档索引操作 ...............................................................225
11.2 指标数据分析 ............................................................................................................226
11.2.1 执行扩展统计聚合 .......................................................................................227
11.2.2 执行矩阵统计聚合 .......................................................................................227
11.2.3 执行百分位聚合和百分位等级聚合 ...........................................................228
11.2.4 执行导数聚合 ...............................................................................................229
11.2.5 执行移动函数聚合 .......................................................................................229
11.3 投资组合 ....................................................................................................................230
投资组合季度清单 .....................................................................................................230

第三篇  Java 和 Python 客户端编程介绍
第 12 章 Java 客户端编程
12.1 Elasticsearch Java REST 客户端概览 .......................................................................234
12.2 Java 低级别 REST 客户端 ........................................................................................236
12.2.1 Java 低级别 REST 客户端操作流程 ...........................................................236
12.2.2 REST 客户端初始化 ....................................................................................236
12.2.3 执行 REST 客户端请求 ...............................................................................237
12.2.4 处理 REST 客户端响应 ...............................................................................238
12.2.5 关闭 REST 客户端 .......................................................................................238
12.2.6 封装低级别 REST 客户端请求与处理其响应 ...........................................238
12.2.7 调用自定义的 performSyncRequest 和 performAsyncRequest 方法 .........240
12.3 使用 Swagger UI 测试低级别 REST 客户端 ...........................................................240
12.4 Java 高级别 REST 客户端 ........................................................................................244
12.4.1 封装高级别 REST 客户端 ...........................................................................244
12.4.2 提供间接调用方法的 RestHighLevelClient 成员 .......................................245
12.4.3 间接调用方法 ...............................................................................................245
12.4.4 直接调用方法 ...............................................................................................248
12.4.5 构造查询请求 ...............................................................................................251
12.4.6 自定义 searchSync 与处理其响应 ...............................................................253
12.4.7 自定义 searchASync 与处理其响应 ............................................................254
12.4.8 自定义 getIndexSettingsSync 方法与处理其响应 ......................................255
12.4.9 自定义 getIndexSettingsAsync 方法与处理其响应 ....................................256
12.5 使用 Swagger UI 测试高级别 REST 客户端 ...........................................................256
12.6 Java 高级别 REST 客户端聚合操作简介 ................................................................259
12.7 使用 Swagger UI 测试 Java 高级别 REST 客户端聚合操作 ..................................262

第 13 章 Python 客户端编程
13.1 Elasticsearch Python 客户端概览 ..............................................................................264
13.2 elasticsearch-py 软件包 .............................................................................................265
13.2.1 提供间接调用方法的成员 ...........................................................................265
13.2.2 间接调用方法 ...............................................................................................266
13.2.3 直接调用方法 ...............................................................................................267
13.2.4 操作流程 .......................................................................................................269
13.3 使用 Swagger UI 测试调用方法 ...............................................................................270
13.4 elasticsearch-dsl 软件包 .............................................................................................274
13.4.1 提供特定接口的类 .......................................................................................275
13.4.2 elasticsearch-dsl 软件包中 Index 类提供的调用方法 ................................276
13.4.3 elasticsearch-dsl 软件包中 Search 类提供的调用方法 ...............................276
13.4.4 构造查询请求 ...............................................................................................277
13.4.5 执行请求方法与处理其响应 .......................................................................279
13.5 使用 Swagger UI 测试 elasticsearch-dsl 软件包 .......................................................280
13.6 elasticsearch-dsl 聚合操作简介 .................................................................................283
13.7 使用 Swagger UI 测试 elasticsearch-dsl 聚合操作 ...................................................285

第四篇  进阶功能和数据分析实战
第 14 章 Elasticsearch 与金融舆情分析
14.1 文本情感分析简介 ....................................................................................................287
14.2 文本情感分析软件服务 ............................................................................................287
14.3 文本情感分析开源项目 ............................................................................................288
14.3.1 TextBlob ........................................................................................................288
14.3.2 SnowNLP ......................................................................................................288
14.3.3 BosonNLP .....................................................................................................289
14.3.4 Stanford CoreNLP .........................................................................................289
14.3.5 百度 Senta .....................................................................................................289
14.4 文本情感分析插件开源项目 ....................................................................................291
14.4.1 ESAP 开源项目简介 ....................................................................................291
14.4.2 ESAP 开源项目安装与测试 ........................................................................294
14.5 中文金融领域文本情感分析 ....................................................................................295
14.6 应用 Elasticsearch 进行股票分析和预测 .................................................................296
14.6.1 安装与运行相关软件 ...................................................................................296
14.6.2 Stocksight 开源项目的 sentiment_analysis 程序 .........................................299

第 15 章 使用 Elasticsearch 进行机器学习
15.1 Kibana 简介 ................................................................................................................301
15.1.1 准备环境和运行 ...........................................................................................301
15.1.2 测试 Kibana ..................................................................................................301
15.2 Kibana、Elasticsearch 与机器学习 ...........................................................................304
15.2.1 安装试用许可证 ...........................................................................................304
15.2.2 机器学习测试数据 .......................................................................................305
15.2.3 Elastic 机器学习工作流程 ...........................................................................308
15.3 Elasticsearch 机器学习异常检测接口 ......................................................................322
15.3.1 异常检测任务资源 .......................................................................................322
15.3.2 数据馈送 .......................................................................................................324
15.3.3 特定时间段资源 ...........................................................................................326
15.3.4 过滤器 ...........................................................................................................329
15.4 Elasticsearch 机器学习数据框分析接口 ..................................................................330

第 16 章 构建金融数据分析服务 RESTful 接口
16.1 基金业绩指标 ............................................................................................................333
16.1.1 基金业绩分类 ...............................................................................................333
16.1.2 投资类型数据丰富处理器 ...........................................................................334
16.2 测试样本文件 ............................................................................................................335
16.2.1 准备测试环境 ...............................................................................................336
16.2.2 检验测试环境 ...............................................................................................337
16.3 使用 Spring Boot 构建 RESTful 接口服务...............................................................342
16.3.1 AnalyticsController 类简介 ..........................................................................343
16.3.2 AnalyticsServiceImpl 类简介 .......................................................................343
16.3.3 使用 Swagger UI 测试 poof-analytics 项目 .................................................348
16.3.4 聚合结果解析简介 .......................................................................................353




------分隔线----------------------------

锋哥公众号


锋哥微信


关注公众号
【Java资料站】
回复 666
获取 
66套java
从菜鸡到大神
项目实战课程

锋哥推荐