基于多模态数据的行为和手势识别(英文版)张亮//李宁//朱光明//冯明涛西安电子科技大学出版社豆瓣PDF电子书bt网盘迅雷下载计算机-操作系统-霍普软件下载网

This book provides a series of gesture and behavior recognition methods based on multimodal datarepresentation. The data modalities include image data and skeleton data, and the modeling methods includetraditional codebook, topological graph, and LSTM architectures. The tasks include single gesture recognitionclassification, single action recognition classification, continuous gesture classification, complex behaviorclassification of human interaction and other tasks of different complexity. This book focuses on the dataprocessing methods of each modality, and the modeling methods for different tasks. We hope the reader canlearn basic gesture and action recognition methods from this book, and develop a model system that suits theirneeds on this basis.
This book can be used as a textbook for graduate, postgraduate and PhD students majoring in computerscience, automation, etc. It can also be used as a reference for the reader who is interested in gesturerecognition, human action interaction, sequence data processing, and deep neural network design, and whohopes to contribute to the fields.

Chapter 1 Human Action Recognition Using MultMayer Codebooks of Key Poses and Atomic Motions
1.1 Introduction
1.2 Related Work
1.2.1 Feature Representation
1.2.2 Classification Model
1.3 Construction of Multi-layer Codebook
1.3.1 Feature Representation
1.3.2 Feature Sequence Segmentation
1,3.3 Pose-layer Codebook
1.3.4 Motion-layer Codebook
1.3.5 Multi-layer Codebook Construction
1.4 Classification Methods
1.4.1 Naive Bayes Nearest Neighbor
1.4.2 Support Vector Machine
1.4.3 Random Forest
1.5 Experimental Results
1.5.1 Experiments on the CAD-60 dataset
1.5.2 Experiments on the MSRC-12 dataset
1.5.3 Discussion
1.6 Conclusion and Future Work
Acknowledgements
References
Chapter 2 Topology-learnable Graph Convolution for Skeleton-based Action Recognition
2.1 Introduction
2.2 Related Work
2.2.1 Graph Convolutional Network for Action Recognition
2.2.2 Adaptive Graph Convolution
2.3 Topology-learnable Graph Convolution
2.3.1 Graph Convolution
2.3.2 Graph Topology Analysis
2.3.3 Topology-learnable Graph Convolution
2.3.4 Topology-learnable GCNs
2.4 Experiments
2.4.1 Datasets
2.4.2 Ablation Study
2.4.3 Comparison with the State-of-the-art Methods
2.4.4 Discussion
2.5 Conclusion
Acknowledgements
References
Chapter 3 Recurrent Graph Convolutional Networks for Skeleton-based Action Recognition
3.1 Introduction
3.2 Related Work
3.2.1 Graph Convolution for Action Recognition
3.2.2 LSTM on Graphs
3.3 Recurrent Graph Convolutional Network
3.3.1 Graph Convolution
3.3.2 Adaptive Graph Convolution
3.3.3 Recurrent Graph Convolution
3.3.4 Recurrent Graph Convolutional Network
3.4 Experiments
3.4.1 Datasets
3.4.2 Training Details
3.4.3 Ablation Study
3.4.4 Comparison with the State-of-the-art Methods
3.4.5 Visualization of the Evolved Graph Topologies
3.5 Conclusion
Acknowledgements
References
Chapter 4 Graph-temporal LSTM Networks for Skeleton-based Action Recognition
4.1 Introduction
4.2 Related Work
4.3 GT-LSTM Networks
4.3.1 Pipeline Overview
4.3.2 Topology-learnable ST-GCN
4.3.3 GT-LSTM
4.3.4 GT-LSTM Networks
4.4 Experiments
4.4.1 Datasets
4.4.2 Training Details
4.4.3 Ablation Study
4.4.4 Comparison with the State-of-the-art Methods
4.5 Conclusion
References
Chapter 5 Spatio-temporal Interaction Graph Parsing Networks for Human-object Interaction Recognition
5.1 Introduction
5.2 Related Work
5.3 Overview
5.4 Proposed Approach
5.4.1 Video Feature Extraction
5.4.2 Spatio-temporal Interaction Graph Parsing
5.4.3 Inference
5.4.4 Implementation Details
5.5 Experiments
5.5.1 Dataset
5.5.2 Ablation Study
5.5.3 Comparison with the State-of-the-arts Methods
5.5.4 Visualization of Parsed Graphs
5.6 Conclusion
Acknowledgements
References
Chapter 6 Learning Spatio-temporal Features Using 3DCNN and Convolutional LSTM For Gesture Recognition
6.1 Introduction
6.2 Related Work
6.3 Method
6.3.1 2D Spatio-temporal Feature Map Learning
6.3.2 Classification Based on the 2D Feature Maps
6.3.3 Network Training
6.4 Experiments
6.4.1 Datasets
6.4.2 Implementation
6.4.3 Architecture Analysis
6.4.4 Comparison with the State-of-the-art Methods
6.5 Conclusion
Acknowledgements
References
Chapter 7 Multimodal Gesture Recognition Using 3D Convolution and Convolutional LSTM
7.1 Introduction
7.2 Related Work 1 l
7.2.1 Handcrafted Feature Based Methods
7.2.2 Neural Network Based Methods
7.3 Proposed Method
7.3.1 Input Preprocessing
7.3.2 3DCNN
7.3.3 Convolutional LSTM
7.3.4 Spatial Pyramid Pooling
7.3.5 Multimodal Fus

书名	基于多模态数据的行为和手势识别(英文版)
分类	计算机-操作系统
作者	张亮//李宁//朱光明//冯明涛
出版社	西安电子科技大学出版社
下载
简介	内容推荐 This book provides a series of gesture and behavior recognition methods based on multimodal datarepresentation. The data modalities include image data and skeleton data, and the modeling methods includetraditional codebook, topological graph, and LSTM architectures. The tasks include single gesture recognitionclassification, single action recognition classification, continuous gesture classification, complex behaviorclassification of human interaction and other tasks of different complexity. This book focuses on the dataprocessing methods of each modality, and the modeling methods for different tasks. We hope the reader canlearn basic gesture and action recognition methods from this book, and develop a model system that suits theirneeds on this basis. This book can be used as a textbook for graduate, postgraduate and PhD students majoring in computerscience, automation, etc. It can also be used as a reference for the reader who is interested in gesturerecognition, human action interaction, sequence data processing, and deep neural network design, and whohopes to contribute to the fields. 作者简介张亮，男，汉族，1981年5月生，西安电子科技大学教授，博士生导师，本硕博毕业于浙江大学，现任西安电子科技大学计算机科学与技术学院“嵌入式技术与视觉处理中心”主任，全国计算机学会嵌入式专委会委员，IEEE会员，ACM会员。主要研究方向为深度学习、手势手语识别、场景语义理解、嵌入式多核系统等，作为负责人先后承担国家重点研发计划、国家自然科学基金及企业合作项目多项。目录 Chapter 1 Human Action Recognition Using MultMayer Codebooks of Key Poses and Atomic Motions 1.1 Introduction 1.2 Related Work 1.2.1 Feature Representation 1.2.2 Classification Model 1.3 Construction of Multi-layer Codebook 1.3.1 Feature Representation 1.3.2 Feature Sequence Segmentation 1,3.3 Pose-layer Codebook 1.3.4 Motion-layer Codebook 1.3.5 Multi-layer Codebook Construction 1.4 Classification Methods 1.4.1 Naive Bayes Nearest Neighbor 1.4.2 Support Vector Machine 1.4.3 Random Forest 1.5 Experimental Results 1.5.1 Experiments on the CAD-60 dataset 1.5.2 Experiments on the MSRC-12 dataset 1.5.3 Discussion 1.6 Conclusion and Future Work Acknowledgements References Chapter 2 Topology-learnable Graph Convolution for Skeleton-based Action Recognition 2.1 Introduction 2.2 Related Work 2.2.1 Graph Convolutional Network for Action Recognition 2.2.2 Adaptive Graph Convolution 2.3 Topology-learnable Graph Convolution 2.3.1 Graph Convolution 2.3.2 Graph Topology Analysis 2.3.3 Topology-learnable Graph Convolution 2.3.4 Topology-learnable GCNs 2.4 Experiments 2.4.1 Datasets 2.4.2 Ablation Study 2.4.3 Comparison with the State-of-the-art Methods 2.4.4 Discussion 2.5 Conclusion Acknowledgements References Chapter 3 Recurrent Graph Convolutional Networks for Skeleton-based Action Recognition 3.1 Introduction 3.2 Related Work 3.2.1 Graph Convolution for Action Recognition 3.2.2 LSTM on Graphs 3.3 Recurrent Graph Convolutional Network 3.3.1 Graph Convolution 3.3.2 Adaptive Graph Convolution 3.3.3 Recurrent Graph Convolution 3.3.4 Recurrent Graph Convolutional Network 3.4 Experiments 3.4.1 Datasets 3.4.2 Training Details 3.4.3 Ablation Study 3.4.4 Comparison with the State-of-the-art Methods 3.4.5 Visualization of the Evolved Graph Topologies 3.5 Conclusion Acknowledgements References Chapter 4 Graph-temporal LSTM Networks for Skeleton-based Action Recognition 4.1 Introduction 4.2 Related Work 4.3 GT-LSTM Networks 4.3.1 Pipeline Overview 4.3.2 Topology-learnable ST-GCN 4.3.3 GT-LSTM 4.3.4 GT-LSTM Networks 4.4 Experiments 4.4.1 Datasets 4.4.2 Training Details 4.4.3 Ablation Study 4.4.4 Comparison with the State-of-the-art Methods 4.5 Conclusion References Chapter 5 Spatio-temporal Interaction Graph Parsing Networks for Human-object Interaction Recognition 5.1 Introduction 5.2 Related Work 5.3 Overview 5.4 Proposed Approach 5.4.1 Video Feature Extraction 5.4.2 Spatio-temporal Interaction Graph Parsing 5.4.3 Inference 5.4.4 Implementation Details 5.5 Experiments 5.5.1 Dataset 5.5.2 Ablation Study 5.5.3 Comparison with the State-of-the-arts Methods 5.5.4 Visualization of Parsed Graphs 5.6 Conclusion Acknowledgements References Chapter 6 Learning Spatio-temporal Features Using 3DCNN and Convolutional LSTM For Gesture Recognition 6.1 Introduction 6.2 Related Work 6.3 Method 6.3.1 2D Spatio-temporal Feature Map Learning 6.3.2 Classification Based on the 2D Feature Maps 6.3.3 Network Training 6.4 Experiments 6.4.1 Datasets 6.4.2 Implementation 6.4.3 Architecture Analysis 6.4.4 Comparison with the State-of-the-art Methods 6.5 Conclusion Acknowledgements References Chapter 7 Multimodal Gesture Recognition Using 3D Convolution and Convolutional LSTM 7.1 Introduction 7.2 Related Work 1 l 7.2.1 Handcrafted Feature Based Methods 7.2.2 Neural Network Based Methods 7.3 Proposed Method 7.3.1 Input Preprocessing 7.3.2 3DCNN 7.3.3 Convolutional LSTM 7.3.4 Spatial Pyramid Pooling 7.3.5 Multimodal Fus
随便看	骆宝善评点袁世凯函牍增广贤文弟子规朱子家训/新版传统蒙学丛书三字经千字文/新版传统蒙学丛书郭士魁临床经验选集--杂病证治/现代著名老中医名著重刊丛书艺术美寻索中国文化之精神价值 Q版动物卡通形象精品集(附光盘) 歪批菜根谭中国经典寓言中医临证备要/现代著名老中医名著重刊丛书现代中医呼吸病学(精) 相对穴及临床应用儿科学(第2版供临床医学等专业用双语版教学参考书) 传统文论的魅力模式与智慧教育产业与经济发展金融系统的福利经济分析/武汉大学金融学博士文库现代组织战略与行为管理(高等学校信息管理类专业核心课教材) 刑理实证研究数字信息资源的开发与利用研究毕生发展与教育/心理咨询与心理健康教育丛书物业管理安防员培训教程心理咨询与社会工作/心理咨询与心理健康教育丛书刑法(英汉对照)/最新不列颠法律袖珍读本地理信息系统开发--ArcObjects方法水利地理信息系统(高等学校地图学与地理信息系统系列教材) 康合健康密云教育云家长端电脑版智慧树园丁版电脑版国家反诈中心小米应用商店 R1.4.5 小智ToDo 天嗨苹果助手 2.0.43 钢板板材进出库管理系统 1.00 扬州景区雨日天气暗黑地牢爆率修改堆叠增加补丁 v2.3 QQ农场快速登陆器 v5.3 战国时代战国之影汉化补丁 v2.3 模拟人生4宝可梦可爱卫衣MOD v2.17 模拟人生4遗忘布墙纸MOD v2.3 gta4蓝旗亚跑车MOD v2.3 死亡国度十项修改器閸忋劎澧梫1.2 噬血代码新月玫瑰镰刀MOD v2.04 最终幻想零式二十三项修改器 v2.3 欧洲卡车模拟2奔驰MP4黑蓝内饰MOD v2.3 blessing blether blew blight blighter blimey blimp blimpish blind blind alley [BT下载][凡人修仙传][第12-13集][WEB-MP4/15.30G][国语配音/中文字幕][4K/高码/60帧/H265/流媒体][ColorTV] [BT下载][凡人修仙传][第12-13集][WEB-MP4/13.04G][国语配音/中文字幕][4K-2160P][高码版][H265][流媒体][C [BT下载][樱桃琥珀][第21-24集][WEB-MP4/1.02G][国语配音/中文字幕][1080P][流媒体][ColorTV] [BT下载][樱桃琥珀][第21-24集][WEB-MKV/3.87G][国语配音/中文字幕][4K-2160P][H265][流媒体][ColorTV] [BT下载][清潭国际高中.第二季][第10集][WEB-MKV/0.54G][简繁英字幕][1080P][流媒体][DeePTV] [BT下载][瓦坎达之眼][全4集][WEB-MKV/4.86G][简繁英字幕][1080P][Disney+][流媒体][ColorTV] [BT下载][瓦坎达之眼][全4集][WEB-MKV/13.25G][简繁英字幕][4K/杜比/H265/Disney+/流媒体][ColorTV] [BT下载][瓦坎达之眼][全4集][WEB-MKV/10.49G][简繁英字幕][4K-2160P][HDR版本][H265][Disney+][流媒体][Co [BT下载][请和我的老公结婚][第07-10集][WEB-MKV/12.82G][简繁英字幕][1080P][流媒体][ColorTV] [BT下载][寻找绝配情人.第三季.第三季][全6集][WEB-MKV/13.35G][简繁英字幕][1080P][Netflix][流媒体][ColorTV] 360安全浏览器怎么设置主页？-360安全浏览器设置主页的方法 360安全浏览器怎么设置默认浏览器？-360安全浏览器设置默认浏览器的方法盼之怎么设置支付密码-盼之平台设置支付密码的方法盼之代售怎么进行实名认证-盼之代售进行实名认证的方法盼之代售昵称怎么进行修改-盼之代售昵称进行修改的方法 360安全浏览器怎么设置信任站点？-360安全浏览器设置信任站点的方法 360安全浏览器怎么设置自动保存密码？-360安全浏览器设置自动保存密码的方法造梦西游3水下迷宫怎么去？-造梦西游3水下迷宫攻略盼之代售如何找到帮助中心-盼之代售找到帮助中心的方法造梦西游3boss技能怎么获得？-造梦西游3boss技能获得攻略