内容推荐 高性能分布式SQL查询引擎Presto可用于对不同的数据源执行快速的交互式分析。这本实用指南将带你学习如何对Hive、Cassandra、关系数据库或专有数据存储中的数据进行分析。分析师、软件工程师、生产工程师可以从中学习Presto的管理、使用甚至开发。 最初由Facebook开发的开源软件Presto如今已被Netflix、Airbnb、LinkedIn、Twitter、Uber和许多其他公司使用。Matt Fuller、Manfred Moser、Martin Traverso向你展示了单个Presto查询如何组合来自多个源头的数据,从而对整个组织进行分析。 作者简介 马丁·特拉韦尔索(Martin Traverso)是Presto软件基金会的联合创始人和Starburst的首席技术官。在创立Starburst之前,马丁是Facebook的软件工程师。在Facebook时,他注意到对快速交互式SQL分析的需求,并和其他三位工程师一起创造了Presto,之后带领Presto开发团队于2013年将其投入生产环境。同年秋天,Presto开放源代码,从此在Facebook公司内外得到广泛采用。 在加入Facebook之前,马丁在Proofpoint和Ning担任架构师,在那里他主导了无数复杂的企业应用程序和社交网络应用程序的开发与架构设计。 目录 Foreword. Preface. Part I. Getting Started with Presto 1. Introducing Presto.. The Problems with Big Data Presto to the Rescue Designed for Performance and Scale SQL-on-Anything Separation of Data Storage and Query Compute Resources Presto Use Cases One SQL Analytics Access Point Access Point to Data Warehouse and Source Systems Provide SQL-Based Access to Anything Federated Queries Semantic Layer for a Virtual Data Warehouse Data Lake Query Engine SQL Conversions and ETL Better Insights Due to Faster Response Times Big Data, Machine Learning, and Artificial Intelligence Other Use Cases Presto Resources Website Documentation Community Chat Source Code, License, and Version Contributing Book Repository Iris Data Set Flight Data Set A Brief History of Presto Conclusion 2. Installing and Configuring Presto. Trying Presto with the Docker Container Installing from Archive File Java Virtual Machine Python Installation Configuration Adding a Data Source Running Presto Conclusion 3. Using Presto.. Presto Command-Line Interface Getting Started Pagination History Additional Diagnostics Executing Queries Output Formats Ignoring Errors Presto JDBC Driver Downloading and Registering the Driver Establishing a Connection to Presto Presto and ODBC Client Libraries Presto Web UI SQL with Presto Concepts First Examples Conclusion Part II. Diving Deeper into presto Part III. Presto in Real-World Uses Index |