Run queries and analysis on big data clusters across relational and non relational databases
â— Connect to Hadoop, Azure, Spark, Oracle, Teradata, Cassandra, MongoDB, CosmosDB, MySQL, PostgreSQL, MariaDB, and SAP HANA.
â— Numerous techniques on how to query data and troubleshoot Polybase for better data analytics.
â— Exclusive coverage on Azure Synapse Analytics and building Big Data clusters.
This book brings exciting coverage on establishing and managing data virtualization using polybase. This book teaches how to configure polybase on almost all relational and nonrelational databases. You will learn to set up the test environment for any tool or software instantly without hassle. You will practice how to design and build some of the high performing data warehousing solutions and that too in a few minutes of time.
You will almost become an expert in connecting to all databases including hadoop, cassandra, MySQL, PostgreSQL, MariaDB and Oracle database. This book also brings exclusive coverage on how to build data clusters on Azure and using Azure Synapse Analytics. By the end of this book, you just don't administer the polybase for managing big data clusters but rather you learn to optimize and boost the performance for enabling data analytics and ease of data accessibility.
What you will learn
â— Learn to configure Polybase and process Transact SQL queries with ease.
â— Create a Docker container with SQL Server 2019 on Windows and Polybase.
â— Establish SQL Server instance with any other software or tool using Polybase
â— Connect with Cassandra, MongoDB, MySQL, PostgreSQL, MariaDB, and IBM DB2.
Who this book is for
This book is for database developers and administrators familiar with the SQL language and command prompt. Managers and decision-makers will also find this book useful. No prior knowledge of any other technology or language is required.
Table of Contents
1. What is Data Virtualization (Polybase)
2. History of Polybase
3. Polybase current state
4. Differences with other technologies
7. SQL Server
8. Hadoop Cloudera and Hortonworks
9. Windows Azure Storage Blob
11. From Azure Synapse Analytics
12. From Big Data Clusters
21. SAP HANA
22. IBM DB2