Updated: July 15, 2020 (February 22, 2016)

  Charts & Illustrations

PolyBase Accesses Azure Blob Storage

My Atlas / Charts & Illustrations

223 wordsTime to read: 2 min
Andrew Snodgrass by
Andrew Snodgrass

Andrew analyzes and writes about Microsoft's data management, business intelligence, and machine learning solutions, as well as aspects of licensing... more

The PolyBase technology in SQL Server 2016 accesses unstructured data stored in Azure Blob Storage. The illustration shows how a SQL Server 2016 cluster (top) can use PolyBase to process T-SQL queries to unstructured data in Blob Storage (bottom). The PolyBase engine runs on the head node of a PolyBase Group of servers, where it coordinates queries that are distributed to compute nodes.

Blob Storage data schemas must be created by administrators (left) who use new SQL commands to create table schemas that represent the data source. The schemas are saved as database-level external tables that users can query with T-SQL (upper left). This limits the use of Blob Storage to data with known formats, such as historical data stored in delimited text files.

Querying Blob Storage data starts with the head node, which parses queries, generates query plans, and distributes the workload to the compute nodes (center). Compute nodes access the Blob Storage data (bottom) directly and manage the movement of data. Results can also be joined with traditional SQL Server relational tables (top right) in real time.

Atlas Members have full access

Get access to this and thousands of other unbiased analyses, roadmaps, decision kits, infographics, reference guides, and more, all included with membership. Comprehensive access to the most in-depth and unbiased expertise for Microsoft enterprise decision-making is waiting.

Membership Options

Already have an account? Login Now