Updated: July 15, 2020 (June 2, 2016)

PolyBase Unstructured Data Storage Options

By Andrew Snodgrass

Andrew analyzes and writes about Microsoft's data management, business intelligence, and machine learning solutions, as well as aspects of licensing...

SQL Server 2016 includes PolyBase, a query feature that lets T-SQL statements access unstructured data stored in Hadoop, Microsoft’s Hadoop-based HDInsight service, and Azure Blob Storage.

Hadoop

Hadoop is a popular Apache open-source software framework that distributes processing and storage of large unstructured data sets across clusters of commodity computers. The framework provides fast processing with parallel computing, high reliability with automatic failover, and lower cost than high-performance servers and storage.

Hadoop is often used to compile statistics from massive Web server logs for ad targeting and to extract consumer trends. Such data are typically unstructured with significant variety and unpredictable changes in content and structure over time, which makes analyzing the data with traditional database solutions complex.
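The log-statistics use case can be illustrated with a minimal sketch of the map, shuffle, and reduce pattern that Hadoop jobs typically follow. This is plain single-process Python, not Hadoop code: the sample log lines, the simplified log layout, and the function names (`map_line`, `shuffle`, `reduce_counts`, `count_hits`) are all assumptions made for illustration. In a real cluster, the map and reduce steps would run in parallel across many machines.

```python
from collections import defaultdict

# Hypothetical sample lines in a simplified access-log layout (assumed format).
LOG_LINES = [
    '10.0.0.1 - - [02/Jun/2016:10:00:00 +0000] "GET /home HTTP/1.1" 200 512',
    '10.0.0.2 - - [02/Jun/2016:10:00:01 +0000] "GET /products HTTP/1.1" 200 2048',
    '10.0.0.1 - - [02/Jun/2016:10:00:02 +0000] "GET /home HTTP/1.1" 200 512',
    'malformed line without a request',
]


def map_line(line):
    """Map step: emit (path, 1) for a parsable request line, else nothing."""
    parts = line.split('"')
    if len(parts) < 3:
        return []  # no quoted request section; skip the line
    request = parts[1].split()
    if len(request) != 3:
        return []  # expected "METHOD PATH PROTOCOL"
    return [(request[1], 1)]


def shuffle(pairs):
    """Shuffle step: group emitted values by key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_counts(groups):
    """Reduce step: sum the values collected for each key."""
    return {key: sum(values) for key, values in groups.items()}


def count_hits(lines):
    """Run map, shuffle, and reduce in sequence over the input lines."""
    pairs = [pair for line in lines for pair in map_line(line)]
    return reduce_counts(shuffle(pairs))


if __name__ == "__main__":
    print(count_hits(LOG_LINES))
```

Skipping lines that do not parse, rather than failing, reflects the point above that such data varies in content and structure over time.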

A common barrier to adopting Hadoop is the specialized skill needed to build and manage a cluster and to write data processing jobs for it. Supported Hadoop distributions include those from Hortonworks and Cloudera, both Microsoft partners.
