Updated: July 15, 2020 (September 12, 2016)

  Sidebar

HDInsight Compared to Azure Data Lake

My Atlas / Sidebar

267 wordsTime to read: 2 min
Andrew Snodgrass by
Andrew Snodgrass

Andrew analyzes and writes about Microsoft's data management, business intelligence, and machine learning solutions, as well as aspects of licensing... more

Two Microsoft hosted services, Azure Data Lake and Azure HDInsight, are both based on Apache’s Hadoop software framework, a popular solution for Big Data storage and analysis. They provide many of the same features and both separate storage and computing resources to allow customers to maintain a data repository at a steady rate and deploy computing resources as needed to meet query demand. While they deliver many of the same capabilities, the services are designed for different workloads.

HDInsight is a multipurpose Hadoop service that provides multiple deployment configurations optimized for a range of workloads, such as general data mining, processing semi-structured data, or high-speed analysis of streaming data. It uses lower-cost storage and allows customers to select virtual machine sizes to balance performance requirements and cost restrictions. However, the service accepts data only from a limited set of products, and queries must go through the HDInsight management layer, which requires deployment of an HDInsight cluster.

Atlas Members have full access

Get access to this and thousands of other unbiased analyses, roadmaps, decision kits, infographics, reference guides, and more, all included with membership. Comprehensive access to the most in-depth and unbiased expertise for Microsoft enterprise decision-making is waiting.

Membership Options

Already have an account? Login Now