Updated: July 15, 2020 (July 13, 2015)
Analyst Report
High-Performance Hosted Hadoop at Massive Scale
Azure Data Lake is a scalable managed Hadoop storage service that can ingest and process large amounts of unstructured data in their native formats at high rates. Compared to Azure HDInsight, Microsoft’s existing Hadoop service, Data Lake provides better query performance and access to data without file or account size limitations. The service will be a viable alternative to on-premises Hadoop deployments, especially for organizations that want to test Big Data analytics without substantial hardware investment. Data Lake is currently in private preview, with a public preview expected in late 2015.
Finding Value in Big Data
A “data lake” is a storage repository (frequently based on Hadoop) whose purpose is to hold vast amounts of unprocessed data (often called Big Data) in its native format until it is needed. For example, data lakes are used to store Web server logs, streaming sensor data, and social media posts. The data are typically unstructured, and their content and format change unpredictably over time, which complicates analysis with traditional, structured database solutions.
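The idea of holding data in native format and interpreting it only when needed can be illustrated with a minimal sketch. It does not use Azure Data Lake or Hadoop; the sample records, field names, and helper functions (`read_records`, `status_counts`, `observed_fields`) are hypothetical and chosen only to show records of differing shape being stored untouched and given structure at query time.

```python
import json
from collections import Counter

# Hypothetical raw records, kept exactly as they arrived (no schema enforced).
RAW_LINES = [
    '{"ts": "2015-07-13T10:00:00Z", "path": "/index.html", "status": 200}',
    '{"ts": "2015-07-13T10:00:01Z", "sensor_id": "s-17", "temp_c": 21.4}',
    '{"ts": "2015-07-13T10:00:02Z", "path": "/login", "status": 500, "user_agent": "curl"}',
    'not valid json',
]


def read_records(lines):
    """Parse each line at read time; skip lines that are not JSON objects."""
    for line in lines:
        try:
            record = json.loads(line)
        except json.JSONDecodeError:
            continue
        if isinstance(record, dict):
            yield record


def status_counts(lines):
    """Count HTTP status codes, using only records that carry a 'status' field."""
    return Counter(r["status"] for r in read_records(lines) if "status" in r)


def observed_fields(lines):
    """List every field name seen so far; the set grows as formats change."""
    return sorted({key for r in read_records(lines) for key in r})


if __name__ == "__main__":
    print(status_counts(RAW_LINES))
    print(observed_fields(RAW_LINES))
```

Each query decides which fields it cares about and ignores records that lack them, so a new record format can appear without any change to what is already stored. A structured database, by contrast, would need its table definitions updated before the new records could be loaded.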