Microsoft preps Azure data lake flood gates for readiness

First announced at Microsoft BUILD in April and then reassessed at Ignite in May, Azure Data Lake is a way to combine every type of data collected into a single place, regardless of size, structure, or platform, while supporting massively parallel queries. From now on, the tool will be called Azure Data Lake Store. At the core of the new service is the new U-SQL query language. U-SQL is built on the learnings from Microsoft’s internal experience with SCOPE and existing languages such as T-SQL, ANSI SQL, and Hive.

Canonical and Microsoft confirmed in a joint announcement that the Hadoop-based big data service HDInsight will run on Ubuntu and the Hortonworks Data Platform.

HDInsight on Linux allows for broader support for Hadoop ecosystem partners, which means more tools and applications for running Hadoop workloads.

Both companies are committed to meeting the needs of their customers, especially in a world where the industry moves at warp speed to adopt cloud architectures and analytics for performance and scale.

The firm billed Azure Data Lake Store as HDFS for the cloud, capable of chomping petabyte-sized files, and said it would be “enterprise ready”.

Dataguise’s approach to Hadoop security focuses on detecting, auditing and monitoring sensitive corporate data in real time. “Today, more than 20 percent of virtual machines on Azure are Linux and VM Depot has more than 1,000 Linux images.” Together the three companies have made it possible to run HDInsight on Ubuntu on Microsoft’s Azure cloud. Microsoft says the service is now generally available with a 99.9 per cent uptime service level agreement.

Microsoft’s data lake – a phrase now with huge currency among the big-data analytics and infrastructure providers – will feature analytics as a service.

As well as Hadoop clusters, you can create HBase and Storm clusters on Linux for NoSQL and real-time processing requirements, which is useful for building internet of things (IoT) applications. This gives information officers even greater flexibility and choice in what tools they want to use alongside HDInsight.

“With the processing and storage of structured, semi-structured, and unstructured data in Azure HDInsight repositories, it is critical that all sensitive data be identified, protected and monitored to ensure adherence to compliance mandates”, said JT Sison, VP, Marketing for Dataguise. This service will be available in preview later this year and includes U-SQL, a language that unifies the benefits of SQL with the expressive power of user code.