Share

Ubuntu debuts in Microsoft Azure as part of HDInsight analytics

Maybe, it will be this: Microsoft has selected Ubuntu for its first Linux-based software-as-a service offering, HDInsight, Microsoft’s big data Hadoop-on-Azure service.

Advertisement

When Azure Data Lake was first revealed, it was plain Microsoft was set on making Azure a welcoming environment for enterprise big data applications.

In addition, the Azure Data Lake suite is gaining an analytics service based on Apache YARN called Azure Data Lake Analytics. In the store, data can be securely shared for collaboration and is accessible for processing and analytics from HDFS applications and tools.

HDInsight was developed by Microsoft working with Ubuntu-shop Canonical and Hadoop spinner Hortonworks, Microsoft and Canonical said.

The above and more will be covered at Microsoft’s AzureCon 2015. Millions of SQL and.NET developers can now process and analyze all of their data with the skills they already have. The tool is built on Apache YARN.

Back in February, when HDInsight opened for preview, Azure’s director of program management Corey Sanders wrote: “Customers are asking for the scale-out power of Azure and easy-to-use Azure services across many different development and management ecosystems”. It uses a new Microsoft-built query language that draws from both the widely used SQL query language and Microsoft’s C# programming language. That can help Microsoft Azure distinguish itself from other public cloud providers with data services, including market leader Amazon Web Services, as well as Google Cloud Platform and IBM SoftLayer. Code-based solutions offer great power, but require significant investments to master, while SQL-based tools make it easy to get started but are hard to extend.

HDInsight is capable of running a wide variety of open source analytics engines, including Hive, Spark, HBase and Storm.

Both the Azure Data Lake Store and Azure Data Lake Analytics will be available in preview later this year, Microsoft reports. The U-SQL support in Azure Data Lake Tools for Visual Studio includes state of the art support for authoring, debugging and advanced performance analysis features for increased productivity when optimizing jobs running across thousands of nodes.

Advertisement

There you have it. In 2015, Microsoft has recognized that its users wants a choice between Linux and Windows applications even on its own cloud. Managed clusters on Linux are now generally available, Microsoft reports.

Azure Data Lake Picks Up Some Improvements Prior to AzureCon