In Microsoft's Big Data solutions, data management is the bottom and most fundamental layer.
This flexible data management layer supports all data types: structured, semi-structured, and unstructured data, whether static or dynamic.
There are three main products included in the data management layer: SQL Server, SQL Server Parallel Data Warehouse, and Hadoop on Windows.
Microsoft offers different solutions for different data types.
Specifically, structured data can be processed using SQL Server and SQL Server Parallel Data Warehouse.
Unstructured data can be processed using Hadoop-based distributions on Windows Azure and Windows Server, and streaming data can be managed using SQL Server StreamInsight, which provides near real-time analytics.
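The mapping described above can be written down as a small lookup table. This is only an illustrative sketch: the names `PRODUCTS_BY_DATA_TYPE` and `products_for` are hypothetical and are not part of any Microsoft API, and the table contains only the three pairings stated in the text.

```python
# Hypothetical lookup table: data category -> products named in the text.
PRODUCTS_BY_DATA_TYPE = {
    "structured": ["SQL Server", "SQL Server Parallel Data Warehouse"],
    "unstructured": ["Hadoop on Windows Azure", "Hadoop on Windows Server"],
    "streaming": ["SQL Server StreamInsight"],
}


def products_for(data_type):
    """Return the products the text associates with a data category."""
    key = data_type.strip().lower()
    if key not in PRODUCTS_BY_DATA_TYPE:
        raise ValueError(f"no product mapping given for {data_type!r}")
    return PRODUCTS_BY_DATA_TYPE[key]


streaming_products = products_for("Streaming")
```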
1. SQL Server. SQL Server 2012, released last year, brought many improvements for big data. The most important of these is full support for Hadoop, which is one of the most significant differences between SQL Server 2012 and SQL Server 2008. SQL Server 2014, due for official release at the end of this year, targets big data further by adding in-memory database features that accelerate data processing at the hardware level, which is also seen as an improvement for big data.
2. SQL Server Parallel Data Warehouse. The Parallel Data Warehouse appliance (PDW) is a product first introduced with SQL Server 2008 R2. It has become Microsoft's main data warehouse product, and a new all-in-one parallel data warehouse appliance based on SQL Server 2012 will be released this year. SQL Server Parallel Data Warehouse adopts a massively parallel processing (MPP) architecture, which is fundamentally different from the traditional standalone SQL Server. It combines multiple advanced data storage and processing technologies into one product and is an important part of Microsoft's big data strategy.
3. Hadoop on Windows. Microsoft provides Hadoop on both the Windows Azure platform and Windows Server, combining the high performance and scalability of Hadoop with the traditional strengths of Microsoft's products, ease of use and ease of deployment, to form a complete Big Data solution. Microsoft's Big Data solutions also bring the ease of use and manageability of Windows to Hadoop through simple deployment and integration with components such as Active Directory and System Center. With Hadoop-based services on Windows Azure, Microsoft gives its big data solutions flexibility in the cloud.
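To make the massively parallel processing (MPP) idea from item 2 more concrete, the following is a minimal sketch of the general technique: rows are spread across nodes by hashing a distribution key, each node aggregates its own rows, and a coordinator merges the partial results. It is not PDW's actual implementation. All function names and the sample data are hypothetical, and the "nodes" are plain Python lists processed one after another rather than separate machines.

```python
import zlib
from collections import defaultdict


def distribute(rows, key, node_count):
    """Assign each row to a node by hashing its distribution key."""
    nodes = [[] for _ in range(node_count)]
    for row in rows:
        # crc32 is used because it is stable across runs, unlike hash() on str.
        h = zlib.crc32(str(row[key]).encode("utf-8"))
        nodes[h % node_count].append(row)
    return nodes


def local_sum(rows, group_key, value_key):
    """Work done independently on one node: sum values per group."""
    totals = defaultdict(int)
    for row in rows:
        totals[row[group_key]] += row[value_key]
    return dict(totals)


def merge(partials):
    """Coordinator step: combine the per-node partial sums."""
    merged = defaultdict(int)
    for partial in partials:
        for group, value in partial.items():
            merged[group] += value
    return dict(merged)


def mpp_sum(rows, group_key, value_key, node_count=4):
    """Distribute the rows, aggregate per node, then merge the results."""
    nodes = distribute(rows, group_key, node_count)
    return merge(local_sum(node, group_key, value_key) for node in nodes)


# Hypothetical sample data.
sales = [
    {"region": "east", "amount": 10},
    {"region": "west", "amount": 5},
    {"region": "east", "amount": 7},
    {"region": "north", "amount": 3},
]
totals = mpp_sum(sales, "region", "amount")
```

Because the distribution key and the grouping key are the same here, every row for a given region lands on the same node. The merge step is still needed in general, since a query may group by a column other than the one the data was distributed on.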