SQL Server 2016 – Analytic Revolution with R
Feb 12,
2016
Part 3 – Implementing a Data Science Workstation SQL Server Architecture for R and Data Science We have previously seen how to integrate R with SQL Server. While executing R scripts in stored procedures is tremendously valuable, the methods lacks the interactivity required by most analysts. To appreciate how SQL Server’s R services can provide […]
SQL Server 2016: Real-Time Operational Analytics
Feb 10,
2016
SQL Server 2016 introduces updatable nonclustered columnstore indexes. The complexity of that phrase should not mask the simplicity and importance of its application. Updatable nonclustered columnstore indexes make possible what Microsoft calls “Real-Time Operational Analytics”. It would be hard to overstate the importance of making real-time transactional data available immediately for data mining and advanced […]
SQL Server 2016 – Analytic Revolution with R
Feb 8,
2016
Part 1 – Architecture and Installation The folks at Microsoft have created quite a buzz with the integration of R scripting into SQL Server. In recent years, the open-source statistical program R has established itself as the de-facto statistics and analysis package for big data. But it has always been possible to query SQL Server […]
Polybase: Big (Data) Things in SQL Server 2016 – Part 2
Feb 5,
2016
Part Two: Transact-SQL Queries of Hadoop Data Three tasks are necessary if we are to query Hadoop data with Polybase. We must define the connection information, we must define how the semi-structured Hadoop data is to be parsed, and we must specify the row-and-column format for the parsed data that we will use to write […]
Polybase: Big (Data) Things in SQL Server 2016 – Part 1
Feb 4,
2016
Installation and Configuration of SQL Server 2016 Polybase Introduction Among SQL Server’s many new features introduced in 2016 is Polybase, a bridge between relational data stored in SQL Server and bulk data stored in the Hadoop ecosystem. Hadoop is an excellent store for what many folks call “semistructured” data. That is, data which cannot be […]