A key feature of all distributed storage systems is their ability to replicate data not just across machines within a … More
Tag: distributed database
Split Query Processing in Polybase
Abstract This paper presents Polybase, a feature of SQL Server PDW V2 that allows users to manage and query data … More
F1: A Distributed SQL Database That Scales
With both the F1 and Spanner papers out its now possible to understand their interplay a bit holistically. So lets … More
Avatara: OLAP for Web-scale Analytics Products
The highlight of this system is a clear separation of the cube computation engine and the query serving engine of … More
An Efï¬cient Multi-Tier Tablet Server Storage Architecture
This work presents a new, highly scalable, and efficient TSSL architecture called the General Tablet Server Storage Layer or GTSSL. … More
HAcid: A lightweight transaction system for HBase
HAcid is a client library that applications can use for operating multi-row transactions in HBase. Seems to be motivated by … More
Spanner : Google's globally distributed database
Spanner is Google’s scalable, multi-version, globallydistributed, and synchronously-replicated database. It is the ï¬rst system to distribute data at global scale … More
MinuteSort with Flat Datacenter Storage
Its been a couple of days since Microsoft has been in the news as the one to beat the previous … More
PNUTS – Yahoo!’s Hosted Data Serving Platform
PNUTS is a geographically distributed database developed and in use at Yahoo!. PNUTS works off a simple relational model in … More
Hive – A Warehousing Solution Over a Map-Reduce Framework
The size of data sets being collected and analyzed in the industry for business intelligence is growing rapidly, making traditional … More