Index
Reference: CS744 (UW-Madison) and CS 494 (UIC)
The Datacenter as a Computer: An Introduction to the Design of Warehouse-Scale Machines, L.A. Barroso, U. Hölzle, Synthesis Lectures on Computer Architecture, 2009. Chapters 1 and 2.
VL2: A Scalable and Flexible Data Center Network, Greenberg et al., SIGCOMM 2009.
Jupiter Rising: A Decade of Clos Topologies and Centralized Control in Google’s Datacenter Network, Singh et al., SIGCOMM 2015.
The Hadoop Distributed File System, Shvachko et al, MSST, 2010.
The Google File System, Ghemawat et al, SOSP, 2003.
Nightingale et al, OSDI, 2012.
Rashmi et al, OSDI, 2016.
Muralidhar et al, OSDI, 2014.
Chang et al, OSDI, 2006.
DeCandia et al, SOSP, 2007.
Corbett et al, OSDI, 2012.
Huang et al, SOSP, 2013.
Nishtala et al, NSDI, 2013.
Mike Burrows, OSDI, 2006.
Dryad: Distributed Data-Parallel Programs from Sequential Building Blocks. Isard et al, EuroSys, 2007.
CIEL: a universal execution engine for distributed data-flow computing. Murray et al, NSDI, 2011.
Ananthanarayanan et al, OSDI, 2010.
Yu et al, OSDI, 2008.
Goetz Graefe, SIGMOD, 1990.
Ananthanarayanan et al, NSDI, 2012.
Zaharia et al, NSDI, 2012.
Saha et al, SIGMOD, 2015.
Essertel et al, OSDI, 2018.
Transaction: Crooks et al, OSDI, 2018.
Load balancing
Patel et al, SIGCOMM, 2013.
Gandhi et al, SIGCOMM, 2014.
Apache Hadoop YARN: Yet Another Resource Negotiator, Vavilapalli et al, SoCC, 2013.
Mesos: A Platform for Fine-Grained Resource Sharing in the Data Center, Hindman et al, NSDI, 2011.
Dominant Resource Fairness: Fair Allocation of Multiple Resource Types, Ghodsi et al, NSDI, 2011.
Verma et al, EuroSys, 2015.
Packing
Altruistic Scheduling in Multi-Resource Clusters. Grandl et al, OSDI, 2016.
Multi-Resource Packing for Cluster Schedulers, Grandl et al, SIGCOMM, 2014.
Isard et al, SOSP, 2009.
Re-Planning
Mahajan et al, OSDI, 2018.
Threads
Qin et al, OSDI, 2018.
Cache
Berger et al, OSDI, 2018.
Scaling Distributed Machine Learning with the Parameter Server, Li et al, OSDI, 2014.
STRADS: A Distributed Framework for Scheduled Model Parallel Machine Learning, Kim et al, EuroSys, 2016.
SLAQ: Quality-Driven Scheduling for Distributed Machine Learning, Zhang et al, SoCC, 2017.
Abadi et al, OSDI, 2016.
Shen et al, VLDB, 2020.
Xiao et al, OSDI, 2018.
Crankshaw et al, NSDI, 2017.
Narayanan et al, SOSP, 2019.
Chen et al, OSDI, 2018.
Moritz et al, OSDI, 2018.
Feng et al, SIGMOD, 2012.
Zhang et al, USENIX ATC, 2018.
Lee et al, OSDI, 2018.
Hazelwood et al, HPCA, 2018.
Chen et al, Neural Information Processing Systems, Workshop on Machine Learning Systems, 2015.
Low et al, VLDB, 2012.
Peng et al, EuroSys, 2018.
Gu et al, NSDI, 2019.
Jeong et al, 2018.
Sparks et al, ICDE, 2017.
Chilimbi et al, OSDI, 2014.
Zhang and Re, VLDB, 2014.
Spark SQL: Relational Data Processing in Spark, Armbrust et al, SIGMOD, 2015.
Major technical advancements in Apache Hive, Huai et al, SIGMOD, 2014.
Clarinet: WAN-Aware Optimization for Analytics Queries, Viswanathan et al, OSDI, 2016.
Vulimiri et al, NSDI, 2015.
Chaiken et al, VLDB
Dageville et al, SIGMOD, 2016.
Vuppalapati et al, NSDI, 2020.
Kornacker et al, CIDR, 2015.
Melnik et al, VLDB, 2010.
Chandramouli et al, VLDB, 2014.
Polychroniou et al, SIGMOD, 2015.
Balkesen et al, VLDB, 2013.
Madden et al, OSDI, 2002.
Storm @Twitter, Toshniwal et al, SIGMOD, 2014.
Twitter Heron: Stream Processing at Scale, Kulkarni et al, SIGMOD, 2015.
Realtime Data Processing at Facebook. Chen et al, SIGMOD, 2016.
Zaharia et al, SOSP, 2013.
Reading:
Carbone et al, Bulletin of the IEEE Computer Society Technical Committee on Data Engineering, 2015.
Kreps et al, NetDB Workshop, 2011.
Also this document of comparison of widely used .
Lin et al, NSDI, 2016.
Venkataraman et al, SOSP, 2017.
Mai et al, PVLDB, 2018.
Rajadurai et al, ASPLOS, 2018.
Abadi et al, VLDB, 2003.
Kalavri et al, OSDI, 2018.
Murray et al, SOSP, 2013.
Akidau et al, VLDB, 2015.
Pregel: A System for Large-Scale Graph Processing, Malewicz et al, SIGMOD, 2010.
TAO: Facebook’s Distributed Data Store for the Social Graph. Bronson et al, USENIX ATC, 2013.
PowerGraph: Distributed Graph-Parallel Computation on Natural Graphs, Gonzalez et al, OSDI, 2012.
Gonzalez et al, OSDI, 2014.
Lerer et al, Proceedings of the 2nd SysML Conference, 2019.
McSherry et al, HOTOS 2015.
Teixeira et al, SOSP, 2015.
Shi et al, OSDI, 2016.
Iyer et al, OSDI, 2018.
Nelson et al, USENIX ATC, 2015.
Ching et al, VLDB, 2015.
Runtime
Weld: A Common Runtime for High Performance Data Analytics, Palkar et al, CIDR, 2017.
Hardware
Jouppi et al, ISCA, 2017.
Putnam et al, ISCA, 2014.
Kwon et al, SOSP, 2017.
Serverless
Jonas et al, SoCC, 2017.
Hendrickson et al, HotCloud, 2016.
Approximation
Agarwal et al, EuroSys, 2013.
Park et al, SIGMOD, 2019.
Other: RDMA
Dragojevic et al, NSDI, 2014.
Dragojevic et al, SOSP, 2015.
Other: Offload
Phothilimthana et al, OSDI, 2018.
Liu et al, 2018.
Peeking Behind the Curtains of Serverless Platforms, Wang et al, USENIX ATC, 2018.
SOCK: Rapid Task Provisioning with Serverless-Optimized Containers, Oakes et al, USENIX ATC, 2018.
Datacenter RPCs can be General and Fast. Kalia et al, NSDI, 2019.
Distributed Lock Management with RDMA: Decentralization without Starvation. Yoon et al, SIGMOD, 2018.
Efficient Memory Disaggregation with Infiniswap. Gu et al, NSDI, 2017.
Accelerating Relational Databases by Leveraging Remote Memory and RDMA. Li et al, SIGMOD, 2016.
Remote Memory in the Age of Fast Networks. Aguilera et al, SoCC, 2017.