Real-time scheduling in MapReduce clusters

Chen He, Ying Lu, David Swanson

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution

8 Scopus citations

Abstract

MapReduce has been widely used as a Big Data processing platform. As it grows in popularity, its scheduling becomes increasingly important. In particular, since many MapReduce applications require real-time data processing, scheduling real-time applications in MapReduce environments has become a significant problem. In this paper, we create a novel real-time scheduler for MapReduce, which overcomes the deficiencies of an existing scheduler. It avoids accepting jobs that will lead to deadline misses and improves cluster utilization. We implement our scheduler in the Hadoop system, and experimental results show that it provides deadline guarantees for accepted jobs and achieves good cluster utilization.

Original language: English (US)
Title of host publication: Proceedings - 2013 IEEE International Conference on High Performance Computing and Communications, HPCC 2013 and 2013 IEEE International Conference on Embedded and Ubiquitous Computing, EUC 2013
Publisher: IEEE Computer Society
Pages: 1536-1544
Number of pages: 9
ISBN (Print): 9780769550886
State: Published - 2014
Event: 15th IEEE International Conference on High Performance Computing and Communications, HPCC 2013 and 11th IEEE/IFIP International Conference on Embedded and Ubiquitous Computing, EUC 2013 - Zhangjiajie, Hunan, China
Duration: Nov 13 2013 to Nov 15 2013

Publication series

Name: Proceedings - 2013 IEEE International Conference on High Performance Computing and Communications, HPCC 2013 and 2013 IEEE International Conference on Embedded and Ubiquitous Computing, EUC 2013

Conference

Conference: 15th IEEE International Conference on High Performance Computing and Communications, HPCC 2013 and 11th IEEE/IFIP International Conference on Embedded and Ubiquitous Computing, EUC 2013
Country/Territory: China
City: Zhangjiajie, Hunan
Period: 11/13/13 to 11/15/13

Keywords

  • MapReduce
  • cluster utilization
  • real-time scheduling

ASJC Scopus subject areas

  • Software
