Title page for ETD etd-11132008-144951


Type of Document Master's Thesis
Author Balman, Mehmet
URN etd-11132008-144951
Title Failure-Awareness and Dynamic Adaptation in Data Scheduling
Degree Master of Science in Systems Science (M.S.S.S.)
Department Computer Science
Advisory Committee
Advisor Name Title
Tevfik Kosar Committee Chair
Gabrielle Allen Committee Member
James R. Van Scotter Committee Member
Keywords
  • adaptive scheduling
  • distributed computing
  • data placement
  • error detection
Date of Defense 2008-11-12
Availability unrestricted
Abstract
Over the years, scientific applications have become more complex and more data intensive. Especially large scale simulations and scientific experiments in areas such as physics, biology, astronomy and earth sciences demand highly distributed resources to satisfy excessive computational requirements. Increasing data requirements and the distributed nature of the resources made I/O the major bottleneck for end-to-end application performance. Existing systems fail to address issues such as reliability, scalability, and efficiency in dealing with wide area data access, retrieval and processing. In this study, we explore data-intensive distributed computing and study challenges in data placement in distributed environments. After analyzing different application scenarios, we develop new data scheduling methodologies and the key attributes for reliability, adaptability and performance optimization of distributed data placement tasks. Inspired by techniques used in microprocessor and operating system architectures, we extend and adapt some of the known low-level data handling and optimization techniques to distributed computing. Two major contributions of this work include (i) a failure-aware data placement paradigm for increased fault-tolerance, and (ii) adaptive scheduling of data placement tasks for improved end-to-end performance. The failure-aware data placement includes early error detection, error classification, and use of this information in scheduling decisions for the prevention of and recovery from possible future errors. The adaptive scheduling approach includes dynamically tuning data transfer parameters over wide area networks for efficient utilization of available network capacity and optimized end-to-end data transfer performance.
Files
  Filename       Size       Approximate Download Time (Hours:Minutes:Seconds) 
 
 28.8 Modem   56K Modem   ISDN (64 Kb)   ISDN (128 Kb)   Higher-speed Access 
  Balman_abstract.pdf 33.60 Kb 00:00:09 00:00:04 00:00:04 00:00:02 < 00:00:01
  Balman_thesis.pdf 9.29 Mb 00:43:01 00:22:07 00:19:21 00:09:40 00:00:49

Browse All Available ETDs by ( Author | Department )

If you have questions or technical problems, please Contact LSU-ETD Support.