Title page for ETD etd-11042009-151525


Type of Document Master's Thesis
Author Akturk, Ismail
Author's Email Address iaktur1@lsu.edu
URN etd-11042009-151525
Title Asynchronous Replication of Metadata Across Multi-Master Servers in Distributed Data Storage Systems
Degree Master of Science in Electrical Engineering (M.S.E.E.)
Department Electrical & Computer Engineering
Advisory Committee
  Advisor Name                   Title
  Kosar, Tevfik                  Committee Co-Chair
  Vaidyanathan, Ramachandran     Committee Co-Chair
  Ramanujam, Jagannathan         Committee Member
  Trahan, Jerry Lee              Committee Member
Keywords
  • distributed data storage
  • metadata replication
  • asynchronous multi-master replication
Date of Defense 2009-10-28
Availability unrestricted
Abstract
In recent years, scientific applications have become increasingly data intensive. The growth in the size of data generated by scientific applications necessitates collaboration and data sharing among the nation's education and research institutions. To address this, distributed storage systems spanning multiple institutions over wide area networks have been developed. One of the important features of distributed storage systems is a global unified name space across all participating institutions, which enables easy data sharing without knowledge of the actual physical location of the data. This feature depends on the "location metadata" of all data sets in the system being available to all participating institutions, which introduces new challenges. In this thesis, we study different metadata server layouts in terms of high availability, scalability and performance. A central metadata server is a single point of failure, leading to low availability. Ensuring high availability requires replication of metadata servers. A synchronously replicated metadata server layout introduces synchronization overhead, which degrades the performance of data operations. We propose an asynchronously replicated multi-master metadata server layout which ensures high availability and scalability and provides better performance. We discuss the implications of asynchronously replicated multi-master metadata servers for metadata consistency and conflict resolution. Further, we design and implement our own asynchronous multi-master replication tool, deploy it in the state-wide distributed data storage system called PetaShare, and compare the performance of all three metadata server layouts: central metadata server, synchronously replicated multi-master metadata servers and asynchronously replicated multi-master metadata servers.
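
The asynchronous multi-master idea described in the abstract can be illustrated with a minimal sketch. This is not the replication tool from the thesis: the class MetadataServer, the function replicate, the paths and site names are all hypothetical, and the conflict rule used here (last writer wins, ordered by a logical clock with the server name as tie-breaker) is an assumption chosen for brevity, since the abstract does not state which conflict resolution policy the thesis adopts. Each server accepts writes locally without waiting for its peers and queues them; a separate replication pass later ships the queued updates to the other masters.

```python
from dataclasses import dataclass, field


@dataclass
class MetadataServer:
    """Hypothetical master holding location metadata: path -> (clock, origin, location)."""
    name: str
    catalog: dict = field(default_factory=dict)
    outbox: list = field(default_factory=list)  # updates not yet shipped to peers
    clock: int = 0                              # logical clock, not wall time

    def write(self, path, location):
        # A local write returns immediately; no peer is contacted here.
        self.clock += 1
        entry = (self.clock, self.name, location)
        self.catalog[path] = entry
        self.outbox.append((path, entry))
        return entry

    def apply_remote(self, path, entry):
        # Assumed conflict rule: the entry with the larger (clock, origin) pair wins.
        self.clock = max(self.clock, entry[0])
        current = self.catalog.get(path)
        if current is None or entry[:2] > current[:2]:
            self.catalog[path] = entry


def replicate(servers):
    """One asynchronous replication pass: drain every outbox into all other servers."""
    for source in servers:
        pending, source.outbox = source.outbox, []
        for path, entry in pending:
            for target in servers:
                if target is not source:
                    target.apply_remote(path, entry)


# Usage: two masters accept conflicting writes, then converge after one pass.
site_a = MetadataServer("a")
site_b = MetadataServer("b")
site_a.write("/projects/run1.dat", "storage-at-site-A")
site_b.write("/projects/run1.dat", "storage-at-site-B")
replicate([site_a, site_b])
print(site_a.catalog == site_b.catalog)
```

Between a local write and the next replication pass the masters can disagree, which is the consistency trade-off the abstract refers to. A synchronous layout would instead contact every peer inside write, removing that window at the cost of the synchronization overhead the abstract mentions.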

Files
  Approximate download time (Hours:Minutes:Seconds)

  Filename            Size      28.8 Modem   56K Modem   ISDN (64 Kb)   ISDN (128 Kb)   Higher-speed Access
  Akturk_thesis.pdf   7.66 MB   00:35:27     00:18:14    00:15:57       00:07:58        00:00:40
