Type of Document Master's Thesis Author Bahsi, Emir Mahmut Author's Email Address firstname.lastname@example.org URN etd-07092008-211059 Title Dynamic Workflow Management for Large Scale Scientific Applications Degree Master of Science in Systems Science (M.S.S.S.) Department Computer Science Advisory Committee
Advisor Name Title Kosar, Tevfik Committee Chair Allen, Gabrielle D. Committee Member White, Christopher D. Committee Member Keywords
- grid computing
- workflow enabling
- site selector
Date of Defense 2008-07-03 Availability unrestricted AbstractThe increasing computational and data requirements of scientific applications have made the usage of large clustered systems as well as distributed resources inevitable. Although executing large applications in these environments brings increased performance, the automation of the process becomes more and more challenging. The use of complex workflow management systems has been a viable solution for this automation process.
In this thesis, we study a broad range of workflow management tools and compare their capabilities especially in terms of dynamic and conditional structures they support, which are crucial for the automation of complex applications. We then apply some of these tools to two real-life scientific applications: i) simulation of DNA folding, and ii) reservoir uncertainty analysis.
Our implementation is based on Pegasus workflow planning tool, DAGMan workflow execution system, Condor-G computational scheduler, and Stork data scheduler. The designed abstract workflows are converted to concrete workflows using Pegasus where jobs are matched to resources; DAGMan makes sure these jobs execute reliably and in the correct order on the remote resources; Condor-G performs the scheduling for the computational tasks and Stork optimizes the data movement between different components.
Integrated solution with these tools allows automation of large scale applications, as well as providing complete reliability and efficiency in executing complex workflows. We have also developed a new site selection mechanism on top of these systems, which can choose the most available computing resources for the submission of the tasks. The details of our design and implementation, as well as experimental results are presented.
Filename Size Approximate Download Time (Hours:Minutes:Seconds)
28.8 Modem 56K Modem ISDN (64 Kb) ISDN (128 Kb) Higher-speed Access Bahsi_thesis.pdf 4.27 Mb 00:19:46 00:10:10 00:08:53 00:04:26 00:00:22
If you have questions or technical problems, please Contact LSU-ETD Support.