Css497 undergraduate research
Download
1 / 18

CSS497 Undergraduate Research - PowerPoint PPT Presentation


  • 64 Views
  • Uploaded on

CSS497 Undergraduate Research. Performance Comparison Among Agent Teamwork, Globus and Condor By Timothy Chuang Advisor: Professor Munehiro Fukuda. Overview. Agent Teamwork – deployment of mobile agents Agents launch, monitor and resume jobs Fault-tolerant

loader
I am the owner, or an agent authorized to act on behalf of the owner, of the copyrighted work described.
capcha
Download Presentation

PowerPoint Slideshow about ' CSS497 Undergraduate Research' - connie


An Image/Link below is provided (as is) to download presentation

Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.


- - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - -
Presentation Transcript
Css497 undergraduate research

CSS497 Undergraduate Research

Performance Comparison Among Agent Teamwork, Globus and Condor

By Timothy Chuang

Advisor: Professor Munehiro Fukuda


Overview
Overview

  • Agent Teamwork – deployment of mobile agents

    • Agents launch, monitor and resume jobs

    • Fault-tolerant

  • Condor – opportunist job dispatcher

    • Condor daemon searches for idle computing nodes on which to dispatch jobs

    • Emphasize on job migration upon encountering an error

  • Globus – widely used grid computing middleware

    • MPICH is required for parallel applications


Condor

User

Snapshot

Class Manager

Class Manager

Class Manager

Gateway

Gateway

Gateway

Condor

Condor Pool X


Globus

User

Globus

DUROC/MPICH-G2

LFS

PBS

GRAMs


Snapshot

Methods

Snapshot

Methods

GridTCP

GridTCP

User program wrapper

User program wrapper

Results

Results

snapshot

snapshot

snapshot

User

A

User

B

FTP

Server

snapshots

snapshots

Agent Teamwork

User A’s

Process

User A’s

Process

User B’s

Process

TCP

Communication

Snapshot

Methods

GridTCP

User program wrapper

Sentinel Agent

Sentinel Agent

Sentinel Agent

Commander Agent

Resource Agent

Resource Agent

Commander Agent

BookkeeperAgent

Bookkeeper Agent


Project objectives
Project Objectives

  • Establish reference platform

    • Condor Installation

    • PVM installation

  • Implement parallel applications to run on PVM

    • Matrix Multiplication

    • Wave2D Simulation

    • Mandelbrot Set Simulation

    • Distributed Grep



Problems with condor pvm
Problems with Condor/PVM Teamwork’s check pointing feature

  • Condor no longer fully Supports PVM

    • PVM universe to dispatch jobs in is no longer functional

  • As a result, condor was dropped from the project


Evaluation of agent teamwork s fault tolerance performance
Evaluation of Agent Teamwork’s Fault-tolerance Performance Teamwork’s check pointing feature

  • Applications used

    • Matrix Multiplication

    • Mandelbrot Set Renderer

    • Wave2D Simulation

    • Distributed Grep

  • Fault-tolerance Performance

    • Evaluate the extra overhead of checkpointing and resumption


Challenges
Challenges Teamwork’s check pointing feature

  • Finding a large problem set that can scale well with the increasing number of computing nodes

    • Certain problem sizes are limited to the master node’s memory – Matrix Multiplication

  • Debugging parallel applications

    • Requires going through time consuming diagnosis

  • Finding the best check-pointing frequency for all applications

    • Setting the frequency too low could take up to three hours to finish a job!


Performance matrixmult
Performance - MatrixMult Teamwork’s check pointing feature


Performance wave2d
Performance – Wave2D Teamwork’s check pointing feature


Performance mandelbrot
Performance – Mandelbrot Teamwork’s check pointing feature


Performance distributed grep
Performance – Distributed Grep Teamwork’s check pointing feature


Continued work
Continued Work Teamwork’s check pointing feature

  • Scale problem size to utilize all 64 computing nodes

    • Conduct performance evaluation on multi-clusters

  • Conduct performance evaluation on Globus

    • Compare Globus’ performance with Agent Teamwork


Useful classes
Useful Classes Teamwork’s check pointing feature

  • CSS301 – Technical Writing

  • CSS343 – Data Structures and Algorithms

  • CSS430 – Operating Systems

  • CSS432 – Network Design

  • CSS434 – Parallel and Distributed Computing


Acknowledgements
Acknowledgements Teamwork’s check pointing feature

My Faculty Advisor:

Professor Munehiro Fukuda

UWB Linux System Administrators:

Mr. David Grimmer

Mrs. Meryll Larkin

My Sponsor:

Mr. Joshua Phillips


Questions
Questions? Teamwork’s check pointing feature


ad