1 / 8

HAYK SHOUKOURIAN

Rechnertechnik und Rechnerorganisation / Parallelrechnerarchitektur. Workflow Support in Kepler. HAYK SHOUKOURIAN. Rechnertechnik und Rechnerorganisation / Parallelrechnerarchitektur. What is Kepler ?. Software application for the analysis and modeling of scientific data

kamali
Download Presentation

HAYK SHOUKOURIAN

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Rechnertechnik und Rechnerorganisation / Parallelrechnerarchitektur Workflow Support in Kepler HAYK SHOUKOURIAN

  2. Rechnertechnik und Rechnerorganisation / Parallelrechnerarchitektur What is Kepler? Software application for the analysis and modeling of scientific data Simplifies the creation of executable models by using a visual representation of these processes These representations, or “scientific workflows”, display the flow of data among discrete analysis and modeling components HAYK SHOUKOURIAN

  3. Rechnertechnik und Rechnerorganisation / Parallelrechnerarchitektur Basic Components in Kepler Each actor in a workflow can contain one or more ports used to consume or produce data and communicate with other actors in the workflow. Actors are connected in a workflow via their ports. Director controls the execution of a workflow. (just as a film director oversees a cast and crew) Every workflow must have a director Parameters are configurable values that can be attached to a workflow or to individual directors or actors Actors take their execution instructions from the director actors specify what processing occurs director specifies when it occurs Relations allow users to “branch” a data flow. Branched data can be sent to multiple places in the workflow simultaneously. HAYK SHOUKOURIAN

  4. Rechnertechnik und Rechnerorganisation / Parallelrechnerarchitektur Workflow example required when constructing a feedback loop that uses an SDF director. Synchronization points HAYK SHOUKOURIAN

  5. Rechnertechnik und Rechnerorganisation / Parallelrechnerarchitektur Features • Mature platform supporting multiple models of computation suited to distinct types of analysis (processing sensor data, integrating differential equations) • Freely available under the BSD License • Provides a graphical user interface and a run­-time engine that can execute workflows • either from within the graphical interface or from a command line • Workflows can be nested, allowing complex tasks to be composed from simpler • components • build reusable, modular sub-workflows that can be saved and used for many • different applications • Workflows can leverage the computational power of grid technologies (Globus, • SRB, Web and Soaplab Services), as well as take advantage of Kepler’s native support • for parallel processing HAYK SHOUKOURIAN

  6. Rechnertechnik und Rechnerorganisation / Parallelrechnerarchitektur • EUFORIA project has worked to produce Kepler functionality to allow users to run jobs on both Grid and HPC systems from a Kepler workflow www.euforia-project.eu.com • EUFORIA has a Grid infrastructure which incorporates a large amount of computational resources using the gLite/EGEE software to provide Grid functionality • EUFORIA Grid has the capability to run both parallel (MPI) and serial jobs, whereas the • HPC targets are designed exclusively for parallel programs (generally MPI) Note only have the functionality to submit jobs to resources, upload and download data. The workflow will not automatically port your code onto the resource targeted, it can only run codes which have been pre-compiled for the Grid/HPC resource and are either in place already on that resource or are available ready to run (i.e. as a pre-compiled executable) HAYK SHOUKOURIAN

  7. Rechnertechnik und Rechnerorganisation / Parallelrechnerarchitektur KeplervsTaverna (UC Davis, UC Santa Barbara, and UC San Diego) (University of Manchester) 1. Models of computation they use • Kepler works based on the concept of “directors”, which dictate the models of execution used within a workflow • Taverna uses one particular model of computation, based on the lambda calculus, whereas in Kepler users have a range of models of computation available 2. The user communities being served • Kepler: emphasis so far has been on ecology and geology • Taverna: primarily focused on supporting the Life Sciences community (biology, chemistry and medical imaging) HAYK SHOUKOURIAN

  8. Rechnertechnik und Rechnerorganisation / Parallelrechnerarchitektur THANK YOU Questions? HAYK SHOUKOURIAN

More Related