Enhancing Atomic Data Management in Distributed GRID Environment
This study explores the challenges of managing atomic data in a GRID environment and proposes solutions. It discusses the weaknesses of the existing system and introduces a new approach that combines GRID infrastructure, a virtualized UI, SQL-based data structures, and integration with VAMDC. The proposed solution aims to streamline data management, improve accessibility, and make astrophysics modeling applications more efficient.
Presentation Transcript
Databases and applications in a distributed GRID environment
Authors:
• Juan González
• Claudio Mendoza
• Tim Kallman
• Michael Witthoeft
• Javier García
Content summary
• Introduction
• The weaknesses
  • Narrow scope
  • Hard to update and maintain
• A solution approach
  • GRID infrastructure
  • Virtualized UI
  • SQL based data structure
• Difficulties
• Benefits
[Diagram labels: Atomic Data (VAMDC); GRID Astrophysics modeling application]
Introduction
• What is XSTAR?
• Atomic data repositories and XSTARdb
• Purpose of XSTARdb
  • Construct a self-consistent data set to use within XSTAR
  • Store atomic data with well-defined data types
  • Build tools to intelligently combine the datasets into a master set
  • Provide all these tools through the web
  • Integrate with VAMDC
• How do we plan to do it?
The Weaknesses
XSTAR frontend
• Narrow scope
  • No web tool available
  • Advanced computer skills required
  • System-dependent installation
XSTAR backend
• Hard to update and maintain
  • Ever-growing sets of data files
  • Sequential modeling
  • Time-consuming consistency and integrity checks
  • Manually modified files (error-prone)
  • Lack of standards and conventions; inconsistent data model
  • Informal version control
A solution approach: gXSTAR
• GRID infrastructure
[Diagram labels: Online Users JDL Xstar Web GENIUS]
• General considerations
  • The web site will be based on the GENIUS grid portal or on a VM
  • Regular jobs will be used to run single XSTAR calculations
  • Parametric JDL will be used to run XSTAR models
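The parametric JDL idea can be sketched as follows. This is a minimal illustration, not the authors' job description: the attribute names follow the gLite WMS convention for parametric jobs (`JobType`, `Parameters`, `ParameterStart`, `ParameterStep`, with `_PARAM_` substituted per sub-job) and should be checked against the middleware in use. The wrapper script `run_xstar.sh` and the `model<k>.in` file names are hypothetical.

```python
def build_parametric_jdl(n_models, executable="run_xstar.sh", input_stem="model"):
    """Return JDL text for a parametric job covering n_models XSTAR runs.

    Assumption: the middleware replaces _PARAM_ with the sub-job index,
    so sub-job k reads <input_stem>k.in. Script and file names are
    hypothetical placeholders.
    """
    if n_models < 1:
        raise ValueError("n_models must be at least 1")
    input_file = f"{input_stem}_PARAM_.in"
    attributes = {
        "JobType": '"Parametric"',
        "Executable": f'"{executable}"',
        "Arguments": f'"{input_file}"',
        # With start 0 and step 1, indices are assumed to run 0 .. n_models - 1.
        "Parameters": str(n_models),
        "ParameterStart": "0",
        "ParameterStep": "1",
        "StdOutput": '"xstar_PARAM_.out"',
        "StdError": '"xstar_PARAM_.err"',
        "InputSandbox": f'{{"{executable}", "{input_file}"}}',
        "OutputSandbox": '{"xstar_PARAM_.out", "xstar_PARAM_.err"}',
    }
    lines = [f"{name} = {value};" for name, value in attributes.items()]
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    print(build_parametric_jdl(5))
```

A single XSTAR calculation would use a regular (non-parametric) job instead; the parametric form only pays off when one model grid maps to many independent runs.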
A solution approach: Virtualized UI
• XSTAR all set to go!
[Diagram labels: Online Users JDL XSTAR VM Download Site Portable GRID UI]
• General considerations
  • No installation/configuration of XSTAR needed
  • VM execution software is required
  • After download, the user’s certificate should be installed in the VM
A solution approach: XSTARdb
• SQL-based data structure with a web front end
[Diagram labels: Input files Scripts Online Users XSTARdb Web XSTARdb]
• General considerations
  • The db structure must be able to describe the data it contains
  • Build a library of functions to act on the db
  • Atomic data to be inserted will come in a variety of formats
  • Being able to browse the database is essential (web and text based)
  • Integrated within the VAMDC frontend
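A self-describing SQL structure with a small function library might look like the sketch below. The schema is hypothetical (table and column names are assumptions, not the actual XSTARdb design), and SQLite stands in for whichever SQL engine is used. The `data_type` table is what lets the database describe its own contents.

```python
import sqlite3

# Hypothetical schema: every stored value points at a row that says
# what kind of quantity it is, in which unit, and where it came from.
SCHEMA = """
CREATE TABLE data_type (
    id          INTEGER PRIMARY KEY,
    name        TEXT NOT NULL UNIQUE,
    description TEXT NOT NULL,
    unit        TEXT
);
CREATE TABLE source (
    id        INTEGER PRIMARY KEY,
    reference TEXT NOT NULL UNIQUE
);
CREATE TABLE atomic_datum (
    id           INTEGER PRIMARY KEY,
    data_type_id INTEGER NOT NULL REFERENCES data_type(id),
    source_id    INTEGER NOT NULL REFERENCES source(id),
    element_z    INTEGER NOT NULL CHECK (element_z >= 1),
    ion_charge   INTEGER NOT NULL
                 CHECK (ion_charge >= 0 AND ion_charge <= element_z),
    value        REAL NOT NULL
);
"""


def create_db(path=":memory:"):
    """Open a database, enforce foreign keys, and create the tables."""
    conn = sqlite3.connect(path)
    conn.execute("PRAGMA foreign_keys = ON")
    conn.executescript(SCHEMA)
    return conn


def add_data_type(conn, name, description, unit=None):
    cur = conn.execute(
        "INSERT INTO data_type (name, description, unit) VALUES (?, ?, ?)",
        (name, description, unit),
    )
    return cur.lastrowid


def add_source(conn, reference):
    cur = conn.execute("INSERT INTO source (reference) VALUES (?)", (reference,))
    return cur.lastrowid


def add_datum(conn, data_type_id, source_id, element_z, ion_charge, value):
    cur = conn.execute(
        "INSERT INTO atomic_datum "
        "(data_type_id, source_id, element_z, ion_charge, value) "
        "VALUES (?, ?, ?, ?, ?)",
        (data_type_id, source_id, element_z, ion_charge, value),
    )
    return cur.lastrowid


def browse(conn, element_z):
    """Text-based browsing: list every stored value for one element."""
    rows = conn.execute(
        """
        SELECT d.ion_charge, t.name, d.value, t.unit, s.reference
        FROM atomic_datum AS d
        JOIN data_type AS t ON t.id = d.data_type_id
        JOIN source AS s ON s.id = d.source_id
        WHERE d.element_z = ?
        ORDER BY d.ion_charge, t.name
        """,
        (element_z,),
    )
    return rows.fetchall()
```

The `CHECK` and foreign-key constraints move part of the consistency and integrity checking into the database itself, so malformed rows are rejected at insert time rather than found later by hand.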
Difficulties
gXSTAR
• Extracting functionality from bundled software
• Distributing the workload of code designed to run sequentially
XSTARdb
• Different types of input files and formats
• Huge amounts of data to process
• Errors within the files
• Duplicated information
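The XSTARdb difficulties around file errors and duplicated information could be handled along the lines of the sketch below. It assumes each input format has already been parsed into `(key, value)` pairs, where `key` is a hypothetical tuple identifying the quantity (for example element, ion charge, data type). The function names and the tolerance are illustrative assumptions.

```python
import math
from collections import defaultdict


def merge_records(records, rel_tol=1e-6):
    """Sort parsed records into merged, conflicting, and rejected sets.

    records: iterable of (key, value) pairs, value a float.
    Returns (merged, conflicts, rejected):
      merged    - dict key -> value, for keys whose copies all agree
      conflicts - dict key -> list of differing values, for manual review
      rejected  - list of (key, value) pairs with non-finite values
    """
    by_key = defaultdict(list)
    rejected = []
    for key, value in records:
        if not math.isfinite(value):
            # NaN or infinity is treated as an error in the input file.
            rejected.append((key, value))
            continue
        by_key[key].append(value)

    merged, conflicts = {}, {}
    for key, values in by_key.items():
        first = values[0]
        if all(math.isclose(first, v, rel_tol=rel_tol) for v in values):
            merged[key] = first      # exact or near-exact duplicates collapse
        else:
            conflicts[key] = values  # same quantity, different numbers
    return merged, conflicts, rejected
```

Keeping conflicting values in a separate set, rather than silently picking one, leaves the choice of which source to trust to a later, explicit step.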
Benefits
gXSTAR
• Widely available through the web and VOA
• More computing power (HPC)
• Flexible search tools
XSTARdb
• Provides the user with an easier way to test new data
• Provides a self-consistent data set to use within XSTAR and VAMDC
• Makes it easier to update, maintain and port data