MONITORING, ACCOUNTING AND REGISTRATION SERVICES FOR RUSSIAN GRID NETWORK

Sergey Belov, Tatiana Goloskokova, Vladimir Korenkov, Nikolay Kutovskiy, Danila Oleynik, Artem Petrosyan, Roman Semenov, Alexander Uzhinskiy (LIT JINR)


Presentation Transcript


  1. MONITORING, ACCOUNTING AND REGISTRATION SERVICES FOR RUSSIAN GRID NETWORK Sergey Belov, Tatiana Goloskokova, Vladimir Korenkov, Nikolay Kutovskiy, Danila Oleynik, Artem Petrosyan, Roman Semenov, Alexander Uzhinskiy LIT JINR The 5th International Conference "Distributed Computing and Grid-technologies in Science and Education" (GRID’2012) 20 July 2012, JINR, Dubna, Russia

  2. RGN project • Russian National Grid Network • Development of grid-based networks for high-performance computing • Create a network infrastructure and connect to it the largest supercomputer centers, enterprises, high-tech industries and research organizations • Computing power for high-priority industrial tasks and scientific projects • 2010 – start of network backbone development • 2011 – development of core grid services and application interfaces, grid infrastructure prototype • 2012 – further system development, more supercomputers, more applications

  3. Grid and network infrastructure • Main aim • Integration of unique, medium and small supercomputers into a unified distributed computing environment • Development of a high-speed backbone network for the infrastructure • Provide users with secure and clear remote access to these resources • Heterogeneous grid environment (hardware, software) • Oriented towards parallel tasks rather than single batch tasks • Workflow management • Jobs consist of tasks • Technical points • A set of grid services with unified interfaces • Partially based on Globus Toolkit 4, VOMS • GSI-based security model, mainly RESTful grid services
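The slide above says only that jobs consist of tasks and that workflows are managed. As a minimal sketch, assuming that each task declares which other tasks must finish before it starts (the task names and the dependency model are hypothetical and not taken from the RGN software), an execution order could be derived with Python's standard `graphlib` module:

```python
from graphlib import CycleError, TopologicalSorter


def execution_order(dependencies):
    """Return task names in an order that respects the dependencies.

    `dependencies` maps each task to the tasks that must finish first.
    Raises ValueError if the tasks depend on each other in a cycle.
    """
    try:
        return list(TopologicalSorter(dependencies).static_order())
    except CycleError as exc:
        # The second element of args holds the detected cycle.
        raise ValueError(f"job has cyclic task dependencies: {exc.args[1]}") from exc


# Hypothetical job made of three tasks that form a simple chain.
job = {
    "prepare_mesh": [],
    "run_solver": ["prepare_mesh"],
    "collect_results": ["run_solver"],
}
order = execution_order(job)
```

A real metascheduler would also have to match tasks to sites and handle failures; the sketch covers only the ordering step.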

  4. Core grid services • Web User Interface • Resource Broker/metascheduler + Workflow management (RESTful) • Grid gateway to the resources (WS) • Information Service (RESTful) • Monitoring & accounting • Registration service (RESTful) • GSI services: CA, MyProxy, VOMS • GridFTP servers • Non-grid: application licensing service, billing and audit, management of application solver images

  5. Goals of monitoring • Tracking the state of sites and services • Availability • Real operational state • Monitoring of users' jobs and tasks • Keeping a history of various system parameters • Information representation • Overall state of the infrastructure • Running jobs and tasks • Individual sites and services (real-time and history) • Visualization of job events
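To illustrate how a stored history can serve both the real-time and the historical view named above, here is a minimal sketch. It assumes that monitoring keeps a list of timestamped up/down samples per service; the `Probe` record layout and the function names are hypothetical and do not describe the RGN implementation.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta


@dataclass(frozen=True)
class Probe:
    """One monitoring sample for a service (hypothetical record layout)."""
    service: str
    time: datetime
    ok: bool


def availability(probes, service):
    """Fraction of samples that reported `service` as up, or None if there are none."""
    samples = [p.ok for p in probes if p.service == service]
    if not samples:
        return None
    return sum(samples) / len(samples)


def current_state(probes, service):
    """State reported by the most recent sample for `service`, or None if unknown."""
    own = [p for p in probes if p.service == service]
    if not own:
        return None
    return max(own, key=lambda p: p.time).ok


# Four samples taken five minutes apart for a hypothetical "gateway" service.
start = datetime(2012, 7, 20, 12, 0)
history = [
    Probe("gateway", start + timedelta(minutes=5 * i), ok)
    for i, ok in enumerate([True, True, False, True])
]
```

With this history, `availability(history, "gateway")` gives the share of successful samples and `current_state(history, "gateway")` gives the latest reading.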

  6. Monitoring of resources and services • State of computational resources by site (based on data from information indexes) • Slots available for tasks • Structure and properties of clusters • Subclusters, nodes, slots, operating system, architecture • Application software • Supported VOs (with ACLs, Access Control Lists) • Monitoring of jobs running on sites (based on information from job management servers)

  7. Accounting and job monitoring • Goal: to get information, both real-time and historical, on resource utilization and on jobs running on the RGN infrastructure (by users, VOs, sites) • Information sources: JMS servers, Grid Gateways and local resource managers • Collecting data on jobs and tasks in the system • Timestamps of all job events, actual consumed CPU time • Accounting reports in different views: • by sites, VOs and single users • Aggregation of a job's actual execution time from all sites
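The report views listed above (by site, VO and user, plus per-job totals across sites) amount to grouping task records by different keys. The following is a minimal sketch under the assumption that each collected record carries the job ID, user, VO, site and consumed CPU seconds; the `TaskRecord` layout and all names and values are hypothetical.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class TaskRecord:
    """Accounting data for one finished task (hypothetical layout)."""
    job_id: str
    user: str
    vo: str
    site: str
    cpu_seconds: float


def cpu_time_by(records, key):
    """Sum consumed CPU seconds grouped by 'user', 'vo', 'site' or 'job_id'."""
    if key not in ("user", "vo", "site", "job_id"):
        raise ValueError(f"unsupported grouping key: {key}")
    totals = defaultdict(float)
    for record in records:
        totals[getattr(record, key)] += record.cpu_seconds
    return dict(totals)


# Two jobs; the tasks of job-1 ran on two different sites.
records = [
    TaskRecord("job-1", "alice", "vo-a", "site-1", 100.0),
    TaskRecord("job-1", "alice", "vo-a", "site-2", 50.0),
    TaskRecord("job-2", "bob", "vo-b", "site-1", 50.0),
]
```

Grouping by `job_id` corresponds to the last bullet: the tasks of one job may run at several sites, and their times are added up into a single per-job figure.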

  8. Monitoring and accounting information flows (diagram; labels: Infosys central information index • Monitoring and accounting data storage • Information collector • JMS (job management services) • Monitoring website • Monitoring data provisioning (Web Services) • Accounting • Information publisher • Registration service)

  9. Accounting and jobs monitoring screenshots

  10. Resource centers on the map • Main sites of the infrastructure prototype (2011) • Other supercomputers are being connected and tested • More supercomputers from different Russian regions will be added by the end of 2012 • 10 resource centers are already connected to the high-speed VPN • http://grid.voskhod.ru

  11. Infrastructure operation visualization with Google Earth • The visualization package from the ARDA Dashboard project was used

  12. Registration service • Service for registration of resources and grid services • Main task is to store and provide information about resources and grid services in RGN • Site contacts and other formal information • Service endpoints, type, DN, etc. • Operational state of resources • Support for registration and information modification procedures • Web interface for users and RESTful interface for system services • Authentication and authorization, with rights and roles that control who may add, modify or access information
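A minimal in-memory sketch of the idea described above: service records (endpoint, type, DN, operational state) are stored, and each operation checks the caller's rights first. The class names, the right names ("add", "modify", "read") and the example DNs are assumptions made for illustration; the actual RGN service exposes this through a web interface and a RESTful interface, which the sketch does not model.

```python
from dataclasses import dataclass


@dataclass
class ServiceRecord:
    """Registered grid service (hypothetical layout)."""
    endpoint: str
    service_type: str
    dn: str
    state: str = "unknown"


class Registry:
    """Stores service records and checks the caller's rights on every call."""

    def __init__(self, rights_by_dn):
        self._rights = rights_by_dn  # caller DN -> set of granted rights
        self._records = {}           # endpoint -> ServiceRecord

    def _require(self, caller_dn, right):
        if right not in self._rights.get(caller_dn, set()):
            raise PermissionError(f"{caller_dn} lacks the '{right}' right")

    def add(self, caller_dn, record):
        self._require(caller_dn, "add")
        if record.endpoint in self._records:
            raise ValueError(f"{record.endpoint} is already registered")
        self._records[record.endpoint] = record

    def set_state(self, caller_dn, endpoint, state):
        self._require(caller_dn, "modify")
        self._records[endpoint].state = state

    def find(self, caller_dn, service_type):
        self._require(caller_dn, "read")
        return [r for r in self._records.values() if r.service_type == service_type]


# Hypothetical callers: a site administrator and a read-only system service.
registry = Registry({
    "/CN=site-admin": {"add", "modify", "read"},
    "/CN=monitoring": {"read"},
})
registry.add("/CN=site-admin",
             ServiceRecord("https://gw.example.org/", "gateway", "/CN=gw.example.org"))
registry.set_state("/CN=site-admin", "https://gw.example.org/", "production")
gateways = registry.find("/CN=monitoring", "gateway")
```

Keeping the rights check in one helper means every operation is guarded the same way; a read-only caller such as the hypothetical monitoring service can query records but cannot add or change them.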

  13. Registration service’s web interface

  14. Conclusion and results • An architecture and a software solution for the monitoring, accounting and registration services of the Russian National Grid Network were developed • Experience from developing similar services within the GridNNN project was very helpful and was heavily used • The tools created were successfully deployed and are in production • The Russian National Grid Network project is still under development, so work on the services will continue
