Site report from kek japan
Download
1 / 19

Site Report from KEK, Japan - PowerPoint PPT Presentation


  • 130 Views
  • Uploaded on

Site Report from KEK, Japan. JP-KEK-CRC-01 and JP-KEK-CRC-02 Go Iwai, KEK/CRC Grid Operations Workshop – 2007 Kungliga Tekniska högskolan, Stockholm, Sweden 13-15 June 2007. JP-KEK-CRC-01 and JP-KEK-CRC-02. DEPLOYMENT STATUS AT KEK. KEK Internal Network. KEK External Network.

loader
I am the owner, or an agent authorized to act on behalf of the owner, of the copyrighted work described.
capcha
Download Presentation

PowerPoint Slideshow about ' Site Report from KEK, Japan' - lonna


An Image/Link below is provided (as is) to download presentation

Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.


- - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - -
Presentation Transcript
Site report from kek japan

Site Report from KEK, Japan

JP-KEK-CRC-01 and JP-KEK-CRC-02

Go Iwai, KEK/CRC

Grid Operations Workshop – 2007

Kungliga Tekniska högskolan, Stockholm, Sweden

13-15 June 2007


Deployment status at kek

JP-KEK-CRC-01 and JP-KEK-CRC-02

DEPLOYMENT STATUS AT KEK

Grid Operations Workshop at KTH, Stockholm


Logical site overview

KEK Internal Network

KEK External Network

Logical Site Overview

Scoped only for GRIDs

APAN

SuperSINET

  • Taiwan

  • Asia-Pacific region

KEK Firewall

  • Domestic institutes

  • U.S.A

Grid LAN

Central Computing System

New KEK-CC

HPSS

JP-KEK-CRC-02

  • Production System

JP-KEK-CRC-00

JP-KEK-CRC-01

  • Not for WLCG

  • Staff’s training

  • Will Shift to PPS

  • Production System

Grid Operations Workshop at KTH, Stockholm


Physical site overview
Physical Site Overview

KEK-2

KEK-1

Grid Operations Workshop at KTH, Stockholm


Brief summary of lcg deployment
Brief Summary of LCG Deployment

JP-KEK-CRC-01

JP-KEK-CRC-02

since early 2006.

is registered to GOC, is ready to WLCG.

Site Role:

More stable services based on KEK-1 experiences.

Resource and Component:

SL or SLC w/ gLite-3.0 later

CPU: 48, Storage: ~1TB (w/o HPSS)

Full components

Supported VOs:

belle, apdg, g4med, atlasj, ppj, ilc, calice, dteam, ops and ail

  • since Nov. 2005.

  • is registered to GOC, is ready to WLCG

  • is operated by KEK staffs.

  • Site Role:

    • practice for production system JP-KEK-CRC-02.

    • test use among university groups in Japan.

  • Resource and Component:

    • SL-3.0.5 w/ gLite-3.0 later

    • CPU: 14, Storage: ~1.5TB

    • FTS, FTA, RB, MON, BDII, LFC, CE, SE

  • Supported VOs:

    • belle, apdg, g4med, ppj, dteam, ops, calice, ilc and ail

Grid Operations Workshop at KTH, Stockholm


Grid related services
GridRelated Services

  • We have our own GRID CA

    • is started on Feb. 2006, and is recognized by LCG.

    • is accredited by APGRID PMA

    • http://gridca.kek.jp/

  • VO Membership Service

    • Supported VOs:

      • apdg is the VO for Asia-Pacific Data Grid.

      • belle is the VO for Belle experiments.

      • atlasj is the VO for Atlas experiments in Japan.

      • g4med is the VO for Geant4 medical application.

      • PPJ is the VO for the Particle Physics in Japan.

      • ail is the VO for Associated International Laboratory between Japan and France.

    • http://voms.kek.jp/

  • Local Mirror Service

    • SL, SLC, LCG, gLite

    • It takes ~30 minutes to update by using apt-get with CERN or FNAL repositories.

      • ~3 minutes with KEK repository

    • http://hepdg.cc.kek.jp/mirror/

  • Semi-automatic Installation Service

    • WNs can be installed semi-automatically by PXE (Preboot eXecution Environment) and kickstart configuration file.

    • http://hepdg.cc.kek.jp/install/

  • Site Portal

    • http://grid.kek.jp/

Grid Operations Workshop at KTH, Stockholm


People on grid at kek crc
People on Grid at KEK/CRC

  • 7 persons in total

  • CA

    • T. Sasaki and Y. Iida

  • VOMS

    • Y. Watase and G. Iwai

  • Site Operation and Security

    • KEK-0

      • G. Iwai

    • KEK-1

      • T. Sasaki, Y. Iida, Y. Watase and G. Iwai

    • KEK-2

      • T. Sasaki, Y. Watase, and G. Iwai

  • Deployment

    • Y. Watase, Y. Iida and G. Iwai

  • Documentation

    • Y. Watase

  • Networking

    • S. Suzuki, S. Yashiro and Y. Iida

  • Application (SRB, Portal and some Gridified applications)

    • K. Murakami, Y. Iida and G. Iwai

Grid Operations Workshop at KTH, Stockholm


Operation statistics
OPERATION STATISTICS

Grid Operations Workshop at KTH, Stockholm


Submitted ggus tickets in jfy2006
Submitted GGUS Tickets in JFY2006

  • Total number of submitted ticket: 28

    • KEK-1: 11

    • KEK-2: 17

Grid Operations Workshop at KTH, Stockholm


Number of submitted jobs in jfy2006
Number of Submitted Jobs in JFY2006

JP-KEK-CRC-01

JP-KEK-CRC-02

Grid Operations Workshop at KTH, Stockholm


Normalized cpu time in jfy2006 ksi2k hrs
Normalized CPU time in JFY2006(kSI2K*hrs)

JP-KEK-CRC-01

JP-KEK-CRC-02

Grid Operations Workshop at KTH, Stockholm


Virtual organization

Belle Experiment and Accelerator Science

VIRTUAL ORGANIZATION

Grid Operations Workshop at KTH, Stockholm


Vo for the belle experiment
VO for the Belle Experiment

  • Belle VO is federated among 4 countries, 6 institutes, 9 sites.

    • Japan: Nagoya University and KEK

    • Taiwan: ASGC and NCU

    • Australia: University of Melborne

    • Poland: CYFRONET

    • Korea University comes up soon.

  • Started using SRB and LCG

  • Data distribution service using SRB-DSI

    • Belle already has a few PBs data in total including 100s TB DST and MC

      • Bulk file register helps us: Sregister

      • we do not move any of them

    • It is too much difficult to export existing data to LCG physically

    • Benefits both for native SRB users and LCG users

  • SRB-DSI with LCG is in operation now.

KEK

Japan

CYFRONET

Poland

Nagoya Univ.

Japan

ASGC

Taiwan

NCU

Taiwan

Melbourne Univ.

Australia

Grid Operations Workshop at KTH, Stockholm



Vo for the accelerator science
VO for the Accelerator Science

  • Domestic supports

    • Typical case at laboratory: A few staffs, ~10 students and no technician.

  • Start to monitor them centrally over the VO

    • Focused on their operation supports

    • Not only for WLCG sites but also for NON-WLCG sites

  • PPJ VO is started for the accelerator science in Japan.

  • Federated among a few universities.

    • Tohoku Univ., Tsukuba Univ., Kobe Univ., Hiroshima Univ., Nagoya Univ. and KEK.

  • Usage:

    • To share resources and experiences among major groups, ILC, KamLand, CDF and ATLAS without depending on experimental projects.

Hiroshima IT

Grid Operations Workshop at KTH, Stockholm


Conclusion
Conclusion

  • Tools used in daily grid operations

    • Semi –automatic installation tools only for WNs

      • Most of tools are handmade scripts

    • Monitoring tools, e.g.; SAM and GSTAT are very useful.

    • GGUS Search and APWIKI are also.

    • We are testing to audit by using nCircle, vulnerability management system.

  • Scheduled Interventions

    • 11 times in JFY2006

    • Due to

      • Software/hardware upgrade and site reconfiguration

      • Annual maintenance

      • Replacement of host certificate

  • Unscheduled interventions

    • ~10 times/year

    • Ex) Failed to reconfigure the site, or power cut by thunder.

  • Domestic supports in Japan

    • Important mission for KEK.

  • ~90% of problems are detected by the COD, SAM, GSTAT and nagios.

    • Our operation on Grid is supported by great efforts by APROC members in ASGC, Taiwan.

    • We’d like to keep the tighter collaboration with ASGC.

Grid Operations Workshop at KTH, Stockholm


Thank you

END

Grid Operations Workshop at KTH, Stockholm


Lcg with srb at belle vo

Pluggable Extension

SRB-DSI

GridFTP

130.87.104.0/22

LCG with SRB at Belle VO

APAN

KEK-DMZ

GridFTP

SuperSINET

KEK Firewall

SRB

KEK-CC

Grid LAN

KEK-1

130.87.208.0/22

KEK-2

202.13.197.0/24

KEK-FB

130.87.224.0/21

SRB/MCAT

172.22.28.0/24

B-NET

HSM

NFS

New built

Grid Operations Workshop at KTH, Stockholm


Points to cover in each presentation
Points to Cover in Each Presentation

  • tools used in daily grid operations

  • what features are missing to make your work easier

  • examples of the most frequent scheduled interventions at your site

  • examples of the most frequent unscheduled interventions at your site

  • points to improve in communication with ROC, other sites, Vos, rest of the world...

  • How do you plan deployment of updates/new versions so continuous production is not interrupted?

  • Communication with users: how are you informed about operational problems at your site reported by local/remote users? Mail/GGUS/phone/other?

  • Correlation of cross-site issues: is the operations meeting enough for this? How do you do it otherwise?

  • What percentage of real site problems are detected and reported by the COD before you know about them?

  • usefulness of the following operations bodies/meetings and suggestions to improve them:

    • COD

    • your ROC support team

    • operations meeting

Grid Operations Workshop at KTH, Stockholm


ad