1 / 7

BeStMan Gateway on Hadoop Alex Sim, Brian Bockelman Lawrence Berkeley National Laboratory

BeStMan Gateway on Hadoop Alex Sim, Brian Bockelman Lawrence Berkeley National Laboratory University of Nebraska, Lincoln. How it works all together. in PUT/GET. in Ls/Rm/Mkdir/Rmdir. Client. Client. Hadoop. Hadoop. GridFTP file transfers. srmPrepareToGet/Put. TURL.

Download Presentation

BeStMan Gateway on Hadoop Alex Sim, Brian Bockelman Lawrence Berkeley National Laboratory

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. BeStMan Gateway on Hadoop Alex Sim, Brian Bockelman Lawrence Berkeley National Laboratory University of Nebraska, Lincoln

  2. How it works all together in PUT/GET in Ls/Rm/Mkdir/Rmdir Client Client Hadoop Hadoop GridFTP file transfers srmPrepareToGet/Put TURL srmReleaseFiles/srmPutDone srmLs/srmRm/srmMkdir/srmRmdir BeStMan Gateway BeStMan Gateway Gridftp server Gridftp server Gridftp server Gridftp server Gridftp server Gridftp server File system File System . . . . . .

  3. BeStMan with Hadoop at UNL • UNL • UNL is the first instance of a BeStMan Gateway endpoint to pass all the automated CMS tests which is done through EGEE SAM product. • Running BeStMan Gateway on Hadoop for ~110TB disk storage • For US CMS and US ATLAS, it manages data transfers of 1-2 TB an hour • peaks up to 10Gbps, sustains 2 Gbps • Setup • BeStMan-Gateway • On 2x dual core Xeons @ 2.66GHz, 2GB memory, single 160GB, 1Gbit NIC • Running CentOS 5.2 x86_64 with absolutely no tweaks • Hadoop 0.18.1 with custom site patches for approx. 110TB raw disk • Mounts as a normal file system through FUSE • GridFTP servers • Globus version with a custom Hadoop DSI module • 10 GridFTP servers with BeStMan load-balancing mechanism

  4. Client setup • Client setup • Clients on 200 hosts • Each host started one script almost at the same time (+/- 2 seconds) • Each script did 5 sequential srmLs operations

  5. Ttest results • Entire test was completed in 30 seconds

  6. Summary • BeStMan Gateway mode is an implementation of SRM v2.2. • Great for smaller disk-based storage and file systems • BeStMan Gateway mode on some file systems and storage gives scalable performance • Install/maintain through VDT • Works with other SRM v2.2 implementations • Servers: CASTOR, dCache, DPM, StoRM, SRM/SRB, … • Clients: FTS, PhEDEx, glite-url-copy, lcg-cp, srm-copy, srmcp, … • In OSG, WLCG/EGEE, ESG, …

  7. Documents and Support • BeStMan • http://datagrid.lbl.gov/bestman • http://hep-t3.physics.umd.edu/HowToForAdmins.html#osgBestman • http://wt2.slac.stanford.edu/xrootdfs/bestman-gateway.html • https://www.usatlas.bnl.gov/twiki/bin/view/Admins/BestMan • https://twiki.grid.iu.edu/bin/view/Documentation/BestmanGateway • https://twiki.grid.iu.edu/bin/view/Documentation/BestmanGateway-Xrootd • Xrootd • http://xrootd.slac.stanford.edu • http://wt2.slac.stanford.edu/xrootdfs/xrootdfs.html • SRM Collaboration and SRM Specifications • http://sdm.lbl.gov/srm-wg • Contact and support : srm@lbl.gov

More Related