1 / 1

0211-DOLAP

elke
Download Presentation

0211-DOLAP

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. <?xml version="1.0" encoding="US-ascii"?> <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML+RDFa 1.0//EN" "http://www.w3.org/MarkUp/DTD/xhtml-rdfa-1.dtd"> <html xmlns="http://www.w3.org/1999/xhtml" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:foaf="http://xmlns.com/foaf/0.1/" version="XHTML+RDFa 1.0"> <head> <meta http-equiv="refresh" content="6; url=http://crd.lbl.gov/wu/"> <meta name="dc:creator" content="Kesheng John Wu"/> <meta name="verify-v1" content="xrcMAS0CYTYQAHBZzj7MVjzSa9jnHwCz9isAkDBVBFw="/> <meta name="Keywords" content="Wu, John, Kesheng, WuKeSheng, FastBit, bitmap index, data warehouse, query-driven visualization, word-aligned hybrid code, bitmap compression, union-find, connected component labeling, Lanczos algorithm, thick-restart Lanczos, symmetric eigenvalue problem"/> <meta name="description" content="This page contains my own research results in scientific data management and parallel and distributed software engineering. The more recent papers are on scientific data management. In particular how to improve the query processing speed through the use of compressed bitmap index. The most important technology used in a compression scheme named the word-aligned hybrid code that significantly improve the operation speed with moderate decrease in compression effectiveness. Earlier papers posted here concentrate on techniques for solving large scale eigenvalue problems arising from scientific and engineering applications. The particular methodologies that are most familiar to me are the Lanczos method, the Arnoldi method and the Davidson method. My main research results are on how to improve the efficiencies of these methods restarting. The thick-restart Lanczos method is found to be very effective method in real-world applications."/> <base href="http://sdm.lbl.gov/%7Ekewu/"/> <link rel="foaf:primaryTopic" href="http://sdm.lbl.gov/%7Ekewu/#me"/> <link rel="SHORTCUT ICON" href="wu.ico"> <title>John Wu @ Berkeley Lab</title> </head> <body about="http://sdm.lbl.gov/%7Ekewu/#me" typeof="foaf:person"> <!-- &nbsp;&nbsp; --> <div style="text-align: right;"> Telephone: <span rel="foaf:phone" href="tel:+1-510-486-6609">1-510-486-6609</span><br/> E-mail: <a rel="foaf:mbox" href="mailto:kwu at lbl dot gov">kwu<img src="gif/at_blue0.gif" width="10" height="10" border=0 align="bottom" alt="@"/>lbl.gov</a> or <a rel="foaf:mbox" href="mailto:john dot wu at acm dot org">john.wu<img src="gif/at_blue0.gif" width="10" height="10" border=0 align="bottom" alt="@"/>acm.org</a><br/> <a href="http://maps.yahoo.com/maps_result.php?q1=one+cyclotron+road,+Berkeley,+CA">Mailstop 50B-3238, 1 Cyclotron Road, Berkeley, CA 94720, USA</a> </div> <div style="text-align: center;font-size:x-large;font-weight:bold"> <span lang="zh-cmn" xlm:lang="zh-cmn" style="font-family:FangSim,SimSun" property="foaf:name">&#27494;&#20811;&#32988;</span> <!--img src="gif/name30.gif" width="84" height="30" border="0" margin=0 alt=""--> &nbsp;&nbsp;&nbsp;&nbsp; <span property="foaf:givenname">Kesheng</span> <span property="foaf:familyname">Wu</span><!--br/--> &nbsp;&nbsp;&nbsp;&nbsp; (<span property="foaf:nick">John</span>) </div> <!--table width=100% border=0><tr><td> </td><td> </td></tr></table--> <div style="width: 18em; float: right; align: right; border-width: 0px; margin: 1em;"> <form action="http://google.lbl.gov/search" method="get" name="gs"> <table cellspacing="0" cellpadding="0" align="center"> <tr> <td valign="middle"><font size="-1"> <input type="text" name="q" size="30" maxlength="256" value=""></font></td> <td valign="middle">&nbsp; <input value="xml_no_dtd" name="output" type="hidden"></input> <input value="date:AD:L:d1" name="sort" type="hidden"></input> <input value="UTF-8" name="ie" type="hidden"></input> <input value="" name="lr" type="hidden"></input> <input value="default_frontend" name="client" type="hidden"></input> <input value="UTF-8" name="oe" type="hidden"></input> <input type="hidden" name="numgm" value="5"></input> <input value="default_frontend" name="proxystylesheet" type="hidden"></input> <input type="hidden" name="site" value="ALL"></input> <input type="hidden" name="num" value="40"></input> <input value="Search" name="btnG" type="submit"></input></td> </tr> </table> </form> </div> <!-- main body of the web page is in a long DL list --> <dl> <p> <dt><font size="+1" color=red><b>Research Interests</b></font> <dd> <img src="gif/ball_pink.gif" width="14" height="14" alt="*"/> Analysis and management of large datasets<br/> <img src="gif/ball_pink.gif" width="14" height="14" alt="*"/> Parallel distributed software systems design and implementation<br/> <img src="gif/ball_pink.gif" width="14" height="14" alt="*"/> Component architecture for scientific software<br/> <img src="gif/ball_pink.gif" width="14" height="14" alt="*"/> Performance tuning for large distributed software<br/> <p> <dt><font size="+1" color=orange><b>Projects</b></font> <div style="float: right; width: 157px; margin: 0.5em;"> <a href="http://sdm.lbl.gov/fastbit"><img src="http://sdm.lbl.gov/%7Ekewu/fastbit/fastbit.gif" width="157" height="63" border=0 align=right alt="Make It A Bit Faster with FastBit"/></a> </div> <dd> <img src="gif/ball_orange.gif" width="14" height="14" alt="*"/> <a href="fastbit">FastBit</a> [<a href="fastbit/publications.html">Publications</a>]: an efficient compressed <a href="http://en.wikipedia.org/wiki/Bitmap_index">bitmap index</a> technology for data intensive sciences. This project addresses the challenges of efficiently searching growing amounts of data collected/generated by various scientific applications, such as <a href="http://www.star.bnl.gov/STAR/comp/train/tut/GC/GridCollector.htm">high-energy physics</a>, <a href="http://scidacreview.org/0602/html/data.html">combustion</a>, <a href="http://vis.lbl.gov/Research/Dex">astrophysics</a>, and <a href="http://vis.lbl.gov/Vignettes/QDV-NetworkTraffic/qdv-vignette.html">network traffic analysis</a>. The FastBit software has received an <a href="http://www.rdmag.com/awards.html">R&amp;D 100 Award</a>; here is a <a href="http://sdm.lbl.gov/%7Ekewu/gif/RD100-photo-2-small.jpg">photo</a> from the award receiption. <dd> <img src="gif/ball_orange.gif" width="14" height="14" alt="*"/> <a href="labeling">Connected Component Labeling</a> [<a href="labeling/publications.html">Publications</a>]: an efficient connected component labeling algorithm. This grows out our work on feature tracking for a combustion data analysis. The key new insight is that there is a way to make use of an <a href="http://sdm.lbl.gov/%7Ekewu/ps/LBNL-57527.html">implicit union-find data structure</a> to speed up the connected component labeling algorithms, which in turn leads to faster algorithms for finding regions of interest. In particular, using compressed bitmaps as representations of points in the regions of interest, we can find the regions in time that is proportional to the the number of points on the boundary of the regions. This is <a href="http://sdm.lbl.gov/%7Ekewu/ps/LBNL-57023.html">faster than the best iso-contouring algorithms</a> and <a href="http://sdm.lbl.gov/%7Ekewu/ps/LBNL-56864.html">much faster than similar region finding algorithms</a>. This is also a basis of some of the work on <a href="http://vis.lbl.gov/Research/Dex">visualization</a> and <a href="http://vis.lbl.gov/Vignettes/QDV-NetworkTraffic/qdv-vignette.html">visual analytics</a>. <dd> <img src="gif/ball_orange.gif" width="14" height="14" alt="*"/> <a href="http://www-vis.lbl.gov/Research/Dex">DEX</a> [<a href="fastbit/publications.html">Publications</a>]: a query-based visualization tool. This project provides a new visualization capability based on the <a href="http://sdm.lbl.gov/fastbit">FastBit technology</a> and the <a href="http://sdm.lbl.gov/%7Ekewu/labeling">fast connected component labeling technology</a>. This effective combination was first demonstrated on a project of <a href="http://sdm.lbl.gov/%7Ekewu/ps/LBNL-52535.html">analyzing combustion simulation data</a>. It is extensively documented in <a href="http://ieeexplore.ieee.org/iel5/10473/33220/01566004.pdf?arnumber=1566004">our paper at IEEE Vis 2005</a>, and also appeared in <a href="http://www.scidacreview.org/0602/html/data.html">a SciDAC review report</a> about the <a href="http://sdmcenter.lbl.gov">Scientific Data Management Center</a>. <dd> <img src="gif/ball_orange.gif" width="14" height="14" alt="*"/> <a href="trlan">TRLan</a> [<a href="trlan/publications.html">Publications</a>]: Thick-restart Lanczos method for symmetric eigenvalue problems. A <a href="http://sdm.lbl.gov/%7Ekewu/trlan.html">Fortran 90 implementation for symmetric eigenvalue problems</a> and <a href="https://codeforge.lbl.gov/frs/?group_id=43">another one in C for Hermitian eigenvalue problems</a> are available with a <a href="http://sdm.lbl.gov/%7Ekewu/trlan-license.txt">BSD-like license</a>. <p> <dt><font size="+1" color=blue><b>Selected Publications</b></font> <dd> <div about="http://doi.acm.org/10.1145/1670243.1670245"> <img src="gif/ball_yellow.gif" width="14" height="14" alt="*"/> <span property="dc:creator">Kesheng Wu</span>, <span property="dc:creator">Arie Shoshani</span>, and <span property="dc:creator">Kurt Stockinger</span>. <span style="font-weight:italic" property="dc:title">Analyses of Multi-Level and Multi-Component Compressed Bitmap Indexes</span>. ACM Transactions on Database Systems v35, Article 2, 2010. <a href="http://doi.acm.org/10.1145/1670243.1670245">DOI 10.1145/1670243.1670245</a><br/> [<a href="http://sdm.lbl.gov/%7Ekewu/ps/LBNL-60891.html">Abstract</a>] [<a href="http://sdm.lbl.gov/%7Ekewu/ps/LBNL-60891.pdf">Draft as LBNL-60891</a>] </div> <dd> <div about="http://www.springerlink.com/content/b67258v347158263/"> <img src="gif/ball_green.gif" width="14" height="14" alt="*"/> <span property="dc:creator">Kesheng Wu</span>, <span property="dc:creator">Ekow Otoo</span>, and <span property="dc:creator">Kenji Suzuki</span>. <span style="font-weight:italic" property="dc:title">Optimizing two-pass connected-component labeling algorithms</span>. Pattern Analysis & Applications, v12(2), pages 117 - 135. 2009. <a href="http://www.springerlink.com/content/b67258v347158263/">DOI 10.1007/s10044-008-0109-y</a><br/> [<a href="ps/LBNL-59102.html">Abstract</a>] [<a href="ps/LBNL-59102.pdf">Draft as LBNL-59102</a>] </div> <dd> <div about="http://doi.acm.org/10.1145/1132863.1132864"> <img src="gif/ball_blue.gif" width="14" height="14" alt="*"/> <span property="dc:creator">Kesheng Wu</span>, <span property="dc:creator">Ekow Otoo</span>, and <span property="dc:creator">Arie Shoshani</span>. <span style="font-weight:italic" property="dc:title">Optimizing bitmap indices with efficient compression</span>. ACM Transactions on Database Systems, v 31, pages 1-38, 2006. <a href="http://doi.acm.org/10.1145/1132863.1132864">DOI 10.1145/1132863.1132864</a>. <br/> [<a href="http://sdm.lbl.gov/%7Ekewu/ps/LBNL-49626.html">Abstract</a>] [<a href="http://sdm.lbl.gov/%7Ekewu/ps/LBNL-49626.pdf">Draft as LBNL-49626</a>] </div> <dd> <div about="http://www.vldb.org/conf/2004/RS1P2.PDF"> <img src="gif/ball_yellow.gif" width="14" height="14" alt="*"/> <span property="dc:creator">Kesheng Wu</span>, <span property="dc:creator">Ekow Otoo</span>, and <span property="dc:creator">Arie Shoshani</span>. <span style="font-weight:italic" property="dc:title">On the Performance of Bitmap Indices for High Cardinality Attributes</span>. <a href="http://www.vldb.org/conf/2004/RS1P2.PDF">VLDB 2004, pages 24 - 35</a>.<br/> [<a href="ps/LBNL-54673.html">Abstract</a>] [<a href="ps/LBNL-54673.pdf">Draft as LBNL-54673</a>] </div> <dd> <div about="http://dx.doi.org/10.1137/S0895479898334605"> <img src="gif/ball_blue.gif" width="14" height="14" alt="*"/> <span property="dc:creator">Kesheng Wu</span> and <span property="dc:creator">Horst Simon</span>. <span style="font-weight:italic" property="dc:title">Thick-restart Lanczos method for large symmetric eigenvalue problems</span>. SIAM Journal on Matrix Analysis and Applications, v 22, No. 2, pp. 602-616, 2001. <a href="http://dx.doi.org/10.1137/S0895479898334605">DOI 10.1137/S0895479898334605</a><br/> [<a href="ps/trlan.html">Abstract</a>] [<a href="ps/trlan.ps">Draft as LBNL-41412</a>]. </div> <dd> <div about="http://prb.aps.org/abstract/PRB/v50/i16/p11355_1"> <img src="gif/ball_blue.gif" width="14" height="14" alt="*"/> <span property="dc:creator">Jim R. Chelikowsky</span>, <span property="dc:creator">Norm Troullier</span>, <span property="dc:creator">Kesheng Wu</span>, and <span property="dc:creator">Yousef Saad</span>. <span style="fount-weight:italic" property="dc:title">Combining a higher-order finite-difference method with ab initio pseudopotentials: application to diatomic molecules</span>. Phys. Rev. B <A HREf="http://prola.aps.org/pdf/PRB/v50/i16/p11355_1">50:11355-64</A>, 1994. <a href="http://dx.doi.org/10.1103/PhysRevB.50.11355">DOI: 10.1103/PhysRevB.50.11355</A> </div> <dd> <div about="http://dx.doi.org/10.1016/0167-2789(93)90188-7"> <img src="gif/ball_green.gif" width="14" height="14" alt="*"/> <span property="dc:creator">Kesheng Wu</span>, <span property="dc:creator">Robert Savit</span>, and <span property="dc:creator">William Brock</span>. <span style="font-weight:italic" property="dc:title">Statistical tests for deterministic effects in broad band time series</span>. Physica D: Nonlinear Phenomena 69(1-2): 172-188. 1993. <a href="http://dx.doi.org/10.1016/0167-2789(93)90188-7">DOI 10.1016/0167-2789(93)90188-7</a> </div> <p> <dd><img src="gif/ball_pink.gif" width="14" height="14" alt="*"/><font size="+1">Publications listed elsewhere on the web</font>: <table cellspacing="5px" cellpadding="2"> <tr><td> [<a typeof="foaf:publications" href="http://scholar.google.com/citations?user=ju1z14aMmRkC">Google Scholar Profile</a>] </td><td> [<a typeof="foaf:publications" href="http://academic.research.microsoft.com/Author/2010433.aspx">Microsoft Academic Profile</a>] </td><td> [<a typeof="foaf:publications" href="http://portal.acm.org/author_page.cfm?id=81100657827&coll=GUIDE&dl=GUIDE&trk=0">ACM Author Profile</a>] </td></tr> <tr><td> [<a typeof="foaf:publications" href="http://google.com/scholar?q=author%3A%22Kesheng+Wu%22&num=100">Google Scholar</a>] </td><td> [<a typeof="foaf:publications" href="http://www.researchgate.net/profile/Kesheng_Wu/">ResearchGate</a>] </td><td> [<a typeof="foaf:publications" href="http://www.dblp.org/search/index.php?query=author:kesheng_wu">DBLP</a>] </td></tr> </table> <!--a typeof="foaf:publications" href="http://citeseer.ist.psu.edu/cs?q=Kesheng+Wu&submit=Citations">CiteSeer</a-->. </dl> <p align=center><img src="gif/blackbar.gif" width="398" height="4" alt="---"/></p> <!--div style="float:left;"--> <font size="-1"> <dl> <dt>Professional associations</dt> <dd>ACM - Distinguished Scientist</dd> <dd>IEEE - Senior Member</dd> <dt> Current work place <dd><a property="foaf:organization" href="http://www.universityofcalifornia.edu/">University of California</a> <dd><a property="foaf:organization" href="http://www.lbl.gov/">Lawrence Berkeley National Laboratory</a>, <a href="http://www.youtube.com/user/berkeleylab">LBNL on youTube</a>, <a href="http://www.ustream.tv/ucevents">UC events</a> <dd><a property="foaf:organization" href="http://crd.lbl.gov/">Computational Research Division</a> <dd><a property="foaf:organization" href="http://hpcrd.lbl.gov/">High Performance Computing Research Department</a> <dd><a property="foaf:organization" href="http://sdm.lbl.gov">Scientific Data Management Group</a> <dt> Earlier work <dd><a property="foaf:schoolhomepage" href="http://www.cs.umn.edu/~kewu/">Scientific computing work at University of Minnesota</a> <dt> Related sites on database research <dd><a href="http://www.spatial.cs.umn.edu/">University of Minnesota Database group</a> <dd><a href="http://db.cs.berkeley.edu/">UC Berkeley Database group</a> <dd><a href="http://infolab.stanford.edu/">Stanford InfoLab</a> <dd><a href="http://dblife.cs.wisc.edu/">DBLife</a> <dt> Related sites on scientific computing (eigenvalues in particular) <dd><a href="http://www.cs.wm.edu/~andreas/software">PRIMME</a> <dd><a href="http://www.caam.rice.edu/software/ARPACK/">ARPACK</a> <dd><a href="http://www-fp.mcs.anl.gov/ccst/research/reports_pre1998/algorithm_development/prism/prism.html">PRISM</a> </dl> </font> <!--/div--> <div style="float:right; text-align: right; margin: 0.5em; padding: 0.5em;"> <a href="http://www.linkedin.com/in/johnwu"><img src="http://sdm.lbl.gov/%7Ekewu/gif/photo2.gif" width="54" height="67" align=right border=0 alt="John Wu"/></a> <div style="font-size:small;"> <a href="http://www.lbl.gov/Disclaimers.html">Disclaimers</a><br/> <a href="http://crd.lbl.gov/wu/"><code>http://crd.lbl.gov/wu</code></a><br/> <script type="text/javascript">document.write(document.lastModified)</script> </div> </div> <script src="http://www.google-analytics.com/ga.js" type="text/javascript"> </script> <script type="text/javascript"> _uacct = "UA-812953-1"; pageTracker._trackPageview(); </script> </body></html> <!-- <img src="/~kewu/gif/charge.gif" alt=" " Align=left border=0/> <img src="/~kewu/gif/nanotube2.gif" Align=left border=0/> <img src="/~kewu/gif/XeFe1.gif" alt="XeFe" Align=center border=0/></a> <img src="/~kewu/gif/write.gif" Align=right border=0/> -->

More Related