1 / 12

UAH GRIDS Center Middleware Testing

UAH GRIDS Center Middleware Testing. Sandra Redman Information Technology and Systems Center and Information Technology Research Center National Space Science and Technology Center 256-961-7806 sredman@itsc.uah.edu Sandra.Redman@msfc.nasa.gov www.itsc.uah.edu.

umay
Download Presentation

UAH GRIDS Center Middleware Testing

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. UAH GRIDS Center Middleware Testing Sandra Redman Information Technology and Systems Center and Information Technology Research Center National Space Science and Technology Center 256-961-7806 sredman@itsc.uah.edu Sandra.Redman@msfc.nasa.gov www.itsc.uah.edu

  2. “…drowning in data but starving for knowledge” Data glut affects business, medicine, military, science How do we leverage data to make BETTER decisions??? User Community Information

  3. Data Mining • Automated discovery of patterns, anomalies from vast observational data sets • Derived knowledge for decision making, predictions and disaster response • http://datamining.itsc.uah.edu

  4. Mining Environment: When,Where, Who and Why? • WHERE • User Workstation • Data Mining Center • GRID • WHEN • Real Time • On-Ingest • On-Demand • Repeatedly • WHO • End Users • Domain Experts • Mining Experts • WHY • Event • Relationship • Association • Corroboration • Collaboration Data Mining

  5. Algorithm Development and Mining (ADaM) ADaM consists of: • a data mining engine • an extensible set of core functional applications to aid researchers in defining and performing data mining operations on spatial data sets • data mining modules as Open Grid Services Architecture (OGSA) services

  6. Input Output HDF HDF-EOS GIF PIP-2 SSM/I Pathfinder SSM/I TDR SSM/I NESDIS Lvl 1B SSM/I MSFC Brightness Temp US Rain Landsat ASCII Grass Vectors (ASCII Text) Intergraph Raster Others... GIF Images HDF-EOS HDF Raster Images HDF SDS Polygons (ASCII, DXF) SSM/I MSFC Brightness Temp TIFF Images Others... ADaM Engine Architecture Preprocessed Data Patterns/ Models Results Data Translated Data Processing Preprocessing Analysis Selection and Sampling Subsetting Subsampling Select by Value Coincidence Search Grid Manipulation Grid Creation Bin Aggregate Bin Select Grid Aggregate Grid Select Find Holes Image Processing Cropping Inversion Thresholding Others... Clustering K Means Isodata Maximum Pattern Recognition Bayes Classifier Min. Dist. Classifier Image Analysis Boundary Detection Concurrence Matrix Dilation and Erosion Histogram Operations Polygon Circumscript Spatial Filtering Texture Operations Genetic Algorithms Neural Networks Others...

  7. NMI Testing ADaM Feature Subset Selection application chosen for testing • Supervised pattern classification is a technique important in many domains • Used to improve both the runtime and accuracy of a supervised pattern classifier by eliminating noisy, irrelevant or redundant attributes or features from the data set. • Feature subset selection is the process of choosing a subset of the features from the original data set in order to maximize classifier accuracy • Both processor and data-intensive

  8. Parallel Version of Cloud Extraction • GOES images can be used to recognize cumulus cloud fields • Cumulus clouds are small and do not show up well in 4km resolution IR channels • Detection of cumulus cloud fields in GOES can be accomplished by using texture features or edge detectors Master Slave 1 Slave 2 Slave 3 GOES Image Laplacian Filter Sobel Horizontal Filter Sobel Vertical Filter Energy Computation Energy Computation Energy Computation Energy Computation Classifier Cloud Image GOES Image Cumulus Cloud Mask • Three edge detection filters are used together to detect cumulus clouds which lends itself to implementation on a parallel cluster

  9. Satellite Data Grid Mining Agent Archive X Grid Processor Grid Mining Agent Grid Mining Agent Satellite Data Grid Processor Grid Processor Archive Y Feature Subset Selection Application • Application ported to linux • Support Vector Machine downloaded and tested • Developed application scripts • Modified for Globus environment by writing simple Globus RSL file • Ran each combination of tools on a different node on the grid • Globus used to execute jobs on different machines • Experimented with both real and synthetic data

  10. Components used in testing • Globus toolkit - the “defacto standard,” an open source software toolkit and libraries for building grid applications; Resource Management, scheduling, information services, file transfer • GSI- OpenSSH - a modified version of OpenSSH that adds support for GSI authentication, providing a single sign-on remote login capability for the Grid • Condor-G - workload management system for compute-intensive jobs; job queueing mechanism, scheduling policy, priority scheme, resource monitoring, and resource management. • Network Weather Service - monitors and dynamically forecasts the performance various network and computational resources can deliver over a given time interval

  11. Some Lessons Learned • Component testing went well • Globus documentation improved, installation trouble-free, application port straight-forward • No problems encountered during Condor-G installation, but found problem with Condor-G under Redhat linux 7.3 when using nss_ldap. Developer provided workaround - start name service caching daemon (nscd) • GSI-OpenSSH installed, but Kerberos authentication did not work since linux was not compiled with PAM option (undocumented) • Network Weather Service installed, but learned we are more interested in MDS

  12. Some Lessons Learned • NMI Testbed Process working well • Answers found through NMI discussion lists from developers and other users • Have to “sell” the grid concept to developers, administrators, users • NMI Work proven helpful in other grid work • TeraGrid • ISS Space-based Science Operations Grid • CEOS Grid • Need more components!

More Related