1 / 5

Data Intensive Computing

Data Intensive Computing. Graph algorithms for irregular, unstructured data John Feo , Pacific Northwest National Laboratory graph500 and data-intensive computing Richard Murphy, Sandia National Laboratories Large-scale knowledge discovery   Steve Reinhardt, Microsoft

lanai
Download Presentation

Data Intensive Computing

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Data Intensive Computing • Graph algorithms for irregular, unstructured data • John Feo, Pacific Northwest National Laboratory • graph500 and data-intensive computing • Richard Murphy, Sandia National Laboratories • Large-scale knowledge discovery   • Steve Reinhardt, Microsoft • Data intensive computing at SNL   • Andrew Wilson, Sandia National Laboratories • IBM's InfoSphere Streams   • Roger Rea, IBM • Graph500 and Data Intensive HPC • Richard Murphy, Sandia National Laboratory • Data Analytics • Phillip Morris, Platform Computing • Data Intensive Computing • Richard Altmaier, Intel

  2. 1. Please provide a definition for "Data Intensive Computing".   Please explain the difference between "finding a needle in a haystack" and "knowledge discovery".

  3. 2. "Data Intensive Computing" generally involves analyses of non-numeric data and the number of combinatorial possibilities grows rapidly.  The objective of the analysis is to find in the data, meaningful relationships.  How do we (a) test for convergence when not evaluating all possible combinations and (b) test for statistical significance--when the data is non-numeric?

  4. 3. "Data Intensive Computing" often involves the use of incomplete data.  How does this affect the analysis process?

  5. 4. If you could design an ideal computing architecture for Data Intensive Computing, what would it look like?

More Related