1 / 45

Building a Science B ase for the Information A ge

Building a Science B ase for the Information A ge. John Hopcroft Cornell University Ithaca, NY. Time of change. The information age is a revolution that is changing all aspects of our lives.

job
Download Presentation

Building a Science B ase for the Information A ge

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Building a Science Base for the Information Age John Hopcroft Cornell University Ithaca, NY Xiamen University

  2. Time of change • The information age is a revolution that is changing all aspects of our lives. • Those individuals, institutions, and nations who recognize this change and position themselves for the future will benefit enormously. Xiamen University

  3. Computer Science is changing Early years • Programming languages • Compilers • Operating systems • Algorithms • Data bases Emphasis on making computers useful Xiamen University

  4. Computer Science is changing The future years • Tracking the flow of ideas in scientific literature • Tracking evolution of communities in social networks • Extracting information from unstructured data sources • Processing massive data sets and streams • Extracting signals from noise • Dealing with high dimensional data and dimension reduction Xiamen University

  5. Computer Science is changing • Merging of computing and communication • The wealth of data available in digital form • Networked devices and sensors Drivers of change Xiamen University

  6. Implications for TCS • Need to develop theory to support the new directions • Update computer science education Xiamen University

  7. A short view of the future Xiamen University

  8. Digitization of medical records • Doctor – needs my entire medical record • Insurance company – needs my last doctor visit, not my entire medical record • Researcher – needs statistical information but no identifiable individual information Relevant research – zero knowledge proofs, differential privacy Xiamen University

  9. Xiamen University

  10. Zero knowledge proof • Graph 3-colorability • Problem is NP-hard - No polynomial time algorithm unless P=NP Xiamen University

  11. Zero knowledge proof Xiamen University

  12. Digitization of medical records is not the only system Car and road – gps – privacy Supply chains Transportation systems Xiamen University

  13. Xiamen University

  14. (Full Zoom Out) Animate Map Storm Tracks Total Precipitation Show Severe Regional Radar Zoom Map Click: Zoom In Zoom Out Pan Map NEXRAD Radar Binghamton, Base Reflectivity 0.50 Degree Elevation Range 124 NMI — Map of All US Radar Sites Xiamen University

  15. Xiamen University

  16. When will my bus arrive? Xiamen University

  17. IN 4 TO 5 MINUTES Xiamen University

  18. Web Chord Usage Index Probabilistic Text File Retrieve Text Index Page rank Web Link Graph Web Page Search Rank Discourse Word Centering Anaphora Retrieval Query Search Text Tracking the flow of ideas in scientific literature Yookyung Jo Xiamen University

  19. Xiamen University

  20. Xiamen University

  21. In the past, sociologists could study group of a few thousand individuals. Today with social networks we can study interaction among millions of individuals. One important activity is how communities form and evolve. Xiamen University

  22. Early work • Min cut – two equal size communities • Conductance – minimizes cross edges • Future work • Consider communities with more external edges than internal edges • Find small communities • Track communities over time • Develop appropriate definitions for communities • Understand the structure of different types of social networks Xiamen University

  23. Our view of a community Colleagues at Cornell Classmates TCS Me More connections outside than inside Family and friends Xiamen University

  24. On going research on finding communities Xiamen University

  25. Spectral clustering with K-means. Xiamen University

  26. Spectral clustering with K-means. Xiamen University

  27. Spectral clustering with K-means. Xiamen University

  28. Xiamen University

  29. Instead of two overlapping clusters, we find three clusters. Xiamen University

  30. Instead of clustering the rows of the singular vectors, find the minimum 0-norm vector in the space spanned by the singular vectors. The minimum 0-norm vector is, of course, the all zero vector, so we will require one component to be 1. Xiamen University

  31. Finding the minimum 0-norm vector is NP-hard. Use the minimum 1-norm vector as a proxy. Xiamen University

  32. Xiamen University

  33. Xiamen University

  34. Xiamen University

  35. Xiamen University

  36. Xiamen University

  37. Xiamen University

  38. Minimum 1-norm vector is not an indicator vector. By thresh-holding the components, convert it to an indicator vector for the community. Xiamen University

  39. Xiamen University

  40. Random walk How long? What dimension? Xiamen University

  41. Krylov subspace Find orthonormal basis. Update subspace in each step rather than just the probability vector Xiamen University

  42. Find minimum 1-norm vector in Krylov subspace. Actually allow vector to be close to subspace. Xiamen University

  43. Structure of communities How many communities is a person in? Small, medium, large How many seed points are needed to uniquely specify a community a person is in? Which seeds are good seeds? Etc. Xiamen University

  44. What types of communities are there? How do communities evolve over time? Xiamen University

  45. This is an exciting time for computer science. There is a wealth of data in digital format, information from sensors, and social networks to explore. It is important to develop the science base to support these activities. Xiamen University

More Related