1 / 9

A Paradigm for Space Science Informatics

This article explores the field of informatics and its application in space science research. It discusses the common features of X-informatics, the role of data mining in knowledge discovery, and the potential for space science informatics to become a standalone research sub-discipline. The article concludes by proposing future informatics applications such as query-by-example systems, automated recommendation systems, and semantic annotation services.

sfriedman
Download Presentation

A Paradigm for Space Science Informatics

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. A Paradigm for Space Science Informatics George Mason University and QSS Group Inc., NASA-Goddard kborne@gmu.edu or kirk.borne@gsfc.nasa.gov and Kirk D. Borne Timothy E. Eastman (presenter) QSS Group Inc., NASA-Goddard eastman@mail630.gsfc.nasa.gov

  2. What is Informatics? • Informatics is the discipline of structuring, storing, accessing, and distributing information describing complex systems. • Examples: • Bioinformatics • Geographic Information Systems (= Geoinformatics) • New! Space Science Informatics • Common features of X-informatics: • Basic data unit is defined • Common community tools operate on data units • Data-centric and Information-centric approaches • Data-driven science • X-informatics is key enabler of scientific discovery in the era of large data science

  3. X-Informatics Compared Discipline X • Bioinformatics • Geoinformatics • Space Sc. Informatics Data Unit • Gene Sequence • Points, Vectors, Polygons • Time Series, Event Lists, Catalogs, Object Parameters Common Tools • BLAST, FASTA • GIS • CDAWeb, Bayes Inference, Cross Correlations, Principal Components

  4. Data-Information-Knowledge-Wisdom • T.S. Eliot (1934): “Where is the wisdom we have lost in knowledge? Where is the knowledge we have lost in information?”

  5. Key Role of Data Mining • Data Mining = an information extraction activity whose goal is to discover hidden knowledge contained in large databases • Data Mining is used to find patterns and relationships in the data • Data Mining is also called KDD • KDD = Knowledge Discovery in Databases • Data Mining is the killer app for scientific databases • Examples: • Clustering Analysis = group together similar items and separate dissimilar items • Classification Prediction = predict the class label • Regression = predict a numeric attribute value • Association Analysis = detect attribute-value conditions that occur frequently together

  6. Space Science Knowledge Discovery

  7. Space Weather Example

  8. Space Science Informatics • Key enabler for new science discovery in large databases • Large data science is here to stay • Common data browse and discovery tools, and common data structures, will enable exponential knowledge discovery within exponentially growing data collections • X-informatics represents the 3rd leg of scientific research: experiment, theory, and data-driven exploration • Space Science Informatics should parallel Bioinformatics and Geoinformatics: become a stand-alone research sub-discipline

  9. Future Work: Informatics Applications • Query-By-Example (QBE) science data systems: • “Find more data entries similar to this one” • “Find the data entry most dissimilar to this one” • Automated Recommendation (Filtering) Systems: • “Other users who examined these data also retrieved the following...” • “Other data sets that are relevant to this data set include...” • Information Retrieval Metrics for Scientific Databases: • Precision: “How much of the retrieved data is relevant to my query?” • Recall: “How much of the relevant data did my query retrieve?” • Semantic Annotation (Tagging) Services: • Report discoveries back to the science database for community reuse • Science / Technical / Math (STEM) Education: • Transparent reuse and analysis of scientific data in inquiry-based classroom learning (http://serc.carleton.edu/usingdata/ , DLESE.org ) • Key concepts that need defining (by community consensus): Similarity, Relevance, Semantics (dictionaries, ontologies)

More Related