80 likes | 216 Views
Semantics through Collective Intelligence. Prof. Dr. Steffen Staab. TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: A A A A A A A A A A A A A A A A. Collective Intelligence. Collective datasets Hosted public datasets Gated datasets Social networks,…
E N D
Semantics through Collective Intelligence Prof. Dr. Steffen Staab TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AAAAAAAAAAAAAAAA
Collective Intelligence Collective datasets Hosted public datasets Gated datasets Social networks,… Wikipedia style Actually includes Discussions Editor hierarchies Policies Pagerank style highly effective no coordination no control (modulo spamming) Different Flavors • Gene ontology • DBPedia, Public census data • Facebook, LinkedIn • Wikiversity • FAQs • Yahoo Answers, Lycos IQ • Tagging • Flickr, Delicious, … • geotagging
Collective Intelligence Collective datasets Hosted public datasets Gated datasets Social networks,… Wikipedia style Actually includes Discussions Editor hierarchies Policies Pagerank style highly effective no coordination no control (modulo spamming) Different Flavors Sizes History Semantics
Billion Triples Challenge: The Power of Collective Datasets flexible Common approach: Import dump to new data silo extensible scaleable RDFS rules geo ... birthplace WordNet webby Swoogle PlaceOfBirth birthplace GeoNames Geo querying RDFS Rules fulltext Semantic Web? + + + >1Gt Swoogle WordNet GeoNames monolithic + inflexible not scaleable 12 months in 2005/06 700M triples
…but not quite far enough Stronger: • Semantics is weak ) Some Collective Ontology Engineering Bigger: • There is no data like more data ) more data sources to create ) more data sources to include Faster: • Scaleability of querying ) a matter of science, not one of witchcraft! ) Impressive track record from tiny to medium size in 10 years
Impacts • New ways of exploring data • Semaplorer (http://btc.isweb.uni-koblenz.de) • Parallax (http://mqlx.com/~david/parallax/) • New ways of mining data • New ways of relating data
Conclusion New forms of collective intelligence generate semantic data ) Use them!