220 likes | 289 Views
Discover efficient ways to analyze vast data with social media integration and pattern recognition. Explore various use cases like IMDB and fraud detection through graph patterns. Benefit from powerful tools like OCP, GrafMED, and ConceptBrowser for enhanced data processing.
E N D
Index • InformationRetrieval • OCP • GrafMED • ConceptBrowser • DifPubMed • SocialMedia • RecerCaixa • Reviewers Recommender
Index • Information Retrieval • OCP • GrafMED • ConceptBrowser • DifPubMed • SocialMedia • RecerCaixa • Reviewers Recommender
Information Retrieval • IMDB • Goal • Apply DEX in a well-known social network • Perform different kinds of information retrieval queries • Link analysis • Social-oriented queries • Pattern recognition • Keyword search • IMDB Use Case • www.imdb.com • Inherent network-structure of the data
Information Retrieval • IMDB • Source Data • 10 entities, 12 relationships • More than 845,000 titles and 2,000,000 people • Auxiliary tables with casts, roles, genres, extra movie & person info • Dex Graph • Built in less than 21 minutes • More than 25 million nodes • Less than 1.14 GB of DEX data
Information Retrieval • IMDB
Index • Information Retrieval • OCP • GrafMED • ConceptBrowser • DifPubMed • SocialMedia • RecerCaixa • Reviewers Recommender
OCP • Requesters • Spanish Patrimonial Control Office • Goal • Detect fraud in real patrimonial transactions • Data Model • People, societies • Patrimonial transactions • Procedure • An expert defines a fraudulent pattern graph pattern • Graph pattern allows user to find fraudulent people/societies
Index • Information Retrieval • OCP • GrafMED • ConceptBrowser • DifPubMed • SocialMedia • RecerCaixa • Reviewers Recommender
GrafMED • Requesters • Catalan Institute of Oncology • Goal • Support application to identify patterns (rules) in the procedures applied to cancer patients • Data Model • 50000 patients from the Bellvitge hospital (1994 – 2006) • 67 types of tumors • Why DEX? • Querying capability, multiple data sources, navigational characteristics • Larger amount of data, with hundreds of thousands of patients
Index • Information Retrieval • OCP • GrafMED • ConceptBrowser • DifPubMed • SocialMedia • RecerCaixa • Reviewers Recommender
ConceptBrowser • Requesters • Havas Media • Goal • Support application for brainstorming tasks. • Finds not obvious conceptual relations among words and concepts. • Data • 440.882 concepts • 117.278 groups of synonymic words • 116.988 words • 10.922.306 relations among words and concepts • Why DEX? • Querying capability, navigational characteristics
Index • Information Retrieval • OCP • GrafMED • ConcetpBrowser • DifPubMed • SocialMedia • RecerCaixa • Reviewers Recommender
DifPubMed • Requesters • Havas Media • Goal • Bibex extension focused on medical researchers. • Identifies researcher social networks, scientific evolution on a particular medication and researchers influence. • Data • 1.502.599 publications • 2.136.184 researchers • 194.991 medications • 3.437.476 references
Index • Information Retrieval • OCP • GrafMED • ConcetpBrowser • DifPubMed • SocialMedia • RecerCaixa • Reviewers Recommender
SocialMedia • Requesters • Havas Media • Goal • Tool for analyzing information propagation in any social network. • Identifies useful information such as how fast and how far information is propagated in time. • Used Social Networks • Youtube – Users, videos, comments, etc. • Enron – Users, e-mails, etc. • Flickr – Users, photos, comments. • Orkut – Users, media(photos, music, etc.), messages, etc. • Twitter– Users, messages, etc. • Vi.vu– Medical professionals, non-professional users, questions, answers, references, etc.
SocialMedia • Results Influent Persons Distribution
Index • Information Retrieval • OCP • GrafMED • ConcetpBrowser • DifPubMed • SocialMedia • RecerCaixa • Reviewers Recommender
RecerCaixa • Motivation • Selected in the Call for Applications for Research Grants Tool (Recercaixa). • Goal • Support tool for exploring and recommending audiovisual content. • Oriented to be applied in primary and secondary education. • Data Contribution • Catalan public broadcaster Televisió de Catalunya (TV3)
Index • Information Retrieval • OCP • GrafMED • ConcetpBrowser • DifPubMed • SocialMedia • RecerCaixa • Reviewers Recommender
Reviewers Recommender • Requesters • Ministry of Science and Innovation of the Spain Government (MICINN) • Goal • Tool for identifying and recommending experts in a particular topic. • Experts in a topic • People highly contributing to documents related to a topic • Data Model • Document contributors • Documents
Thanks for your attention • Any questions? SPARSITY-TECHNOLOGIES Jordi Girona, 1-3, Edifici K2M 08034 Barcelona info@sparsity-technologies.com http://www.sparsity-technologies.com DAMA-UPC. DATA MANAGEMENT (UPC)Departamentd'Arquitectura de ComputadorsEdifici C6-S103. Campus Nord. Jordi Girona, 1-3. 08034 - Barcelona www.dama.upc.edu