The Medical Information System - MedISys
eHealth 2009: Second International ICST Conference on Electronic Healthcare for the 21st century
September 23-25, 2009 - Istanbul, Turkey
Erik van der Goot & the OPTIMA team (OPensource Text Information Mining and Analysis)
European Commission – Joint Research Centre (JRC)
Institute for the Protection and Security of the Citizen (IPSC)
Provide open source data collection and analysis for surveillance and epidemiology
Replace manual scanning of multiple newspapers and web portals
Support national and international Public Health (PH) organisations in monitoring issues of Public Health concern (e.g. chemical, biological, radiological and nuclear (CBRN) threats)
Gather, filter, classify, extract and aggregate health-related information
Monitor trends, detect breaking news
Visualise analysis results
Allows customised views
In combination with the RNS tool, allows manual moderation
Based on JRC’s Europe Media Monitor (EMM) technology (EMM live since 2002; http://emm.newsbrief.eu).
On request / initiative of the EC’s Directorate General for Health and Consumer Protection (DG SANCO).
Password-protected service for Public Health bodies since 2005.
Public service since early 2007 (http://medusa.jrc.it/, restricted functionality).
Development time line (as of 2009)
Domain specific application
First version 2005
EMM System redesign
Redesign based on EMM
NewsDesk Service (a.k.a. RNS)
EMM Open Source Monitoring Engine
~ 2,200 sources (world-wide, but primary focus on Europe)
~ 4,000 HTML web pages + RSS feeds
~ 100 specialist medical sites
~ 20 commercial newswires
Specialist pay-for sources (LexisMed)
24/7, near continuous monitoring
~80,000 new articles/items per day
Converts messy HTML (adverts, menus, HTML tags, ‘related stories’, etc.) into a clean, standardised, Unicode-encoded RSS format
Use RSS when available
Perform full content analysis
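The HTML clean-up step can be illustrated with a minimal Python sketch. It is not the EMM implementation, which the slides do not describe. The class name `TextExtractor`, the function `clean_html` and the set of skipped tags are assumptions chosen for illustration only.

```python
from html.parser import HTMLParser

# Tags whose content is treated as page furniture in this sketch (an assumption).
SKIPPED_TAGS = {"script", "style", "nav", "aside", "footer"}


class TextExtractor(HTMLParser):
    """Collect visible text while ignoring content inside SKIPPED_TAGS."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self._skip_depth = 0   # > 0 while inside a skipped element
        self._chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in SKIPPED_TAGS:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in SKIPPED_TAGS and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-empty text that is outside skipped elements.
        if self._skip_depth == 0 and data.strip():
            self._chunks.append(data.strip())

    def text(self):
        return " ".join(self._chunks)


def clean_html(raw_html: str) -> str:
    """Return the visible text of raw_html as one Unicode string."""
    parser = TextExtractor()
    parser.feed(raw_html)
    parser.close()
    return parser.text()
```

A production system would also need to recognise adverts and ‘related stories’ blocks, which rarely sit in such convenient tags; this sketch only shows the general idea of separating article text from markup.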
Directorate General Health and Consumer Protection (SANCO)
European Centre for Disease Control, Stockholm (ECDC)
European Food Safety Authority (EFSA)
World Health Organisation (WHO)
National Public Health organisations
Swiss Federal Office of Public Health
Icelandic Ministry of Health
Spanish Ministry of Sanitation & Ministry of Health and Consumer Protection
Institut de Veille Sanitaire (France)
Global Public Health Intelligence Network (Canada)
Danish Emergency Management Agency
Italian Ministry of Health and Ministry of Defence
Dutch Institute of Public Health & Food and Consumer Product Safety Authority
The (general?) public
Currently ~ 1,000 visitors and ~ 37,000 hits per day on the public system
Importance of multilingual information gathering
English - French
Spanish - Portuguese
Italian - German
influenzavirus tipo A
prasečí chřipka (Czech for swine flu)
Multilingual and cross-lingual analysis (1)
Barack Obama (eu,yo)
Barak Obama (az,wo)
Барак Обама (ba,uk)
باراك أوباما (ar)
باراك اوباما (ar,fa)
Барак Хуссейн Обама (ru)
Baraque Obama (pt)
บารัค โอบามา (th)
Բարաք Օբամա (hy)
ބަރަކް އޮބާމާ (dv)
באראק אבאמא (yi)
ברק אובאמה (he)
ބަރާކް އޮބާމާ (dv)
بارک اوبامہ (ur)
Documents from all languages get classified according to the same countries and categories.
An increase in the number of media reports on any country-category combination is detected independently of the reporting language.
Graphs and alerts may show events not yet reported in your own language.
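The language-independent detection described above can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the MedISys algorithm: the article fields (`lang`, `country`, `category`), the function names, and the `factor` and `min_count` thresholds are all hypothetical.

```python
from collections import Counter
from statistics import mean


def count_by_topic(articles):
    """Count articles per (country, category), ignoring the reporting language."""
    return Counter((a["country"], a["category"]) for a in articles)


def detect_increases(history, today, factor=2.0, min_count=3):
    """Return topics whose count today clearly exceeds the recent average.

    history: list of Counters, one per previous day
    today:   Counter for the current day
    factor and min_count are illustrative thresholds (assumptions).
    """
    alerts = []
    for key, count in today.items():
        baseline = mean(day.get(key, 0) for day in history) if history else 0.0
        if count >= min_count and count > factor * baseline:
            alerts.append(key)
    return sorted(alerts)
```

Because the counts are keyed on country and category only, four reports in four different languages raise the same counter, which is how an alert can appear before the event is reported in the user's own language.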
Results from Helsinki University
Add more news sources or new categories, e.g.
Events: Cricket World Cup, Rugby World Cup, UEFA Euro 2008
Other classes, e.g. deliberate release of chemicals
(on request of recognised users/partners)
Output formats: web pages, email alerts, or RSS feed to integrate into your environment.
daily vs. breaking news only
for daily notification: specify hour
for breaking news: level-dependent
User-selected languages only
Allows MedISys users to further customise their view of the news
Selection of specific languages and feeds
Allows human moderation
Manual selection of news items
Drag and drop compilation of newsletters
Allows moderators to forward news items to user groups
Allows user management
Via SMS alerts, emails or newsletters
Time line shows overview of relative activity of each category over time.
High coverage: helps monitor a large number of multilingual media reports.
Includes tools to help beat the information overflow:
clustering and duplicate detection;
categorisation; information aggregation; visualisation; mapping.
Further means are being implemented, e.g. multilingual medical event extraction.
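The slides do not say how duplicate detection is done. As one common approach, a minimal sketch could compare word shingles with Jaccard similarity. The function names, the shingle size and the 0.6 threshold below are assumptions for illustration, not MedISys parameters.

```python
import re


def shingles(text, size=3):
    """Return the set of overlapping word n-grams (shingles) in text."""
    words = re.findall(r"\w+", text.lower())
    if len(words) < size:
        return {tuple(words)} if words else set()
    return {tuple(words[i:i + size]) for i in range(len(words) - size + 1)}


def jaccard(a, b):
    """Jaccard similarity of two sets; 0.0 when both are empty."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def find_near_duplicates(texts, threshold=0.6):
    """Return index pairs (i, j), i < j, whose similarity reaches threshold."""
    sets = [shingles(t) for t in texts]
    pairs = []
    for i in range(len(sets)):
        for j in range(i + 1, len(sets)):
            if jaccard(sets[i], sets[j]) >= threshold:
                pairs.append((i, j))
    return pairs
```

The pairwise comparison is quadratic in the number of articles, so at tens of thousands of items per day a real system would need an indexing or hashing scheme on top of this idea.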
Special features of MedISys:
Fully automatic (moderation possible)
Real time (10-minute updates), 24/7
High multilinguality (43 languages)
Part of EMM family of applications, active team: much new functionality to come.