
Medical text extraction

15 Sep 2009. Medical text extraction. Objective: there are a lot of biomedical articles, and they are too troublesome to read through when all you want to know is the author, institution, research database, analysis tools used, etc. The idea is to use information retrieval to extract the relevant info from articles. Approach: CRF++.


Presentation Transcript


  1. 15 Sep 2009 Medical text extraction

  2. Objective
     • A lot of biomedical articles
     • Too troublesome to read through
     • When all you want to know is: Author, Institution, Research Database, Analysis Tools used, etc.
     • Use information retrieval to extract relevant info from articles

  3. Approach
     • CRF++
     • Training files
        • XML tagged medical articles
        • Tagging done by some doctors (from Duke-NUS side)
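To make the training-file step concrete, here is a minimal sketch of how inline XML-style tags could be turned into the column format CRF++ reads (one token per line, whitespace-separated columns, label in the last column, a blank line between sequences). The slides do not describe the actual conversion, so everything here is an assumption: the function names `tagged_to_rows` and `to_crfpp` are hypothetical, the B-/I-/O labelling scheme is one common choice rather than the one necessarily used, and tokenising on whitespace is a simplification.

```python
import re

# Matches an opening tag, a closing tag, or a run of non-space, non-tag text.
TOKEN_RE = re.compile(r"<(/?)(\w+)>|([^\s<>]+)")


def tagged_to_rows(text):
    """Convert inline-tagged text into (token, label) rows.

    Assumption: B-/I-/O labels; the slides do not say which scheme was used.
    """
    rows = []
    current = None   # name of the tag currently open, or None outside any tag
    first = False    # True until the first token inside the open tag is emitted
    for closing, tag, word in TOKEN_RE.findall(text):
        if tag:
            if closing:
                current = None
            else:
                current, first = tag, True
        elif current is None:
            rows.append((word, "O"))
        else:
            rows.append((word, ("B-" if first else "I-") + current))
            first = False
    return rows


def to_crfpp(rows):
    """Render rows as 'token<TAB>label' lines; a blank line ends the sequence."""
    return "\n".join(f"{tok}\t{lab}" for tok, lab in rows) + "\n\n"


if __name__ == "__main__":
    # The tagged span comes from the slides; the surrounding words are made up.
    sample = "Folding used <database_name> mfold 3.2 online software </database_name> ."
    print(to_crfpp(tagged_to_rows(sample)), end="")
```

In a real training file, extra feature columns would normally sit between the token and the label; they are left out here because the slides do not list the 12 features.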

  4. Tags of importance
     • Author
     • Institution
     • Email
     • Database Name
     • Data Analysis Name

  5. Result (1/2)
     • 3-fold cross-validation
     • 50 articles used
        • More available, but not used due to noise (to be cleaned up)
     • 12 features used
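The evaluation setup can be sketched roughly as follows: the 50 articles are split into three folds, and each fold serves once as the held-out test set while the other two are used for training. This is only an illustration of 3-fold cross-validation in general. The helper name `k_fold_splits`, the shuffling, the fixed seed, and the placeholder article IDs are all assumptions, not details from the slides.

```python
import random


def k_fold_splits(items, k=3, seed=0):
    """Yield (train, test) pairs for k-fold cross-validation.

    Assumption: articles are shuffled once with a fixed seed, then dealt
    round-robin into k folds; the slides do not say how folds were formed.
    """
    items = list(items)
    random.Random(seed).shuffle(items)
    folds = [items[i::k] for i in range(k)]
    for i in range(k):
        test = folds[i]
        train = [x for j, fold in enumerate(folds) if j != i for x in fold]
        yield train, test


if __name__ == "__main__":
    # Placeholder IDs standing in for the 50 tagged articles.
    articles = [f"article_{n:02d}" for n in range(50)]
    for train, test in k_fold_splits(articles, k=3):
        print(len(train), "training articles,", len(test), "test articles")
```

With 50 articles the folds cannot be equal, so two folds hold 17 articles and one holds 16.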

  6. Result (2/2)

  7. Some difficulties
     • Some peculiar Asian names
     • Unpredictable for Database name:
        • <database_name> mfold 3.2 online software </database_name>
        • <database_name> regional mailing list of the Institute of General Practice, University Hospital Schleswig-Holstein </database_name>
        • <database_name> hospital and population data set </database_name>
        • <database_name> national registry </database_name>

  8. The End
