1 / 21

UIMA Introduction

UIMA Introduction. SHARPn Summit June 11, 2012. Outline . UIMA Terminology (not just TLAs) Parts of a UIMA pipeline Running a pipeline Viewing annotations interactively. UIMA Terminology. CAS XCAS JCAS View Analysis Engine ( AE ) / Annotator

audi
Download Presentation

UIMA Introduction

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. UIMA Introduction SHARPn Summit June 11, 2012

  2. Outline • UIMA Terminology (not just TLAs) • Parts of a UIMA pipeline • Running a pipeline • Viewing annotations interactively

  3. UIMA Terminology • CAS XCAS JCAS View • Analysis Engine (AE) / Annotator • XML output: XCAS XMI • Type System JCasGen • CAS Visual Debugger (CVD) • CPE(Collection Processing Engine)

  4. UIMA • Framework • Defining data types • Passing data from one component to another • Tooling • Viewing results • Debugging • Editing XML visually

  5. Data Through a Pipeline • Type System • Defines the data types passed along • CAS(Common Analysis Structure) • Container for the data passed along • Created by UIMA from the Type System

  6. Parts of a UIMA Pipeline • Collection Reader • Read input document • Analysis Engine(s) / Annotator(s) • Process document • CAS Consumer • Output data

  7. Tying a Pipeline Together • CPEdescriptor (Collection Processing Engine) • Collection Reader • Analysis Engine(s) • CAS Consumer • Aggregate analysis engine • Multiple Analysis Engines and their order

  8. UIMA term Collection Reader Analysis Engine Analysis Engine Analysis Engine CAS Consumer Example Read files from a dir Sentence detector Tokenizer annotator Part of Speech tagger Output tokens to DB Pipeline Example

  9. UIMA plugin for Eclipse • Provides visual editors for descriptors • Mini GUI for selecting options • Rather than editing XML directly • An “Update site” exists for installing plugin http://www.apache.org/dist/incubator/uima/eclipse-update-site

  10. UIMA Tooling Options • Tools: • CPE Configurator • CVD (CAS Visual Debugger) • Options: • Command line scripts/.bat files • Run within Eclipse

  11. Running a Pipeline - CPE • cTAKES provides a script and a bat file runctakesCPE • Choose aCPE descriptor, such as test_plaintext.xml from cTAKESdesc/cdpdesc/collection_processing_engine

  12. Viewing Annotations - CVD • Viewing annotations using the CVD • Load the Type System • Load the XCAS or XMI

  13. Annotation Viewers • UIMA tools • CVD (CAS Visual Debugger) • Annotation viewer • Viewing XML output • Any XML viewer • Any text editor

  14. Questions? http://uima.apache.org/

  15. Supplemental slides follow

  16. Options to Run a Pipeline • CPE GUI • CVDGUI • Single Aggregate Analysis Engine • No Collection Reader • Instantiate a CpeDescription and invoke the process() method • uimaFIT– removes dependency on XML

  17. Creating a New Annotator • Within Eclipse • Create Java project • Right click -> Add UIMA Nature • Add UIMA jars to .classpath (Build Path) • Create Analysis Engine (AE) descriptor • Add types to AE descriptor, or optionally create separate Type System descriptor • Write code!

  18. Running an AE in CVD Using CVD to run an Analysis Engine • No Collection Reader • Single Analysis Engine (can be an aggregate) • No CAS Consumer • Load an Analysis Engine • Paste/type in text to process Family history of hyperlipidemia.

  19. Modifying a parameter UIMA’s descriptor editors allow you to modify most parameters without looking at the XML itself.

  20. Links • Getting started with UIMA http://uima.apache.org/doc-uima-annotator.html • UIMA Update site for use in Eclipse http://www.apache.org/dist/incubator/uima/eclipse-update-site

  21. Email address masanz.james@mayo.edu

More Related