
Integrating an Enterprise Taxonomy with Local Variations

This article discusses how to integrate an enterprise taxonomy with local variations, including the use of text analytics and governance. It also explores the benefits and challenges of this approach.


Presentation Transcript


  1. Integrating an Enterprise Taxonomy with Local Variations
     Tom Reamy, Chief Knowledge Architect, KAPS Group (Knowledge Architecture Professional Services)
     Program Chair, Text Analytics World
     http://www.kapsgroup.com

  2. Agenda
     • Introduction
     • Information Environment
     • Research Approach
     • Integrated Solution: governance, technology, text analytics
     • Conclusions

  3. Introduction: KAPS Group
     • Knowledge Architecture Professional Services: a network of consultants
     • Applied theory: faceted and emotion taxonomies, natural categories
     • Services: strategy (IM & KM, text analytics, social media, integration); taxonomy, text analytics, and social media development and consulting; Text Analytics Quick Start (audit, evaluation, pilot)
     • Partners: Smart Logic, Expert Systems, SAS, SAP, IBM, FAST, Concept Searching, Attensity, Clarabridge, Lexalytics
     • Clients: Genentech, Novartis, Northwestern Mutual Life, Financial Times, Hyatt, Home Depot, Harvard Business Library, British Parliament, Battelle, Amdocs, FDA, GAO, World Bank, Dept. of Transportation, etc.
     • Program Chair, Text Analytics World, March 29-April 1, San Francisco
     • Presentations, articles, white papers: www.kapsgroup.com
     • Current book: Text Analytics: How to Conquer Information Overload, Get Real Value from Social Media, and Add Smart Text to Big Data

  4. Information Environment
     • Multinational financial institution (10,000+)
     • Diversity: multiple languages, cultures, information needs and behaviors, organizational cultures
     • Initial application: knowledge management networks
     • Network definition: partly by subject area, but also political
     • Multiple applications: search, browse, web sites
     • Expertise location, Accounting-resource, analysis
     • Multiple audiences: internal and external, expert and non-expert (everyone is a non-expert in something)

  5. Approach
     • First step: research into variations
     • Use cases, levels of granularity
     • Common terms with different meanings
     • Interviews with multiple groups, roles, levels
     • Contextual interviews, information interviews
     • Taxonomy interviews: suggested terms and relationships
     • Analysis: taxonomies, search logs (suggest facets), HR expertise descriptions, local web sites, keywords, clustering, new terms
     • Group sessions: representatives of multiple constituencies talking out the differences

  6. Current Environment Overview
     • Current form of Topics: long and flat, 2 levels
     • Difficult to build on; desire for more specificity for experts and content; usability issues; no place for new topics
     • Multiple taxonomies: topics, organizational, Web site browse, industry codes
     • Partial overlaps, conflicting
     • Political: Social Development & Gender
     • Variations: official term, relationships of terms
     • New terms mostly at lower levels, with a stable structure
     • Cross-cutting topics: Finance of Education, Poverty

  7. Elements of the Solution
     • Taxonomy is only one part of the solution
     • Faceted metadata and text analytics
     • Enterprise taxonomy: death of?
     • Analysis of taxonomy: suitable for categorization and views
     • Structure: not too flat, not too large
     • Orthogonal categories: easier to tag and easier to map variations
     • Idea of Views: browse by local variations, mapped to official topics
     • Supported by software: Pool Party
     • Role-based views, activity-based views
     • Solution: integration of multiple components, two of them critical: Governance and Text Analytics
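
The "Views" idea on slide 7 (browse by local variation, map to official topics) can be illustrated with a small lookup structure. This is a minimal sketch under stated assumptions, not the project's implementation: the `TaxonomyView` class, the topic IDs, and every local label below are hypothetical, and nothing here reflects how Pool Party stores views.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional

# Hypothetical official topics, keyed by an invented topic ID.
OFFICIAL_TOPICS: Dict[str, str] = {
    "T100": "Education",
    "T110": "Finance of Education",
    "T200": "Social Development",
}


@dataclass
class TaxonomyView:
    """A browse view: local labels that resolve to official topic IDs."""
    name: str
    # Maps a lower-cased local label to an official topic ID (illustrative values).
    label_to_topic: Dict[str, str] = field(default_factory=dict)

    def resolve(self, local_label: str) -> Optional[str]:
        """Return the official topic ID for a local label, or None if unmapped."""
        return self.label_to_topic.get(local_label.strip().lower())


# Two hypothetical views: different local wording, same official topic.
education_network = TaxonomyView(
    name="education-network",
    label_to_topic={"school financing": "T110", "education": "T100"},
)
social_network = TaxonomyView(
    name="social-development-network",
    label_to_topic={"education funding": "T110", "social development": "T200"},
)


def official_label(view: TaxonomyView, local_label: str) -> Optional[str]:
    """Translate a view's local label into the official topic name."""
    topic_id = view.resolve(local_label)
    return OFFICIAL_TOPICS.get(topic_id) if topic_id else None
```

Because each view only stores a mapping onto shared topic IDs, a group can rename or regroup its browse labels without touching the official taxonomy, and unmapped labels surface as `None` for governance review.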

  8. Enterprise Information Integration: Reference Architecture
     [Diagram: layered reference architecture; labels as extracted]
     • Front-end application: Dashboard, Ad-hoc query, Mobile Apps, Portals/web, LoB Client UI, Mashups, Enterprise Search UI
     • Mid-tier layer: Analysis & Reporting, Data mining, Text Analytics engine, Enterprise search engine, Statistical Analysis, Predictive Analytics
     • Information Services/Semantic Layer: Vocabularies, Taxonomies, Core Metadata, Multilingual, Governance, Corporate data model, Reference Data, Information Standards & Policy
     • Consolidated Repositories: Data processing engine (e.g. Hadoop), Operational Data Store, Metadata Repository/Registry, Data marts, Data Warehouse, Master Data
     • Data Movement: ETL, Data Services, ESB …
     • Data Sources: unstructured data (Web content, Documents, Email, Shared drives), structured data (Business data, Statistical data), external data
     • Source systems named: External Metadata Mgt; e-publish, Day, Drupal, blogs, etc.; Jive; Sharepoint; Wbdocs, Jolis; SAP, PeopleSoft, Finance, etc.; Factiva, News, Research dbs; DDP

  9. Text Analytics: Power and Flexibility
     • Critical: text analytics tool
     • Same taxonomy term but different criteria and rules
     • Documents tagged for different uses and audiences
     • Education for specialists: deep, complex rules; very fine granularity; specialist jargon and acronyms
     • Education for generalists: high-level rules, general terms, simple
     • Education within Social Development: generalist rules plus social development terms (e.g., birth weight)
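
Slide 9's point, one taxonomy term backed by different rules per audience, can be sketched with simple keyword rules. This is an illustrative toy, assuming plain term matching rather than any real text analytics engine; the term lists, audience names, and thresholds are all invented for the example.

```python
import re
from typing import Dict, List, Set, Tuple

# Illustrative term lists (assumptions, not rules from the project).
GENERAL_EDU = ["school", "teacher", "education"]
SPECIALIST_EDU = ["pedagogy", "curriculum reform", "learning outcomes"]
SOCIAL_DEV = ["birth weight", "gender", "community"]

# A rule is a list of (terms, minimum hits) groups; every group must be met.
Rule = List[Tuple[List[str], int]]

# Same topic, "Education", with a different rule per audience.
RULES: Dict[Tuple[str, str], Rule] = {
    ("Education", "generalist"): [(GENERAL_EDU, 1)],
    ("Education", "specialist"): [(SPECIALIST_EDU, 2)],
    # Generalist rules plus social development terms, as on the slide.
    ("Education", "social-development"): [(GENERAL_EDU, 1), (SOCIAL_DEV, 1)],
}


def count_hits(text: str, terms: List[str]) -> int:
    """Count how many distinct terms occur in the text as whole words."""
    lowered = text.lower()
    return sum(
        1 for term in terms
        if re.search(r"\b" + re.escape(term) + r"\b", lowered)
    )


def tag_document(text: str) -> Set[Tuple[str, str]]:
    """Return every (topic, audience) pair whose rule the text satisfies."""
    return {
        key for key, rule in RULES.items()
        if all(count_hits(text, terms) >= minimum for terms, minimum in rule)
    }
```

For example, `tag_document("The report links low birth weight to later school attendance.")` yields the generalist and social-development tags but not the specialist one, so the same document can be surfaced differently to different audiences.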

  10. Proposed Model for a Taxonomy Eco-System
      • New Topic Taxonomy
      • Enhanced structure and coverage: deeper framework to build on
      • Implemented in new software: flexible, solves old debates
      • Facets: remove complexity, increase coverage (10 x 10)
      • Powered by auto-categorization: tagging, advanced applications
      • Combine with current data: HR (expertise), other
      • Taxonomy is part of integrated information management: search, content management, IM policy, external Web, etc.
      • Facets: Subject (topic), Industry, Program, Methods, Business Activities, Organization, Skills, Document Type, Project, Product, Geography
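
To make the facet list on slide 10 concrete, here is a minimal sketch of faceted metadata and a filter over it. The facet names come from the slide; the documents, their values, and the `filter_by_facets` helper are hypothetical and only illustrate how independent facets combine at query time.

```python
from typing import Dict, List, Set

# Each document carries a set of values per facet (all values invented).
Facets = Dict[str, Set[str]]

DOCS: Dict[str, Facets] = {
    "doc-1": {"Subject": {"Education"}, "Geography": {"Kenya"}, "Document Type": {"Report"}},
    "doc-2": {"Subject": {"Education", "Poverty"}, "Geography": {"Peru"}, "Document Type": {"Brief"}},
    "doc-3": {"Subject": {"Poverty"}, "Geography": {"Kenya"}, "Document Type": {"Report"}},
}


def filter_by_facets(docs: Dict[str, Facets], selection: Facets) -> List[str]:
    """Return sorted IDs of documents that carry every selected facet value."""
    matches = []
    for doc_id, facets in docs.items():
        # A document matches when, for each selected facet, the selected
        # values are a subset of the values the document is tagged with.
        if all(values <= facets.get(facet, set()) for facet, values in selection.items()):
            matches.append(doc_id)
    return sorted(matches)
```

A cross-cutting request such as Poverty reports about Kenya becomes `filter_by_facets(DOCS, {"Subject": {"Poverty"}, "Geography": {"Kenya"}})` rather than a dedicated node in one deep hierarchy.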

  11. Governance Structure
      [Diagram; levels, responsibilities, and groups as extracted]
      • Executive function (sponsorship): set overall policy and strategies; drive direct acceptance
      • Strategic level (integration with existing Information Management): resource decisions; organizational structure
      • Taxonomy management responsibilities: revise/approve taxonomy structure; rules for changes; text analytics/research; manage implementation; gather and make changes; content and tagging analysis; coordinate feedback; communication/training; provide feedback
      • Groups shown: Tax Management Central (prioritize changes, cross-cutting); Operational Working Group; Tax Management (Anchors & Regions); IM focal point, KM, web, etc. (Anchors & Regions, IMT); Users & SMEs (feedback loop); IT Systems (coordinate changes in dependent systems)

  12. Critical Success Factors: Governance
      • Governance policy, process, and enforcement
      • Incorporate enforcement into the publishing process / hybrid auto-categorization
      • Taxonomy management is part of overall information management, with additional taxonomy roles and functions
      • Best practice: combination of central and distributed teams
      • Taxonomy-specific: Taxonomy Manager, central and Networks
      • Revise taxonomy structure, rules for changes, manage implementation
      • Enforcement: combination of central and Networks
      • Feedback: metrics to identify the need for new terms and to remove old terms
      • Combination of user feedback in the application and periodic analysis
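
The feedback metrics on slide 12 (identify the need for new terms, remove old terms) could be approximated from search logs and tag counts. The sketch below is an assumption about one simple way to do that periodic analysis; the function names, the `min_count` threshold, and the sample data in the usage note are hypothetical.

```python
from collections import Counter
from typing import Dict, Iterable, List


def candidate_new_terms(
    queries: Iterable[str],
    taxonomy_terms: Iterable[str],
    min_count: int = 2,
) -> List[str]:
    """Return frequent search queries that match no existing taxonomy term."""
    known = {term.lower() for term in taxonomy_terms}
    counts = Counter(query.strip().lower() for query in queries)
    return sorted(
        query for query, n in counts.items()
        if n >= min_count and query not in known
    )


def unused_terms(taxonomy_terms: Iterable[str], tag_counts: Dict[str, int]) -> List[str]:
    """Return taxonomy terms that have never been applied as a tag."""
    return sorted(term for term in taxonomy_terms if tag_counts.get(term, 0) == 0)
```

With queries `["microfinance", "Microfinance", "poverty", "poverty", "gender"]` and taxonomy terms `["Poverty", "Education", "Gender"]`, the first function proposes `microfinance` as a candidate, and `unused_terms` flags `Education` if the tag counts are `{"Poverty": 5, "Gender": 1}`. Both lists are inputs for the taxonomy managers to review, not automatic changes.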

  13. Conclusion
      • Taxonomies are an enterprise resource
      • Danger of a monolithic taxonomy overriding local variations: less useful and/or ignored
      • Danger of chaos: multiple variations losing the ability to coordinate and communicate
      • Solution: research into users, use cases, semantic resources
      • Integrated solution: importance of distributed governance
      • Integrated solution: text analytics to reflect local variations and provide a means to integrate them into a unified solution
      • Facets, text analytics, and browse views solve 75%; the rest is manageable
      • No one was entirely happy, so we must be doing something right

  14. Questions?
      Tom Reamy, tomr@kapsgroup.com
      KAPS Group, Knowledge Architecture Professional Services
      http://www.kapsgroup.com
      www.TextAnalyticsWorld.com, March 29-April 1, San Francisco
