
Building Frameworks of Organizational Intelligence Library Assessment Conference, 2008 Seattle

Joe Zucca, Director for Planning and Communication, University of Pennsylvania Libraries


Presentation Transcript


  1. Building Frameworks of Organizational Intelligence. Library Assessment Conference, 2008, Seattle. Joe Zucca, Director for Planning and Communication, University of Pennsylvania Libraries

  2. Sample Data Structure I: Data Farm Funds Report - Biology

  3. Sample Data Structure I: Data Farm Funds Report - Casalini Libri

  4. Sample Data Structure II - Data Farm Table - Funds Process

  5. Sample Data Structure III - Voyager Funds Schema. [Schema diagram; tables shown: bib_text, bib_master, invoice, invoice_status, invoice_line_item, invoice_line_item_funds, line_item, purchase_order, fund, vendor]

  6. Leveraging data structures: Information Loop. [Diagram labels] Service Environment: ERED, Catalog, Funds, SFX | ERM?, DYNIX, BorrowDirect, Circulation, WEB, Apache, ezproxy, Ref|Instruct. Data Streams. Data Bureau: Clean | Anonymize, Normalize, People and Network Data. Data Farm Environment: Integrate, Report Builder, Dashboard, Inform.

  7. Data Farm Oracle Space: Overview. 30+ GB, in 162 tables, for collecting and disseminating management info. [Diagram labels] E-Resource Use; Resolve Resource; Resolve People; Resolve Places; Digital Library; Voyager-Supported Services; Building Use; Staff Census; Reference & Instruction (dynamic); Resolution Services; LDAP; Administration; Circulation; Reference Contact; Web Analytics; Acquisitions; Funds; Image Collection Use; Holdings; Consortia; ILL/DocDel; Tech Processing Workflow; 70-Member ILL Coop.; Copier | Printer Use; Gate Swipes

  8. Service Environment: Data Farm Silos. [Diagram labels] ERED, Catalog, Funds, SFX | ERM?, DYNIX, BorrowDirect, Circulation, WEB, Apache, ezproxy, Ref|Instruct, funds

  9. A different take on a management information framework

  10. Sample Data Structure: EZ-Proxy Log xxx.xx.xxx.xxx|-|zucca|[26/Jul/2007:15:41:01 -0500]| GET https://proxy.library.upenn.edu:443/login?proxySessionID=10335905&url= http://www.csa.com/htbin/dbrng.cgi?username=upenn3&access=upenn34&cat=psycinfo&adv=1 HTTP/1.1| 302|0|http://www.library.upenn.edu/cgi-bin/res/sr.cgi?community=59| Mozilla/5.0 (Macintosh; U; PPC Mac OS X; en) AppleWebKit/418.9.1 (KHTML, like Gecko) Safari/419.3| NGpmb6dT6JXswQH|__utmc=94565761; ezproxy=NGpmb6dT6JXswQH; hp=/; proxySessionID=10335514; __utmc=247612227; __utmz=247612227.1184251774.1.1.utmccn=(direct)| utmcsr=(direct)|utmcmd=(none); UPennLibrary=AAAAAUaWP5oAACa4AwOOAg==; sfx_session_id=s6A37A3E0-3B8E-11DC-80E9-85076F88F67F

  11. Sample Data Structure: EZ-Proxy Log (same log entry as slide 10). Highlighted: xxx.xx.xxx.xxx = .edu | on-campus | Van Pelt Library | staff office. Timestamp = July 26, 2007, 3:41 p.m. Device = Mac | OS X. Browser = Safari

  12. Sample Data Structure: EZ-Proxy Log (same log entry as slide 10). Highlighted: Referring URL = library web page listing resources on Psychology (community 59) | Olson, Coordinating Bibliographer

  13. Sample Data Structure: EZ-Proxy Log (same log entry as slide 10). Highlighted: zucca [Penn authentication key] = Provost cntr | LIBRARY | staff

  14. Sample Data Structure: EZ-Proxy Log (same log entry as slide 10). Highlighted: SessionID 10335905 = PsycInfo (resource identifier 7014)

  15. Sample Data Structure: EZ-Proxy Log (same log entry as slide 10). Highlighted: sfx_session_id = Journal of Experimental Child Psychology, “Relations Among Musical Skills, Phonological Processing and Early Reading Ability in Preschool Children.”

  16. Sample Data Structure: EZ-Proxy Log (same log entry as slide 10). TRANSLATION:
      • On July 26, 2007, at 3:41 p.m.,
      • a library staff member (Provost’s center affiliate),
      • using a staff computer (Mac | OS X | Safari) located in Van Pelt Library, and within a single session,
      • logged into PsycInfo (CSA) and
      • made an OpenURL connection to an article on reading aptitude published in the Journal of Experimental Child Psychology.
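The field-by-field reading of the log entry on slides 10-16 can be sketched in code. This is a minimal Python sketch, not the Data Farm's actual parser: the field order is inferred from the sample entry alone, the names in `FIELDS` and the `parse_entry` helper are hypothetical, and the sample line has the whitespace from slide line-wrapping removed.

```python
from datetime import datetime
from urllib.parse import urlsplit, parse_qs

# Field order inferred from the sample entry on the slides (an assumption,
# not taken from EZproxy documentation or the site's LogFormat directive).
FIELDS = ["ip", "ident", "user", "timestamp", "request", "status",
          "bytes", "referer", "user_agent", "session", "cookies"]

# The sample entry from slide 10, with slide line-wrap whitespace removed.
SAMPLE = (
    "xxx.xx.xxx.xxx|-|zucca|[26/Jul/2007:15:41:01 -0500]|"
    "GET https://proxy.library.upenn.edu:443/login?proxySessionID=10335905&url="
    "http://www.csa.com/htbin/dbrng.cgi?username=upenn3&access=upenn34"
    "&cat=psycinfo&adv=1 HTTP/1.1|302|0|"
    "http://www.library.upenn.edu/cgi-bin/res/sr.cgi?community=59|"
    "Mozilla/5.0 (Macintosh; U; PPC Mac OS X; en) AppleWebKit/418.9.1 "
    "(KHTML, like Gecko) Safari/419.3|NGpmb6dT6JXswQH|"
    "__utmc=94565761; ezproxy=NGpmb6dT6JXswQH; hp=/; proxySessionID=10335514; "
    "__utmc=247612227; __utmz=247612227.1184251774.1.1.utmccn=(direct)|"
    "utmcsr=(direct)|utmcmd=(none); UPennLibrary=AAAAAUaWP5oAACa4AwOOAg==; "
    "sfx_session_id=s6A37A3E0-3B8E-11DC-80E9-85076F88F67F"
)


def parse_entry(line):
    """Split one pipe-delimited log entry into named fields."""
    # The cookie field itself contains "|", so stop after the 10th separator.
    parts = [p.strip() for p in line.split("|", len(FIELDS) - 1)]
    if len(parts) != len(FIELDS):
        raise ValueError("unexpected number of fields")
    rec = dict(zip(FIELDS, parts))
    rec["timestamp"] = datetime.strptime(
        rec["timestamp"].strip("[]"), "%d/%b/%Y:%H:%M:%S %z")
    rec["status"] = int(rec["status"])
    # The request looks like "GET <url> HTTP/1.1"; keep only the URL part.
    method, rest = rec["request"].split(" ", 1)
    target = rest.rsplit(" ", 1)[0]
    rec["method"] = method
    query = parse_qs(urlsplit(target).query)
    rec["proxy_session_id"] = query.get("proxySessionID", [None])[0]
    # The referring page carries the subject "community" number.
    ref_query = parse_qs(urlsplit(rec["referer"]).query)
    rec["community"] = ref_query.get("community", [None])[0]
    # Cookies are "name=value" pairs separated by ";" (later duplicates win).
    rec["cookies"] = dict(
        pair.strip().split("=", 1)
        for pair in rec["cookies"].split(";") if "=" in pair
    )
    return rec


record = parse_entry(SAMPLE)
print(record["user"], record["timestamp"].isoformat(), record["community"])
```

Each extracted value (user, IP, session ID, community number, `sfx_session_id`) is only a key; turning it into "library staff", "Van Pelt Library" or "PsycInfo" requires the lookups against people, network and resource data that the slides describe.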

  17. Events: a Structure for Library Decision Metrics. [Diagram: EVENT at the center] Environment: URL, IP-Domain, Date|Time, Campus Location. Client: School, Dept, Major, Rank. Library Program: Service Genre, Org|Program, Staff. Content: Bibliographic Content. Budget and Client Program labels: Fund Code, Unrestricted, Restricted, Expnd Amt, Course Descriptor, Level, Dept, Dept Reqmnt, School

  18. Service Environment: Data Farm Silos. [Diagram labels] ERED, Catalog, Funds, SFX | ERM?, DYNIX, BorrowDirect, Circulation, WEB, Apache, ezproxy, Ref|Instruct, funds

  19. Data Farm Tiered Framework. [Diagram labels] Service Environment: ERED, Catalog, Funds, SFX | ERM?, DYNIX, BorrowDirect, Circulation, WEB, Apache, ezproxy, Ref|Instruct. Repository processes: parse, resolve, anonymize, secure! Analysis: query (sql). Other labels: DATA STORE, oracle, perl, XML, XL
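The "parse, resolve, anonymize" steps named on this slide can be illustrated with a small Python sketch. Everything specific here is an assumption made for illustration: the `NETWORK` and `PEOPLE` lookup tables, the 10.1. address prefix, the secret key, and the choice of a keyed SHA-256 hash are hypothetical and are not described in the presentation.

```python
import hashlib
import hmac

# Hypothetical lookup tables standing in for network and people data.
NETWORK = {"10.1.": ("on-campus", "Van Pelt Library")}
PEOPLE = {"zucca": {"school": "Provost cntr", "dept": "LIBRARY", "rank": "staff"}}
SECRET_KEY = b"replace-with-a-real-secret"  # placeholder, not a real key


def resolve(event):
    """Attach place and affiliation attributes to a parsed event."""
    out = dict(event)
    place = next(
        (value for prefix, value in NETWORK.items()
         if event["ip"].startswith(prefix)),
        ("unknown", "unknown"),
    )
    out["campus"], out["location"] = place
    out.update(PEOPLE.get(event["user"],
                          {"school": None, "dept": None, "rank": None}))
    return out


def anonymize(event):
    """Drop direct identifiers; keep a keyed hash so events can still be grouped."""
    out = {k: v for k, v in event.items() if k not in ("user", "ip")}
    out["user_token"] = hmac.new(
        SECRET_KEY, event["user"].encode(), hashlib.sha256).hexdigest()
    return out


event = {"ip": "10.1.2.3", "user": "zucca", "resource": "PsycInfo"}
clean = anonymize(resolve(event))
print(clean)
```

Resolving before anonymizing matters in this sketch: the affiliation and location attributes are derived from the identifiers, so the identifiers can only be dropped once the lookups are done.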

  20. METRIDOC Event Elements Articulated in an XML Schema. [Diagram: EVENT linked to Environment, Client, Client Program, Library Program, Budget, Content]
      • <ENVIRONMENTAL> <v.domain> <p.domain> <date> <time> <session> <url> </ENVIRONMENTAL>
      • <CLIENT> <school> <dept> <rank> </CLIENT>
      • <LIBRARY PRGRM> <service genre> <staff_name> <staff_org> <staff_prgm> </LIBRARY PRGRM>
      • <CONTENT> <title> <author> <holdings> <call_no> <isbn> <issn> <url> <res_id> <sfx_id> </CONTENT>

  21. METRIDOC Event Elements Articulated in an XML Schema (same element list as slide 20). Highlighted: Environment = URL, IP-Domain, Date|Time, Campus Location

  22. METRIDOC Event Elements Articulated in an XML Schema (same element list as slide 20). Highlighted: Client = School, Dept, Major, Rank

  23. METRIDOC Event Elements Articulated in an XML Schema (same element list as slide 20). Highlighted: Library Program = Service Genre, Org|Program, Staff

  24. METRIDOC Event Elements Articulated in an XML Schema (same element list as slide 20). Highlighted: Content = Bibliographic Content
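As an illustration of the event elements listed on slides 20-24, the Python sketch below builds one event document with the standard library. Several things are assumptions of the sketch rather than the METRIDOC schema itself: the `EVENT` root element, the replacement of spaces with underscores in `LIBRARY PRGRM` and `service genre` (spaces are not legal in XML names), and the mapping of "Provost cntr | LIBRARY | staff" onto school, dept and rank.

```python
import xml.etree.ElementTree as ET

# Group and element names follow slide 20, with spaces replaced by
# underscores so that they are legal XML names (an assumption).
SCHEMA = {
    "ENVIRONMENTAL": ["v.domain", "p.domain", "date", "time", "session", "url"],
    "CLIENT": ["school", "dept", "rank"],
    "LIBRARY_PRGRM": ["service_genre", "staff_name", "staff_org", "staff_prgm"],
    "CONTENT": ["title", "author", "holdings", "call_no", "isbn", "issn",
                "url", "res_id", "sfx_id"],
}


def build_event(values):
    """Return an <EVENT> element; `values` maps group -> {element: text}."""
    event = ET.Element("EVENT")  # hypothetical root element name
    for group, names in SCHEMA.items():
        node = ET.SubElement(event, group)
        for name in names:
            text = values.get(group, {}).get(name)
            if text is not None:  # omit elements with no known value
                ET.SubElement(node, name).text = str(text)
    return event


# Values taken from the log-entry walkthrough on slides 10-16.
event = build_event({
    "ENVIRONMENTAL": {"date": "2007-07-26", "time": "15:41:01",
                      "session": "10335905"},
    "CLIENT": {"school": "Provost cntr", "dept": "LIBRARY", "rank": "staff"},
    "CONTENT": {"title": "Journal of Experimental Child Psychology",
                "res_id": "7014"},
})
xml_text = ET.tostring(event, encoding="unicode")
print(xml_text)
```

Driving the builder from a `SCHEMA` dictionary keeps the element order fixed and makes it easy to see which groups an event leaves empty.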

  25. Data Farm Multi-Tiered Architecture: Metridoc Setting
      • Admin Interface: admin users create the Metridoc schema, specifying structures for raw data sources.
      • Tier I, Data Ingest (Process Data): raw data ingest and handoff for resolving into the Metridoc schema. Inputs: logs and other data sources. The SOAP connection client issues data as event-level Metridoc XML.
      • Tier II, Repository: the SQL Generator spawns tables following the user-defined schema. Like a prism, the SQL Generator parses Metridoc info into relational structures within the Data Repository. Components: Resolver, SQL Generator, SOAP Connection, Data Repository (Voyager | People Data | ERM).
      • Tier III, MIS Tools & Services: XML, XLS, Oracle, RSS.
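The idea of a SQL Generator that spawns tables from a user-defined schema and then spreads each event across them can be sketched as follows. This is only an illustration under stated assumptions: the slide names Oracle as the data store, while the sketch uses Python's built-in `sqlite3` so that it runs anywhere, and the `SCHEMA` dictionary, the `event_id` key and both helper functions are hypothetical.

```python
import sqlite3

# Hypothetical user-defined schema: one table per event group.
SCHEMA = {
    "environmental": ["v_domain", "p_domain", "date", "time", "session", "url"],
    "client": ["school", "dept", "rank"],
}


def generate_ddl(schema):
    """Yield one CREATE TABLE statement per group in the schema."""
    for table, columns in schema.items():
        cols = ", ".join(f'"{c}" TEXT' for c in columns)
        yield f'CREATE TABLE "{table}" (event_id INTEGER, {cols})'


def store_event(conn, event_id, event):
    """Fan one event out into the per-group tables, keyed by event_id."""
    for table, row in event.items():
        names = ", ".join(f'"{c}"' for c in row)
        marks = ", ".join("?" for _ in row)
        conn.execute(
            f'INSERT INTO "{table}" (event_id, {names}) VALUES (?, {marks})',
            (event_id, *row.values()),
        )


conn = sqlite3.connect(":memory:")
for statement in generate_ddl(SCHEMA):
    conn.execute(statement)

store_event(conn, 1, {
    "environmental": {"date": "2007-07-26", "time": "15:41:01",
                      "session": "10335905"},
    "client": {"school": "Provost cntr", "dept": "LIBRARY", "rank": "staff"},
})

# Recombine the groups for a report: who used the service, and when.
rows = conn.execute(
    'SELECT c."rank", e."date" FROM client c '
    'JOIN environmental e USING (event_id)'
).fetchall()
print(rows)
```

Table and column names are interpolated into the SQL text, so in this sketch they must come from a trusted schema definition; only the event values are passed as bound parameters.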

  26. Use of Electronic Journals, Subject Correlations

  27. Penn’s Use of Electronic Journals by Subject: Multi-Dimensional Scaling Model. Source data: EZproxy logs, June-August 2007 and February-March 2008; 215,000 observations total. [Chart: Multidimensional Scale of E-Resource Use by Subject, All Users. Axis labels as transcribed: Dimension II, “Humanistic-Science”; Dimension II, “Qualitative-Quantitative”]

  28. Penn’s Use of Electronic Journals by Faculty: Multi-Dimensional Scaling Model. Source data: EZproxy logs, June-August 2007 and February-March 2008; 215,000 observations total. [Chart: Multidimensional Scale of E-Resource Use by Faculty, Grouped by School. Axis labels as transcribed: Dimension II, “Humanistic-Science”; Dimension II, “Qualitative-Quantitative”]

  29. Penn’s Use of Electronic Journals: Example of text mining on article keywords
