1 / 28

E xample of a bibliomining system: logs.library.cornell

E xample of a bibliomining system: logs.library.cornell.edu. Adam Chandler Data Discussion on Library Data Cornell University Library June 1, 2012. What is it?. Live demo 1. That’s it? Why bother? Just use Google Analytics. Why not Google Analytics?. l ogs.library.cornell.edu:

nedra
Download Presentation

E xample of a bibliomining system: logs.library.cornell

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Example of a bibliomining system:logs.library.cornell.edu Adam Chandler Data Discussion on Library Data Cornell University Library June 1, 2012

  2. What is it? logs.library.cornell.edu

  3. logs.library.cornell.edu

  4. logs.library.cornell.edu

  5. Live demo 1 logs.library.cornell.edu

  6. That’s it? Why bother? Just use Google Analytics logs.library.cornell.edu

  7. logs.library.cornell.edu

  8. Why not Google Analytics? logs.library.cornell.edu: • Uses Cornell single sign for security and convenience • gives us the freedom to export and use the data anyway we want for our special reporting needs • requires no changes to our websites. Google Analytics requires a section of Javascript code that sends information about each request to Google where it is recorded. Repeated privacy violations from commercial sites such as Facebook are driving some users towards widgets such as ghostery (http://news.ghostery.com/) that block javascript based web tracking. • our flexible designallows us to store logs which cannot easily be tracked with javascript: examples: PURL, checkip, flickr logs.library.cornell.edu

  9. 'bibliomining' Nicholson, S. (2003) The Bibliomining Process: Data Warehousing and Data Mining for Library Decision-Making. Information Technology and Libraries 22 (4) logs.library.cornell.edu

  10. “The term 'bibliomining' was first used by Nicholson and Stanton (2003) in discussing data mining for libraries.In the research literature, most works that contain the terms 'library' and 'data mining' are not talking about traditional library data, but rather using library in the context of software libraries, as data mining is the application of techniques from a large library of tools.In order to make it more conducive for those concerned with data mining in a library setting to locate other works and other researchers, the term 'bibliomining' was created.The term pays homage to bibliometrics, which is the science of pattern discovery in scientific communication.” logs.library.cornell.edu

  11. logs.library.cornell.edu

  12. logs.library.cornell.edu

  13. “Bibliomining is the application of statistical and pattern-recognition tools to large amounts of data associated with library systems in order to aid decision-making or justify services.” logs.library.cornell.edu

  14. “The bibliomining process consists of • ·        determining areas of focus; • ·        identifying internal and external data sources; • ·        collecting, cleaning, and anonymizing the data into a data warehouse; • ·        selecting appropriate analysis tools; • ·        discovery of patterns through data mining and creation of reports with traditional analytical tools; and • ·        analyzing and implementing the results.” logs.library.cornell.edu

  15. Nicholson, S. (2003) The Bibliomining Process: Data Warehousing and Data Mining for Library Decision-Making. Information Technology and Libraries 22 (4) logs.library.cornell.edu

  16. “The process is cyclical in nature: as patterns are discovered, more questions will be raised which will start the process again.As additional areas of the library are explored, the data warehouse will become more complete, which will make the exploration of other issues much easier.” logs.library.cornell.edu

  17. Apache Log logs.library.cornell.edu

  18. Apache Log logs.library.cornell.edu

  19. CUL Logs IP Address Groups CU (Qatar) CU (Weill) Ithaca not CU CU (Campus) CU Lib (Public) NY not Ithaca CU Lib (Staff) USA not NY Overseas logs.library.cornell.edu

  20. logs.library.cornell.edu

  21. Live demo 2 logs.library.cornell.edu

  22. Who uses it and for what? logs.library.cornell.edu

  23. How do I get help? logs.library.cornell.edu

  24. How do I get help? logs.library.cornell.edu

  25. https://confluence.cornell.edu/display/culweblogstool/ logs.library.cornell.edu

  26. Credits logs.library.cornell.edu

  27. Credits logs.library.cornell.edu

  28. logs.library.cornell.edu

More Related