120 likes | 194 Views
This project aims to evaluate the effectiveness of search/discovery tools in Learning Commons by utilizing behavioral metrics and conducting aggregate, longitudinal studies. The project includes tools for usability studies, scalability for hundreds/thousands of participants, realistic tasks, and on-demand playback of sessions. The framework allows unified analysis and query capabilities for internal/external resources, with enforced privacy measures. The approach involves client-side instrumentation using LibX Toolbar for tracking user behavior on web search pages. Detailed events such as clicks, mouse movements, scrolling, and text entries are recorded for analysis and improvement. The system also offers an example application for classifying search intent using machine learning techniques. Emory Libraries' User Behavior Analysis System (EUBA) includes client-side instrumentation and data mining to enhance user experience.
E N D
Instrumenting the Learning Commons Eugene Agichtein, Qi Guo and Ryan Kelly Intelligent Information Access Lab, Math & CS Department Arthur Murphy, Selden Deemer, Kyle Fenton Emory Libraries
Goals/Motivation • Evaluate effectiveness of search/discovery with behavioral metrics (task-specific) • Perform aggregate, longitudinal studies • Tools for usability studies “in the wild” • Scale (hundreds/thousands of “participants”) • Realistic behavior and tasks • On-demand playback of “interesting” sessions • Unified analysis/query framework for internal and external resource access and usage statistics • Web-based query and statistics interface • Access auditing, privacy, anonymity enforced Intelligent Information Access Labhttp://ir.mathcs.emory.edu/
Approach: Client-side instrumentation • Implementation within the Emory Installation of the LibX Toolbar: (http://www.libx.org) • Extended LibX to track UI events: JavaScript patch to sample the mouse movements and other events on pre-specified web search pages. Events are encoded into a string and buffered, and periodically sent to the server (on internal library network). Intelligent Information Access Labhttp://ir.mathcs.emory.edu/
How it works • On login, firefox is started with • http://irlib.library.emory.edu/consent.cgi?user=USERID • If user has previously opted in (or out) • Redirect to Euclid homepage • If new user, show consent form • Store choice in database; if opted in, also store salted hash string for user log in • Can track user behavior over “lifetime” • No way to recover login id by dictionary attack • Can be removed at any time by deleting mapping • LibX sends http requests to server with encoded event strings. Intelligent Information Access Labhttp://ir.mathcs.emory.edu/
User Opt-in (new Learning Commons Users) http://irlib.library.emory.edu/ Intelligent Information Access Labhttp://ir.mathcs.emory.edu/
Events captured (v0.4, deployed) • Button/link clicks/Url changes • Name of the button, link, other meta-info • Mouse movements • (x,y) coordinates sampled ~every 10ms • Scrolling • Start, stop position, ~ every 10ms • Text entry • Query text, options changes • Keypress events • Menu item events • Print, bookmark, save (all of them) Intelligent Information Access Labhttp://ir.mathcs.emory.edu/
Example: Mouse Movement 5 segments: initial, early, middle, late, and end. • “Cheap” proxy for eye-tracking: • Capture physiological characteristics of the mouse trajectories • Segment Properties: • Speed • Acceleration • Rotation • Drift • Hover For each: avg. speed, avg. acceleration, rotation etc. Intelligent Information Access Labhttp://ir.mathcs.emory.edu/
Example applications: Classifying Search Intent • Initial exploration: • Standard supervised machine learning classification techniques • WEKA implementation of SVM and decision trees. Intelligent Information Access Labhttp://ir.mathcs.emory.edu/
Summary of Features Used Intelligent Information Access Labhttp://ir.mathcs.emory.edu/
Example Informational query: “spanish wine” Intelligent Information Access Labhttp://ir.mathcs.emory.edu/
Emory User Behavior Analysis System • EUBA: • Client-side instrumentation, Data mining/machine learning (Qi Guo) • Log DB parsing, indexing, web-based interface for querying, playback, annotation (Ryan Kelly) • Plan: to release the system to research/library community Intelligent Information Access Labhttp://ir.mathcs.emory.edu/
Demo Prototype: http://ir.mathcs.emory.edu/library/private/index.pl user: test password: notsafe Intelligent Information Access Labhttp://ir.mathcs.emory.edu/