1 / 18

Self-Organized Web Usage Regularities

Self-Organized Web Usage Regularities. Problems of foraging information on WWW. Slow accession Difficulty in finding useful information is related to balkanization of Web structure Difficulty to solve this fragmentation problem by designing an effective classification scheme. Solution:

daisy
Download Presentation

Self-Organized Web Usage Regularities

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Self-Organized Web Usage Regularities

  2. Problems of foraging information on WWW • Slow accession • Difficulty in finding useful information is related to balkanization of Web structure • Difficulty to solve this fragmentation problem by designing an effective classification scheme. • Solution: To seek Web regularities in user behavior.

  3. Issues • How to characterize the strong regularities in Web surfing in terms of user navigation strategies. • How to present an information foraging agent-based approach to describing user behavior. • Issues on web self-organization.

  4. Related research works • Web mining for pattern-oriented adaptation: to identify the inter-relationships among different websites, either based on the analysis of the contents in Web pages or based on the discovery of the access patterns from Web log files. • Web data mining: computing association rules, detecting sequential patterns, and discovering classification rules and data clusters. • Web usage mining: analysis of Web usage patterns, such as user access statistical properties, association rules and sequential patterns in user sessions, user classification and web page clusters based on user behavior.

  5. Information Foraging Agent Model (IFAM) • Objectives: to find the inter-relationship between the statistical observations on Web navigation regularities and foraging behavior patterns of individual agents.

  6. IFAM – Artificial Web Space • Artificial Web Space: a collection of websites connected by hyperlinks. D(ci,cj)=(Σ Mk=1(cwik – cwjk)2)1/2 Where D(ci,cj) denotes the Euclidean distance between the content vectors of nodes i and j.

  7. IFAM – Artificial Web Space • Content Distribution Models T + |Xc|, if i=j cwni = |Xc|, otherwise fxc ~ normal (0,) T ~ normal(t, t) Where fxc: probability distribution of weight xc normal (0,): normal distribution with mean 0 and variance . T: content (increment) offset on a topic t: mean of normally distributed offset T t: variance of normally distributed offset T

  8. IFAM – Artificial Web Space • Power-law distribution: T + |Xc|, if i=j cwni = |Xc|, otherwise fxc ~ (Xc + 1) – (+1) , Xc > 0, > 0 Where fxc: probability distribution of weightXc : shape parameter of a power-law distribution T: content (increment) offset on a topic

  9. IFAM – Artificial Web Space • Constructing an Artificial Web 1. For each topic i 2. Create node content vectors End 3. For each node i 4. Initialize the link list of node i 5. For each node j 6. If D(ci,cj) < r 7. Add node j to the link list of node i 8. Add D(ci,cj) to the link list of node i end end end

  10. IFAM – Foraging Agents • Interest Profiles pm = [ pwm1, pwm2…pwmi…pwmM] pwmi Pmi = mj=1pwmi Hm = - mj=1pmilog(pmi) Where pm: preference vector of agent m pwmi : weight of preference on topic i Hm: interest entropy of user m

  11. IFAM – Foraging Agents • Interest Distribution Models 1. Normal distribution: pwmi = X p fxp ~ normal(0,u) Where normal(0,u) denotes the normal distribution with mean 0 and variance u 2. Power-law distribution: pwmi = X p fxp ~ u(Xp+1) -u + 1, Xp > 0, u > 0 Where u denotes the shape parameter of a power-law distribution

  12. IFAM – Foraging in an Artificial Web Space • Random agents: have no strong interests in any specific topics. • Rational agents: have specific interested topics in mind and they forage in order to locate the pages that contain information on those topics. • Recurrent agents: Recurrent agents are those who are familiar with the Web structure and know the whereabouts of interesting contents.

  13. IFAM – Foraging in an Artificial Web Space • Agent Preference Updating: depending on how much information on interesting topics the agent has found and how much the agent has absorbed such information. Pm() = Pm( - 1) -  cn pwmi = 0, for pwmi () < 0, i = 1…M Where  denotes an absorbing factor in [0,1] that implies how much information is accepted by agents on average. Pm() and Pm( - 1) denote an agent’s preference vector after and before accessing information in page n, respectively.

  14. IFAM – Foraging in an Artificial Web Space • Motivation Functions flog(mtv) ~ normal(m, m) Where m andm denotethe mean and variance of the log-normal distribution of mtv, respectively mtv = mem step Where m and mdenote thecoefficient and rate of an exponential function. Step denotes the number of pages/notes that an agent has continuously visited.

  15. IFAM – Foraging in an Artificial Web Space • Rewarding Function Rt =Mi=1(pwmi( -1) - pwmi()

  16. IFAM – Foraging in an Artificial Web Space • Foraging 1. Initialize the nodes and links in an artificial Web space 2. Initialize information foraging agents and their interest profiles 3. For each agent m 4. While the support for the agent S < max_supportm and S > min_supportm 5. Find the hyperlinks inside node n that the agent is presently in 6. Select, based on pk, the hyperlink that connects to the next-level page 7. Forage to the selected page 8. Update the preference weights in the agent’s interest profile 9. Update the support function of the agent End 10. If the support for the agent S > max_supportm 11. Agent m is satisfied with the contents and leaves the Web space Else 12. Agent m is dissatisfied and leaves the Web space End End

  17. Result • The experiment shows that by applying a weighted linear-regression method, the higher the occurrence rate of a depth or a link-click-frequency is, the higher the weight will be.

  18. Self-Organized Agent • To support adaptive organizations between agent, adding modeling technique allows agents to model their interactions with the environment and to recognize and manipulate new environmental scenarios to achieve organizational goals.

More Related