Human-centered Machine Learning

Viral marketing with Stochastic optimal control of TPP. Human-centered Machine Learning. http://courses.mpi-sws.org/hcml-ws18/. Maximizing activity in a social network. Can we steer users’ behavior to maximize activity in a social network? Endogenous and exogenous events.

loraj

Presentation Transcript


  1. Viral marketing with Stochastic optimal control of TPP • Human-centered Machine Learning • http://courses.mpi-sws.org/hcml-ws18/

  2. Maximizing activity in a social network: Can we steer users’ behavior to maximize activity in a social network?

  3. Endogenous and exogenous events. Exogenous activity: users’ actions due to drives external to the network. Endogenous activity: users’ responses to other users’ actions in the network.

  4. Multidimensional Hawkes process. For each user u, actions are modeled as a counting process Nu(t). The intensities or rates (actions per time unit) combine exogenous actions with endogenous actions; the endogenous term is built from a non-negative kernel (memory) and a user influence matrix.
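The intensity described on slide 4 can be evaluated numerically. The following is a minimal sketch, not the authors' code: it assumes an exponential kernel exp(-omega * (t - s)), and the function name `hawkes_intensity`, the parameter `omega`, and the convention that `A[u, v]` is the influence of user v's actions on user u are all illustrative choices that do not appear in the slides.

```python
import numpy as np

def hawkes_intensity(t, mu, A, events, omega=1.0):
    """Intensity vector lambda(t) of a multidimensional Hawkes process.

    mu:     (n,) exogenous rates, one per user.
    A:      (n, n) influence matrix; A[u, v] is the (assumed) influence
            of user v's actions on user u.
    events: list of (time, user) pairs for past actions.
    omega:  decay rate of the assumed exponential kernel.
    """
    lam = np.array(mu, dtype=float)  # start from the exogenous part
    for s, v in events:
        if s < t:  # only actions strictly before t contribute
            lam += A[:, v] * np.exp(-omega * (t - s))
    return lam

# Two users: user 1 acts at time 1.0, user 0 acts at time 2.0.
mu = np.array([0.1, 0.2])
A = np.array([[0.0, 0.5],
              [0.3, 0.0]])
events = [(1.0, 1), (2.0, 0)]
lam = hawkes_intensity(3.0, mu, A, events)
```

Here user 0's rate at time 3.0 is its exogenous rate plus the decayed influence of user 1's earlier action, and symmetrically for user 1.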

  5. Steering endogenous actions. Users’ actions = organic actions + directly incentivized actions. The control is the intensities of directly incentivized actions. [Zarezade et al., 2018]

  6. Cost-to-go & Bellman’s principle of optimality. Optimization problem: minimize a loss subject to dynamics defined by jump SDEs. To solve the problem, we first define the corresponding optimal cost-to-go. The cost-to-go, evaluated at t0, recovers the optimization problem! [Zarezade et al., 2018]

  7. Cost-to-go & Bellman’s principle of optimality. Optimization problem: minimize a loss subject to dynamics defined by jump SDEs. This is a stochastic optimal control problem for jump SDEs (we know how to solve this!). To solve the problem, we first define the corresponding optimal cost-to-go. The cost-to-go, evaluated at t0, recovers the optimization problem! [Zarezade et al., 2018]
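The formulas on slides 5 to 7 did not survive in the transcript. As a hedged sketch of what such a setup typically looks like, the block below writes the steered intensity, its jump SDE, and the optimal cost-to-go. The notation is assumed rather than taken from the slides: organic actions N(t), directly incentivized actions M(t) with intensity u(t), exogenous rate \mu_0, influence matrix A, exponential kernel \kappa(t) = e^{-\omega t}, running loss \ell, and terminal penalty \phi.

```latex
% Steered intensity: organic and incentivized actions both excite future organic actions
\lambda(t) = \mu_0 + A \int_0^t \kappa(t-s)\, dN(s) + A \int_0^t \kappa(t-s)\, dM(s)

% With \kappa(t) = e^{-\omega t}, differentiating gives a jump SDE
d\lambda(t) = \omega \left( \mu_0 - \lambda(t) \right) dt + A\, dN(t) + A\, dM(t)

% Optimal cost-to-go; evaluated at t = t_0 it coincides with the original problem
J(\lambda(t), t) = \min_{u(t, t_f]} \mathbb{E} \left[ \phi(\lambda(t_f))
    + \int_t^{t_f} \ell(\lambda(\tau), u(\tau))\, d\tau \right]
```

The second line follows from the first because the exponential kernel satisfies \kappa'(t) = -\omega \kappa(t), so between events the intensity decays toward \mu_0 and each event adds a jump given by the corresponding column of A.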

  8. Hamilton-Jacobi-Bellman (HJB) equation. Lemma: the optimal cost-to-go satisfies Bellman’s principle of optimality. This leads to the Hamilton-Jacobi-Bellman (HJB) equation, a partial differential equation in J (with respect to λ and t). [Zarezade et al., 2018]

  9. Solving the HJB equation. Consider a quadratic loss that rewards organic actions and penalizes directly incentivized actions. We propose a form for the optimal cost-to-go and then show that the optimal intensity depends on two quantities: the solution to a matrix Riccati differential equation (computed offline once!) and the closed-form solution to a first-order ODE. [Zarezade et al., 2018]

  10. The Cheshire algorithm. Intuition: steering actions means sampling the acting users and times from u*(t). In more detail: since the intensity function u*(t) is stochastic, we sample from it using the superposition principle and standard thinning. Easy to implement: it only requires sampling from inhomogeneous Poisson processes! [Zarezade et al., 2018]
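Slide 10 names standard thinning as one building block. The sketch below shows only that building block, sampling event times from an inhomogeneous Poisson process with a known upper bound on the rate; it is not the Cheshire algorithm itself. The function name `sample_inhomogeneous_poisson`, the example rate, and the bound `rate_max` are assumptions made for illustration.

```python
import math
import random

def sample_inhomogeneous_poisson(rate, rate_max, t_end, rng):
    """Sample event times on (0, t_end] by standard thinning.

    rate:     function t -> intensity, assumed to satisfy
              0 <= rate(t) <= rate_max on the whole interval.
    rate_max: upper bound used for the proposal process.
    rng:      a random.Random instance.
    """
    times = []
    t = 0.0
    while True:
        # Propose the next candidate from a homogeneous Poisson(rate_max).
        t += rng.expovariate(rate_max)
        if t > t_end:
            return times
        # Keep the candidate with probability rate(t) / rate_max.
        if rng.random() * rate_max < rate(t):
            times.append(t)

rng = random.Random(0)
rate = lambda t: 2.0 * math.exp(-0.5 * t)  # hypothetical decaying intensity
times = sample_inhomogeneous_poisson(rate, 2.0, 10.0, rng)
```

By the superposition principle, a sum of such intensities can be sampled by drawing from each component separately and merging the resulting event times, which may be how the two ingredients on the slide fit together.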

  11. Experiments on real data. Five Twitter datasets (users) where actions are tweets and retweets. Step 1: fit model parameters (exogenous rate, influence matrix). Network inference! Step 2: simulate steering endogenous actions, with directly incentivized tweets chosen by each method. [Zarezade et al., 2018]

  12. Evaluation metrics & baselines. Evaluation metrics: (i) average number of not directly incentivized tweets; (ii) average time to reach 30,000 not directly incentivized tweets. Baselines: MSC [Farajtabar et al., NIPS ’16], OPL [Farajtabar et al., NIPS ’14], PRK (PageRank), DEG (out-degree). [Zarezade et al., 2018]
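The second metric on slide 12 can be computed from simulated event times. This is a minimal sketch under stated assumptions: each simulation run is represented as a list of timestamps of not directly incentivized tweets, runs that never reach the target are skipped, and the helper names `time_to_reach` and `average_time_to_reach` are hypothetical.

```python
def time_to_reach(event_times, target):
    """Time of the target-th event, or None if the run never reaches it."""
    times = sorted(event_times)
    return times[target - 1] if len(times) >= target else None

def average_time_to_reach(runs, target):
    """Average, over runs that reach the target, of the time to reach it.

    Runs that never reach the target are skipped (an assumption of this
    sketch; how the slides treat such runs is not stated).
    """
    reached = [t for t in (time_to_reach(r, target) for r in runs)
               if t is not None]
    return sum(reached) / len(reached) if reached else None

# Toy example with a target of 3 events instead of 30,000.
runs = [[0.5, 1.0, 2.0], [1.0, 3.0, 4.0], [0.2]]
avg = average_time_to_reach(runs, 3)
```

In the toy example the third run has a single event and is skipped, so the average is taken over the first two runs only.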

  13. Performance vs. time. Panels: Series, M(tf) ≈ 5k; Sports, M(tf) ≈ 5k. Cheshire (in red) triggers 100%-400% more posts than the second best performer. [Zarezade et al., 2018]

  14. Performance vs. # of incentivized tweets. Panels: Series, M(tf) ≈ 5k; Sports, M(tf) ≈ 5k. Cheshire (in red) reaches 30K tweets 20-50% faster than the second best performer. [Zarezade et al., 2018]

  15. Why Cheshire? “the Cheshire Cat has the ability to appear and disappear in any location” Alice’s Adventures in Wonderland, Lewis Carroll
