Thank you Prof. Dr. Gerhard Boerner !. Stephen, Thomas, Houjun, Me, Robert Jing. Large Scale Statistics in Internet Behaviors. H ongguang Bi Greetingland , LLC Los Angeles, CA. Chapter 1. Chapter 2. Chapter 3. Internet and WWW History, how it works.
Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.
Hongguang BiGreetingland, LLCLos Angeles, CA
Internet and WWW History, how it works
Internet User Behaviors & Privacy
Geo, contextual and behavior targetings, Real-time bidding, Yield management
About Collect User Information, what and how
Cosmology: Nature defines physical laws
Internet: Human defines laws (or specifically: protocols)
Cosmology: Real World
Internet: Information World, or Virtual World
Cosmology: photons, electrons, neutrinos… (monad? Leibniz)
Cosmology: particles => stars => galaxies => clusters etc.
Internet: bits => bytes or integers => words => pages & emails
Cosmology: millions of galaxies detected => billions
Internet: millions to billions of users
Cosmology: goal=> structures, statistics of galaxies
Internet: goal=>behaviors, statistics of users
WWW: March 1989, Tim Berners-Lee http 0.9: 1995; http 1.0: 1996; http 1.1: June 1999, RFC 2616
Mailbox Protocol: 1971
SMTP: 1982, RFC 821Later developments: UUCP, sendmail,
Cookie is the only way that server can insert data into user’s browser.
How does it work?
Client: send request without cookie;
Server: response with a “Set-Cookie” header, containing some informationClient: send request with a “Cookie” header containing the SAME information
Cookie is bound to the specific server, and can be multiple
ResearchGroup: group_id, name, desciption, head
Member: member_id, group_id, name, type (profession, postdoc, student), status (current, left)
Left: left_id, member_id, when, where
A simplified time-series analysis tool
3,5 => 15
4,6 => 24
9,8 => 72
6,7 => 41
=> very fast
The Good side of tracking
The Good side of user tracking