1 / 27

Understanding Website Complexity: Measurements, Metrics, and Implications

Understanding Website Complexity: Measurements, Metrics, and Implications. Michael Butkiewicz Harsha Madhyastha UC Riverside. Vyas Sekar Intel Labs. doubleclick.net. cdn.turner.com. ads.cnn.com. Websites today are very complex! Diverse content from many servers and third party services.

ellema
Download Presentation

Understanding Website Complexity: Measurements, Metrics, and Implications

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Understanding Website Complexity: Measurements, Metrics, and Implications Michael Butkiewicz Harsha Madhyastha UC Riverside Vyas Sekar Intel Labs

  2. doubleclick.net cdn.turner.com ads.cnn.com Websites today are very complex! Diverse content from many servers and third party services cnn.com facebook.com

  3. Users see slow loading websites! 30% > 3 Seconds Median = 2 Seconds 67% of users encounter “slow” sites once a week (gomez.com)

  4. Why does load time matter? Implications for: Website owners End users Browser developers Customization Source: gomez.com

  5. Our work • Comprehensive study of website complexity • Analysis of sites across rank and category • Content and Service level metrics • Key metrics that impact performance

  6. Roadmap • Introduction • Measurement Setup • Complexity • Performance Implications • Discussion and Summary

  7. Measurement Setup • Selecting websites • 1,700 websites from Quantcast top 20k • Primary focus on landing (home) page • Annotated with AlexaCategories • Tools • Firefox + Firebug • No Local Caching • Approach • 4 vantage points (3 EC2, 1 UCR) • Every 60 second one page loaded • ~30 measurements per site per vantage point over 9 weeks

  8. Example Site Download Objects B C A D HTML CSS Image Script Time

  9. Example Firebug Log "log":{ "browser":{ "name":"Firefox”, }, "pages":[{ "startedDateTime":"18:12:59.702-04:00", "title":"Wired News”, "pageTimings":{ "onLoad":4630 }] "entries":[{ "startedDateTime":"18:12:59.702-04:00", "time”:75, "request":{ ... "headers":[{ "name":"Host", "value":"www.wired.com" }, } “response":{ "content":{ "mimeType":"text/html", "size":186013, ... ...

  10. Roadmap • Introduction • Measurement Setup • Complexity • Content-level • Service-level • Performance Implications • Discussion and future work

  11. Number of Objects: Across Categories Median 125 Objects! Median Site = 57 Objects!

  12. Number of Objects: Across Ranks 20% > 100! Not as much difference across rank ranges

  13. Types of Content Median site: 33 Images, 10 JavaScript, 3 CSS, 0 Flash Flash

  14. Normalized types of content Type % Mostly Homogeneous; Flash Skewed

  15. Roadmap • Introduction • Measurement Setup • Complexity • Content-level • Service-level • Performance Implications • Discussion and Summary

  16. Number of Servers Median site requires contacting 8 servers Median News 30 Servers!

  17. Number of Origins 20% Sites > 13 Origins Median site requires contacting 6 origins Median News 20 Origins!

  18. Popular non-origin providers Not just usual suspects! bluekai.com imrworldwide.com invitemedia.com > 5% of sites each! Most common services: Analytics & Advertising Most common objects: Image (small!) and Javascript

  19. Contribution of non-origin services 20% sites > 80% from 3rd Party Median Site Objects / Bytes 30% 35%

  20. Contribution of non-origin services 80% from 3rd Party Median Site Objects / Bytes 30% 35% Time only 15%

  21. Roadmap • Introduction • Measurement Setup • Complexity • Performance Implications • Discussion and Summary

  22. Metric Review • Content-Level Characteristics • Total Objects • Object Type: Number, Size • Absolute & Normalized • Service-Level Characteristics • Number: Server & Origins • Non-Origin Fraction: Servers / Objects / Time • Total combination of 33 metrics

  23. Load Time vs. Metric Correlation

  24. Load Time Variability vs. Metrics

  25. Roadmap • Introduction • Measurement Setup • Complexity • Content-level • Service-level • Performance Implications • Discussion and Summary

  26. Discussion: Many other variables • Client-side plugins • NoScript reduces #objects by half! • Mobile-specific customizations • Mobile version reduces #objects to a quarter • Landing vs. non-landing pages • Non-landing seem less complex

  27. Conclusions • Comprehensive study of Website Complexity • Median site: 57 objects, contacts 8 servers across 6 origins • Categories show more differences than popularity ranks • Non-origin content: • Analytics & advertising popular • large # objects, bytes, servers, not time • Key performance indicators: • Load Time  # objects • Variability  # servers Data: www.cs.ucr.edu/~harsha/web_complexity

More Related