Wanadoo Portals
Download
1 / 22

own Linux-based low-cost leading mass-market service - PowerPoint PPT Presentation


  • 263 Views
  • Uploaded on

Wanadoo Portals. A long-lived home-grown Linux-based low-cost leading mass-market service Pierre AUBERT & Eric OLIVERI FranceTelecom/Home Communication Services/Broadband & Internet. We talk about mass market net & computer engineering. the #1 in France a simple web page

loader
I am the owner, or an agent authorized to act on behalf of the owner, of the copyrighted work described.
capcha
Download Presentation

PowerPoint Slideshow about 'own Linux-based low-cost leading mass-market service' - Melvin


An Image/Link below is provided (as is) to download presentation

Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.


- - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - -
Presentation Transcript
Slide1 l.jpg

Wanadoo Portals

A long-lived home-grown Linux-based low-cost leading mass-market servicePierre AUBERT & Eric OLIVERI FranceTelecom/Home Communication Services/Broadband & Internet


We talk about mass market net computer engineering l.jpg
We talk about mass market net & computer engineering

  • the #1 in France

  • a simple web page

  • the Big Picture :

    pure* Open source – Linux – PC hardware

  • some thoughts about the past and the future

  • works well, costs little  delivers more from less


The 1 portals in france l.jpg
The #1 portals in France

  • wanadoo.fr – voila.fr – MMGs – IM/IRC…

  • 65% reach – 13 M unique visitors

  • 80 M pages to 4.2 M customers daily

  • Every day’s peaks :

    • 150 000 simultaneous users

    • 1 Gb/s Internet content

    • 5 000 new users/mn

    • 100 000 web pages/mn

    • transport conn/s, forks/s, SELECTs/s…


We talk about mass market net computer engineering4 l.jpg
We talk about mass market net & computer engineering

  • the #1 in France

  • a simple web page

  • the Big Picture :

    pure* Open source – Linux – PC hardware

  • some thoughts about the past and the future

  • works well, costs little  delivers more from less



Slide6 l.jpg

Mathopd

Apache/C++

Apache/Oracle

Solaris/Linux

Apache/PHP/mySQL

IIS/ASP/SQLsvr

TUX

LVS

embedded firewalls

…, but …


We talk about mass market net computer engineering7 l.jpg
We talk about mass market net & computer engineering

  • the #1 in France

  • a simple web page

  • the Big Picture :

    pure* Open source – Linux – PC hardware

  • some thoughts about the past and the future

  • works well, costs little  delivers more from less

* well, mostly


Not only a web server l.jpg
Not only a web server

  • Front office

    • Network* – Load Balancers* – Firewalls – DNS

    • Web servers : Apache/Mathopd/TUX, mostly PHP

    • Databases : mySQL, postgreSQL

  • Back office

    • Publishing – Content management

    • Audience collection

    • Databases**

  • Supervision, backups, etc.

  • * sometimes open source/Linux/PC hardware

  • ** most often non-open source/non-Linux/non-PC hardware


Slide9 l.jpg

other services

IMAP, HTTP…

BACK

SERVICES

databases

VPN

databases

web

farms

redirect

N x FRONT

SERVICES

DNS

backups

load-balcers

+ firewalls

+ routers

supervision

content

management

audiencecollection


3 ways to deliver web service l.jpg
3 ways to deliver Web service

  • Apache for dynamic content

    • Mostly PHP, some C++, some SSI

    • 20 M pages for 2.5 M users daily

    • 12 servers : Bi-CPU Intel, type 1 – 60 Mb/s

  • Mathopd for static content

    • Maximize browser cache effect

    • 2000 HTTP/second

    • 6 servers : 1-CPU Intel, type 0 – 25 Mb/s

  • TUX for redirection service

    • Built-in Linux kernel

    • 2500 HTTP/second – 2 Mb/s

    • 3 servers (1 is enough) : 1-CPU Intel, type -1 ;–)


Some other services l.jpg
Some other services

  • Cookies (identity, tracking…)

    • 1250 HTTP/sec

    • 4 servers : 1-CPU Intel type 0 – 10 Mb/s

  • Databases : authentication

    • 1000 reads/mn – 150 simultaneous connections

    • 1 server 1-CPU 1G RAM, type 1

  • Databases : search engine crawler

    • 500 M docs – 4 Tb – 100k writes/mn

    • 12 servers 2-CPU 5x72Gb each, type 1

  • WAN-LAN

    • DNS : 4 servers 1-CPU Intel type 0 – Zebra (BGP/OSPF)

    • Firewall + load-balance appliance : 1-CPU Intel, type -1

  • Tricks : lingerd, nscd, multicast…

    • tuning : 20 000 sockets per IRC server


Supervision etc l.jpg
Supervision etc.

  • 60+ parameters per server on each of 1000 servers

    • Net-SNMP, big brother/big sister, Perl…

    • ad hoc monitoring (~ 30%) : 1 h to develop and deploy

    • 20 servers 1-CPU Intel, type 1

  • At-the-fly graph generation : 50 000 parameters

    • 300+ samples/s – 3 servers Bi-CPU Intel, type 1

    • RRD-Tool + SNMP managers (Perl) + Apache

    • Env – Network – from HW&CPU to application internals

  • Backups, storage

    • Amanda’s self-planning, no routine restore  1/10th FTE

    • 2 servers P2-400MHz 2,4To (2€/GO/y)– cheaper than tape : (6€/GO/y)

      • backs up daily 300 servers


We talk about mass market net computer engineering13 l.jpg
We talk about mass market net & computer engineering

  • the #1 in France

  • a simple web page

  • the Big Picture :

    pure* Open source – Linux – PC hardware

  • some thoughts about the past and the future

  • works well, costs little  delivers more from less

  • * well, mostly


Lessons from the past l.jpg
Lessons from the past

  • Choose what fits best

    • 1996 : from FreeBSD to Solaris

    • 2003 : from Solaris to Linux

    • voila.fr launched 1998 : Apache/PHP/Linux

  • Service first

    • system administrators are front-line

      • diagnose and fix fast (or alternate service/systems)

      • end-user satisfaction = systems health

    • accounting transparency (CAPEX & OPEX per server/day)


Lessons from the past15 l.jpg
Lessons from the past

  • PC hardware is a key factor

    • Failures are facts not problems

    • « Small is beautiful » : cost of redundancy is 1/N

    • Competitive mass market sourcing

      • PC operating system is hw-independent

      • Reuse – assemble components

  • Open source OS is a key factor

    • Best use of hardware resources

    • Flexible kernels – Excellent network support

  • Application level

    • Open source platforms – Internet applications

    • Interoperability is in DNA


Lessons from the past16 l.jpg
Lessons from the past

  • Design – skills

    • Not a sequential « product » process

    • Expert staff allow short & long term savings

  • Efficiency

    • Low-range (90%) = 3 to 5 k€/year

      • High-end PC servers cost 5+ times more

      • Mid-range Unix servers cost 20+ times more

    • Numerous distributed small systems

      • scale at front office

      • keep back office + infra stable

      • distribute bandwidth, IO buffers, axis…


We talk about mass market net computer engineering17 l.jpg
We talk about mass market net & computer engineering

  • the #1 in France

  • a simple web page

  • the Big Picture :

    pure* Open source – Linux – PC hardware

  • some thoughts about the past and the future

  • works well, costs little  delivers more from less

  • * well, mostly


Costs little l.jpg
Costs little

  • Some figures

    • 1 system administrator for 40 servers

    • 1 low-range server = 3 to 5 k€/year OPEX

    • Per unique visitor : 1 €/year

    • Per 1k web pages : 0.5 €

      • margin cost = 2 c€ OPEX, 3 c€ CAPEX

  • Cost breakdown – OPEX

    • 15% shell & core

    • 15% hardware

    • 15% bandwidth

    • 5% maintenance/support

    • 50% man power


Costs little works well l.jpg
Costs little = « works » well

  • Home page Solaris

    • 12 Sun 4-CPU : 6 dynamic + 6 static content

    • 400 k€/year OPEX (amort. included)

    • Additional CAPEX : 10 € per 1k daily pages

  • Home page Linux

    • 18 PC 1&2-CPU : 12 dynamic + 6 static content

    • 100 k€/year OPEX (amort. included)

    • Additional CAPEX : 3 € per 1k daily pages works well


Works well l.jpg
Works well

  • Voila.fr ranked* #1, Wanadoo.fr ranked* #3

    • (*) Google’s home page always #1… but that’s not a real portal home page

  • Both fastest real portal home pages

  • Unavailability mostly due to a sub-service :

    • freeware expensive : fees + maintce + poor support

    • Open source proprietary code : can’t fix

      This sub-service delivers less from more

      ;–)


Next steps l.jpg
Next Steps

  • Convince top management that Open-Source solutions can replace proprietary one!

    • Most agree on web front office (1-tier)

    • Most disagree on 2nd tier: Jonas,Jboss / Websphere, 9iAS

    • All disagree on DB: MySQL, PostgreSQL / Oracle, DB2, SQLs

      • Carefully select an appropriated database for your needs

      • Deploy Oracle on PC + Linux+ attached disks (Instead of Sun + Solaris + SAN or NAS :)


Thank you l.jpg
Thank you

Some Open Source Oriented Question ?

Pierre.Aubert@francetelecom.com

Eric.Oliveri@francetelecom.com


ad