Presentation on paper ten rules for scalable performance in simple operation datastores
Download
1 / 13

Presentation on paper “Ten Rules for Scalable Performance in Simple Operation Datastores ” - PowerPoint PPT Presentation


  • 150 Views
  • Uploaded on

Presentation on paper “Ten Rules for Scalable Performance in Simple Operation Datastores ”. By 鄭秀青 (0156824). Purpose of the paper. Ten rules to help anyone who Wants to choose a DBMS that handles Simple Operations

loader
I am the owner, or an agent authorized to act on behalf of the owner, of the copyrighted work described.
capcha
Download Presentation

PowerPoint Slideshow about ' Presentation on paper “Ten Rules for Scalable Performance in Simple Operation Datastores ”' - edita


An Image/Link below is provided (as is) to download presentation

Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.


- - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - -
Presentation Transcript
Presentation on paper ten rules for scalable performance in simple operation datastores

Presentation on paper “Ten Rules for Scalable Performance in Simple Operation Datastores”

By 鄭秀青 (0156824)


Purpose of the paper
Purpose of the paper in Simple Operation

  • Ten rules to help anyone who

    • Wants to choose a DBMS that handles Simple Operations

    • Not considering GPTRS (General-purpose traditional row stores) systems, such as MySQL and MSSQL

  • Simple Operations (SO) is a term coined by the author:

    • Read or write a few items

    • Apply to OLTP model


Rule 1 look for shared nothing scalability
Rule #1 in Simple Operation Look for shared-nothing scalability

  • Each node shares neither main memory nor disk

  • A collection of self-contained nodes are connected to each other by networking

  • Greatly reduces the overheads of synchronisation and locking mechanism

  • Easily scalable until the network bandwidth is exhausted


Rule 2 high level languages are good and need not hurt performance
Rule #2 in Simple Operation High-level languages are good and need not hurt performance

  • Programmers can write less code that is easier to understand

  • No need for the programmers to understand complex storage optimisations

  • Less maintenance when things need to be changed in the database


Rule 3 plan to carefully leverage main memory databases
Rule #3 in Simple Operation Plan to carefully leverage main memory databases

  • Doubling the size of RAM does not mean the performance will become twice as fast

  • CPU overheads need to be considered

  • If the CPU overheads are dealt poorly, even entire database is placed in memory, the performance will only improve marginally


Rule 4 ha and automatic recovery are essential for so scalability
Rule #4 in Simple Operation HA and automatic recovery are essential for SO scalability

  • Few clients today are willing to accept down time in their SO application

  • Most people want redundant hardware and have a second copy of their data

  • Disaster recovery should be considered as an extension of HA

  • The DBMS you choose should have built-in high availability function


Rule 5 on line everything
Rule #5 On-line everything in Simple Operation

  • Users want their database to be “up” all the time

  • In addition to failure recovery, other reasons for taking a DB offline should be considered:

    • Schema changes

    • Index changes

    • Reprovisioning

    • Software upgrade

    • The actions above should be performed without interrupting the DB service


Rule 6 avoid multi node operations
Rule #6 in Simple Operation Avoid multi-node operations

  • Multi-node operation

    • Basically means operations over several servers

  • If the majority of operations involve several servers, the advantages of scalability may lost as the overheads of the cross server communication and synchronisation increase dramatically


Rule 7 don t try to build acid yourself
Rule #7 in Simple Operation Don’t try to build ACID yourself

  • Use a DBMS that provides ACID

  • Do not try to code ACID in application level as it:

    • Complicates the design

    • Is difficult to maintain


Rule 8 look for administrative simplicity
Rule #8 in Simple Operation Look for administrative simplicity

  • Choose a DBMS with easy-to-use administrative tools, including

    • Installation

    • Schema construction

    • Application design

    • Data distribution

    • Tuning

    • Monitoring


Rule 9 pay attention to node performance
Rule #9 in Simple Operation Pay attention to node performance

  • “Node performance is less important compare to linear scalability” is a misconception

  • Assume solution A provides node performance of a factor of 20 better than solution B

    • If solution A requires 50 hardware nodes

    • Solution B will need 1000 nodes


Rule 10 open source gives you more control over your future
Rule #10 in Simple Operation Open source gives you more control over your future

  • Avoid expensive software license and upgrade fees

  • Proprietary technical support is not always superior

  • Several vendors provide open source software consultancy and technical support


Conclusion
Conclusion in Simple Operation

  • Use DB built-in functions whenever possible

  • The performance of an individual node is as important as the overall performance

  • Use open source software to avoid large sum of bills


ad