Infinispan, transactional key-value DataGrid and NoSQL database

Infinispan, transactional key-value DataGrid and NoSQL database • 11 April 2013 • Alexander Petrov

Alexander Petrov • Sr. Consultant at Inmeta Consulting • Current project: Skatteetaten Grid POC • Previous projects involving grid technologies: • Mattilsynet food authority system • FrameSolution BPM framework used in Lovisa National Court Authority (Norway), Mattilsynet Food Authority • Other noteworthy projects: • Coca Cola Basis ERP system – Coca Cola Bottler factories • mPowerMobilitec: 300 million subscribers worldwide, delivering over 500,000 pieces of content every day

Usage scenarios • Big data: databases are slow, memory is FAST! • Provides huge computing power • Tax calculation • Financial organizations • Government organizations use it for communication and data sharing between departments • Scientific computations • MMORPG games

Agenda • General terminology relevant to distributed caching • Challenges of introducing distributed caching to an existing system • Metrics and tuning

Distributed Caching - Concepts • Cache: JSR-107 • Java Data Grid: JSR-347 • In-memory data grid • Cluster • Distribution • Node: a member of a cluster • Transaction awareness • Colocation • Map/Reduce • Consistency

Real World Use Case

Typical J2EE backend

Data access • Transaction scope • Locking/deadlocking • Flushing policies • Mixing the technology stack • Performance

Legacy Cache

Our end goal • Wow we did it!

Summary • Our custom cache is super fast, but its cache hit ratio is rather low. • Our custom cache tends to get dirty because updates to the shared data cannot be propagated, and the data regions are not fully separated. • Marshaling is a rather slow and heavy process. • We are facing a technological cocktail and we need to maintain integrity.

Replication • Write-through • Write-behind • Replication queue
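The difference between write-through and write-behind can be sketched in plain Java. This is a minimal, single-threaded illustration and not Infinispan code: the class `StoreBackedCache`, its `Mode` enum, and the explicit `flush()` call are all hypothetical names, and a `Map` stands in for the backing store. A real write-behind implementation would flush from a background thread.

```java
import java.util.HashMap;
import java.util.LinkedHashSet;
import java.util.Map;
import java.util.Set;

// Hypothetical illustration class; not part of the Infinispan API.
class StoreBackedCache {
    enum Mode { WRITE_THROUGH, WRITE_BEHIND }

    private final Map<String, String> memory = new HashMap<>();
    private final Map<String, String> store;          // stands in for a database
    private final Set<String> dirtyKeys = new LinkedHashSet<>();
    private final Mode mode;

    StoreBackedCache(Map<String, String> store, Mode mode) {
        this.store = store;
        this.mode = mode;
    }

    void put(String key, String value) {
        memory.put(key, value);
        if (mode == Mode.WRITE_THROUGH) {
            // Write-through: the store is updated before put() returns.
            store.put(key, value);
        } else {
            // Write-behind: only remember that the key changed.
            dirtyKeys.add(key);
        }
    }

    String get(String key) {
        return memory.get(key);
    }

    // Writes the latest value of every dirty key to the store.
    // Returns the number of keys written.
    int flush() {
        int written = dirtyKeys.size();
        for (String key : dirtyKeys) {
            store.put(key, memory.get(key));
        }
        dirtyKeys.clear();
        return written;
    }
}
```

In write-behind mode, two updates to the same key before a flush result in a single store write carrying the latest value. That is where the throughput gain comes from, at the cost of a window in which the store is stale.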

Invalidation

Distribution

More terminology • Eviction • Least Recently Used • First In First Out • LIRS • Custom • Expiration • Invalidation
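Least Recently Used eviction can be sketched with the JDK's `LinkedHashMap`. This is a minimal illustration of the policy only, not how Infinispan implements eviction; the class name `LruCache` and the `maxEntries` parameter are assumptions made for the example.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical illustration class; not part of the Infinispan API.
class LruCache<K, V> extends LinkedHashMap<K, V> {
    private static final long serialVersionUID = 1L;
    private final int maxEntries;

    LruCache(int maxEntries) {
        // accessOrder = true: iteration order runs from least to most
        // recently accessed entry, so the "eldest" entry is the LRU one.
        super(16, 0.75f, true);
        this.maxEntries = maxEntries;
    }

    @Override
    protected boolean removeEldestEntry(Map.Entry<K, V> eldest) {
        // Called after every insertion; returning true evicts the eldest entry.
        return size() > maxEntries;
    }
}
```

With a capacity of 2, putting `a` and `b`, reading `a`, and then putting `c` evicts `b`, because `a` was touched more recently. Passing `false` as the third constructor argument keeps insertion order instead, which turns the same class into a First In First Out policy.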

Caching topologies – Mirrored Cache • Ref. data vs. transactional • Reference data: Good. Max 30,000 reads/sec (1 KB items) • Transactional data: Good. Max 25,000 writes/sec (1 KB items)

Caching topologies – Replica Cache • Reference data: Good. 30,000 reads/sec per server. Grows linearly as servers are added. • Transactional data: Not so good. Max 20,000 writes/sec. Drops to 2,500 if you add a 3rd server.

Caching topologies – Partitioned Cache • Ref. data vs. transactional • Reference data: Good. Max 30,000 reads/sec (1 KB items) • Transactional data: Good. Max 25,000 writes/sec (1 KB items)

Caching topologies – Partitioned Replica • Reference data (1 KB): Good. 30,000 reads/sec per server. Grows linearly as servers are added. • Transactional data (1 KB): Good. 20,000 writes/sec per server. Grows linearly as servers are added.

How to define our topology • What is the size of our cluster? • Reads vs. writes • Communication inside our grid: UDP or TCP, synchronous vs. asynchronous • What about transaction isolation? Repeatable Read vs. Read Committed • What is the nature of our application? • Read-intensive data (e.g. CMS systems) • Write-intensive data (e.g. document management systems)

Level 1 Cache / Near Cache • Level 1 cache is supported only in distribution mode • Level 1 cache might have a performance impact in certain systems

Cache stores and loaders • Passivation • Activation • Hibernate

Transactions, Isolation and Locking • Long-running transactions need to be avoided. • What is a long-running transaction? How long is actually long? • Read Committed vs. Repeatable Read

Classic deadlock situation • TX1 wants to update A, B, C • TX2 wants to update C, B, A • A is locked by TX1 • C is locked by TX2
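One common way to avoid this kind of deadlock is to acquire locks in a fixed global order, so that TX1 and TX2 both start with A. The sketch below shows that technique with JDK locks only. The class `OrderedLocker` and its methods are hypothetical names, and this is not a description of how Infinispan acquires its locks.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.locks.ReentrantLock;

// Hypothetical illustration class; not part of the Infinispan API.
class OrderedLocker {
    private final Map<String, ReentrantLock> locks = new ConcurrentHashMap<>();

    // Locks all keys in sorted order, regardless of the order requested.
    // Returns the order in which the locks were acquired.
    List<String> lockAll(List<String> keys) {
        List<String> ordered = new ArrayList<>(keys);
        Collections.sort(ordered);
        for (String key : ordered) {
            locks.computeIfAbsent(key, k -> new ReentrantLock()).lock();
        }
        return ordered;
    }

    // Releases the locks in reverse acquisition order.
    // Must be called from the same thread that called lockAll.
    void unlockAll(List<String> ordered) {
        for (int i = ordered.size() - 1; i >= 0; i--) {
            locks.get(ordered.get(i)).unlock();
        }
    }
}
```

With this scheme a transaction asking for C, B, A still locks A first. Two transactions can then block each other, but they cannot each hold a lock the other one is waiting for.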

Repeatable Read • What is returned? (diagram: TX1 and TX2)
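The question on this slide can be made concrete with a small model of the two isolation levels. It is a single-threaded sketch under simplifying assumptions: `IsolationDemo`, `Tx`, and `read` are hypothetical names, a plain `Map` plays the role of committed data, and a "commit" by another transaction is modelled as a direct write to that map.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical illustration class; not part of the Infinispan API.
class IsolationDemo {
    enum Isolation { READ_COMMITTED, REPEATABLE_READ }

    static class Tx {
        private final Map<String, String> committed;
        private final Isolation isolation;
        // Values this transaction has already seen, keyed by cache key.
        private final Map<String, String> firstReads = new HashMap<>();

        Tx(Map<String, String> committed, Isolation isolation) {
            this.committed = committed;
            this.isolation = isolation;
        }

        String read(String key) {
            if (isolation == Isolation.REPEATABLE_READ) {
                // Remember the first value read and keep returning it.
                if (!firstReads.containsKey(key)) {
                    firstReads.put(key, committed.get(key));
                }
                return firstReads.get(key);
            }
            // READ_COMMITTED: always return the latest committed value.
            return committed.get(key);
        }
    }
}
```

If TX1 reads a key, TX2 then commits a new value, and TX1 reads the key again, the Repeatable Read transaction in this model still returns the original value, while a Read Committed transaction returns the new one.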

Cache statistics

Remoting statistics

Locking statistics

Marshaling data • Java serialization • Java externalization • Impact on performance • Generic domain.
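The performance impact of serialization versus externalization can be illustrated by comparing the number of bytes the JDK writes for the same data. The sketch below uses only `java.io`; the classes `SerializablePerson`, `ExternalizablePerson`, and `MarshalSize` are hypothetical examples, and Infinispan's own marshalling layer is not used here.

```java
import java.io.ByteArrayOutputStream;
import java.io.Externalizable;
import java.io.IOException;
import java.io.ObjectInput;
import java.io.ObjectOutput;
import java.io.ObjectOutputStream;
import java.io.Serializable;
import java.io.UncheckedIOException;

// Default Java serialization: field names and types are written to the stream.
class SerializablePerson implements Serializable {
    private static final long serialVersionUID = 1L;
    String name;
    int age;

    SerializablePerson(String name, int age) {
        this.name = name;
        this.age = age;
    }
}

// Externalization: the class decides exactly which bytes are written.
class ExternalizablePerson implements Externalizable {
    private static final long serialVersionUID = 1L;
    String name;
    int age;

    // A public no-argument constructor is required for deserialization.
    public ExternalizablePerson() {
    }

    ExternalizablePerson(String name, int age) {
        this.name = name;
        this.age = age;
    }

    @Override
    public void writeExternal(ObjectOutput out) throws IOException {
        out.writeUTF(name);
        out.writeInt(age);
    }

    @Override
    public void readExternal(ObjectInput in) throws IOException {
        name = in.readUTF();
        age = in.readInt();
    }
}

// Hypothetical helper that measures the marshalled size of an object.
class MarshalSize {
    static int sizeOf(Object value) {
        try (ByteArrayOutputStream bytes = new ByteArrayOutputStream();
             ObjectOutputStream out = new ObjectOutputStream(bytes)) {
            out.writeObject(value);
            out.flush();
            return bytes.size();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }
}
```

Calling `MarshalSize.sizeOf` on both variants with the same name and age shows a smaller payload for the externalizable one, because the stream no longer carries per-field metadata. The trade-off is that the read and write methods must be kept in sync by hand.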

Real World Use Case

Data access • Transaction scope • Locking/deadlocking • Flushing policies • Mixing the technology stack • Performance

Our end goal • Wow we did it!

The End • Thank you for your attention

Used sources • http://www.alachisoft.com/ncache/caching-topology.html • http://www.infoq.com/news/2011/10/java-data-grid • https://github.com/datagrids/spec/wiki • http://www.jboss.org/infinispan/documentation • http://code.google.com/p/thrift-protobuf-compare/wiki/Benchmarking
