Operational amp Analytical Database Ricardo JimenezPeris LeanXcale CEO amp Founder LeanXcale New database vendor Result of leading edge research in Scalable transactional management Scalable data management ID: 806194
Download The PPT/PDF document "Ultra-Scalable Full SQL Full ACID" is the property of its rightful owner. Permission is granted to download and print the materials on this web site for personal, non-commercial use only, and to display it on your personal computer provided you do not modify the materials and that you retain all copyright notices contained in the materials. By downloading content from our website, you accept the terms of this agreement.
Slide1
Ultra-Scalable Full SQL Full ACIDOperational & Analytical DatabaseRicardo Jimenez-PerisLeanXcale CEO & Founder
Slide2LeanXcaleNew database vendorResult of leading edge research in:Scalable transactional managementScalable data managementStorage management
Elasticity
High availability
Currently
working with several big companies in the following verticals
:
Banking
Telecommunications
Retail
Travel technology
Slide3Ultra-Scalable TransactionsSolved how to scale transactions to large scale (i.e. 100 million update transactions per second) in a fully seamless way
Breakthrough result of
15+ years of research
by a tenacious team
Slide4Problem: Lack of Scalable SQL DatabasesMainframe expensive licensing/HWAlternatives:
Mainframe
e
xpensive licensing/HW
Sharding
expensive development---------------------------------------------------------Solution: Ultra-Scalable SQL New generation database:
Ultra-Scalable
to
100s
of
nodes
Full SQL
simplicity
Full ACID
transactional consistency
No
Sharding
fully transparent to the applications
Can replace mainframes
Slide5Evaluation without data manager/logging to see how much throughput can attain the transactional processingScalability
2.35 Million
transactions
per second
Slide6Operational DBData WarehouseCosts of ETLs represent 75% of business analytics
Analytical queries on obsolete data
Copy Process (ETL)
Slide7Cutting costs of business analytics by 75% Real-time Analytical QueriesNo more ETLsBlending OLTP & OLAPMaking Decisions at the Right TimeAnalytical Queries
on Operational Data
Operational Database
Data Warehouse
OLTP + OLAP
OLTP
OLAP
Slide8Problem: Polyglot WorldLack of queries and transactions across data storesLack of consistency guarantees within NoSQL data stores---------------------------------------------------------Solution: Transactional NoSQL & Global Transactions
Queries across data stores
SQL, Neo4J, MongoDB, HBase
Full ACID HBase
Full ACID Neo4J (prototype with MVCC)
Full ACID MongoDB (prototype)
Slide9Problem: Cost of HadoopProgrammatic queries (MR) or subsets of SQL (Hive, Impala)Queries do not observe operational dataETLs required every time---------------------------------------------------------
Solution: Operational Data Lake
Supporting queries across Hadoop data lake and customer operational data
Slide10LeanXcale’s KiVi Storage EngineKiVi is a new storage engine from LeanXcale that is:Multi-Workload.Vectorial.
Ultra-efficient
.
Columnar
.
Fully elastic
Dual SQL and KV interface over relational data.Online aggregation.Inexpensive replication.Efficient distributed indexing.
Efficient multi-versioning
.
Slide11KiVi Key-Value Data StoreOLTP & OLAPQuery Engine
Storage
Transaction
Mng
SQL Engine
Ultra-Scalable
Transactions
Architecture
Slide12Real-Time Big Data
Full SQL Full ACID DB
OLAP over
Operational Data
Ultra-Scalable OLTP
Non-disruptive data migration, continuous load balancing and
Elastic & Ultra-Efficient
Queries across SQL, HBase, MongoDB, Neo4J & Hadoop files
Integration with Data Streaming
Polyglot
What is LeanXcale?
An
Ultra-Scalable
SQL Database for
Any Size
and
Any Workload
Slide13What is the Magic?
Slide14Transactional ProcessingThe transactional management provides ultra-scalabilityFully transparent:No sharding.No required
a priori knowledge about rows to be accessed.
Syntactically: no changes required in the application.
Semantically: equivalent behavior to a centralized system.
Provides
Snapshot Isolation
(the isolation level provided by Oracle when set to “Serializable” isolation).
+
+
Slide15Ultra-Scalable TransactionsTime
LeanXcale
Process &
commits
transactions
in parallel
Provides a consistent viewTraditional systems have a single-node bottleneck
Time
Traditional
transactional DB
vs
Slide16Snapshot Isolation vs. Serializability
Start
End
Reads
Writes
Reads & Writes
Snapshot isolation splits atomicity in two points one at the beginning of the transaction where all reads happen and one at the end of the transaction where all writes happen
Serializability provides a fully atomic view of a transaction, reads and writes happen atomically at a single point in time
Slide17Traditional ApproachCentralized Transaction ManagerSingle-node bottleneck
Central TM
Atomicity
Isolation
Durability
Consistency
Slide18Traditional ApproachCentralized Transaction ManagerSingle-node bottleneck
Central TM
Atomicity
Isolation
Writes
Durability
Isolation
Reads
Slide19Scaling ACID Properties
Atomicity
Atomicity
Atomicity
Isolation
Reads
Durability
Isolation
Writes
Slide20Scaling ACID PropertiesConflict Managers
Loggers
Commit Sequencer
Snapshot Server
Local
TMs
Atomicity
Isolation
Reads
Isolation
Writes
Durability
Slide21Separation of commit from the visibility of committed dataProactive pre-assignment of commit timestamps to committing transactionsTransactions can commit in parallel due to:They do not conflictThey have their commit timestamp already assigned that will determine its serialization order
Visibility is regulated separately to guarantee the reading of fully consistent states
Detection and resolution of conflicts before commit
Main Principles
Slide22Transactional Life Cycle: StartSnapshot Server
Current consistent
snapshot
The local txn mng gets the “start TS” from the snapshot server.
Get start TS
Local Txn
Manager
Slide23Transactional Life Cycle: ExecutionLocal TxnManager
Get start TS
Run on start TS snapshot
Conflict
Manager
The transaction will read the state as of “start TS”.
Write-write conflicts are detected by conflict managers on the fly.
Slide24Transactional Life Cycle: Commit
Get start TS
Run on start TS snapshot
Commit
The local transaction manager orchestrates the commit.
Local Txn
Manager
Slide25Transactional Life Cycle: CommitLogger
Commit
Sequencer
Data Store
Snapshot
Server
Commit TS
writeset
writeset
Commit TS
Local Txn
Manager
Get Commit TS
Log
Public Updates
Report
Snaps Serv
Slide26Transactional Life Cycle: CommitTIMESTAMP 11
TIMESTAMP
15
TIMESTAMP 12
TIMESTAMP 14
TIMESTAMP 13
Time
Sequence of timestamps received by the Snapshot Server
Evolution of the current snapshot at the Snapshot Server
TIMESTAMP
11
TIMESTAMP 12
TIMESTAMP 12
TIMESTAMP 15
TIMESTAMP
11
11
15
12
14
13
11
11
12
12
15
Slide27ConclusionsTransactional management not a bottleneck anymore. We can scale to many million of transactions per second.Combining multiple capabilities in a single database system, such as OLTP and OLAP, is what we believe it is the future of database management.
We are working in this direction.
Slide28Ricardo Jimenez-
Peris
LeanXcale CEO & Co-Founder
rjimenez
@leanxcale.com
www.LeanXcale.com
@LeanXcale