Cassandra for Administrators and Developers Certificate for Chris Dekker
Certificate ID:
588283
Authentication Code:
96355
Certified Person Name:
Chris Dekker
Trainer Name:
Richard Naoufal
Duration Days:
3
Duration Hours:
21
Course Name:
Cassandra for Administrators and Developers
Course Date:
13 June 2019 09:30 to 11 July 2019 16:30
Venue:
Amsterdam
Course Outline:
- Data model of Google’s Bigtable - Architecture of Amazon’s Dynamo
Cassandra was inspired by two major systems: Bigtable and Dynamo.
The first day of the training starts by exploring these two solutions and understanding what makes them strong and popular. The Cassandra architecture and internals (protocols…) are then addressed, along with the different cluster topologies and their consequences for how read and write operations take place.
- Solutions that have influenced Cassandra
- Data Model of Bigtable
- The idea behind Google’s Bigtable
- The data model
- Rows
- Column families
- Timestamps
- Building blocks
- Tablets
- Compaction
- Commit logs
- Bloom filters
- Performance
- Applications
- Architecture of Amazon’s Dynamo
- The idea behind Amazon’s Dynamo
- Design considerations
- Peer to peer systems
- Distributed systems and databases
- Architecture
- Partitioning
- Replication
- Versioning
- Hinted Handoff
- Membership and Failure detection
- Performance
- Load distribution
- Divergent versions
- SEDA
- Background
- Thread based concurrency
- Bounded thread pools
- Event driven concurrency
- Structured event queues
- The Staged Event-Driven Architecture
- Goals
- Stages
- Resource controllers
- Asynchronous IO
- Cassandra
- The need for a solution like Cassandra
- RDBMS Foundations
- RDBMS for Scale
- Consistency, consistency levels & the CAP Theorem
- First Words on Cassandra
- Use Cases for Cassandra
- The Cassandra data model
- The Relational Data Model
- The Relational Data Interaction
- Wanted Evolutions
- Logical Structures of the data model
- Map & Sorted Map
- Considerations (time series…)
- A Global view of Cassandra
- The Cluster
- Keyspaces
- Column Families
- Columns
- Elements of architecture 1
- A multidimensional hybrid row columnar structure
- Partitioners and data distribution (Consistent Hashing)
- The Ring representation
- Vnodes
- Read and write operations: Quorum and consistency levels
- Bloom filters
- Caches
- …
- Elements of architecture 2
- System keyspace
- Gossip & Failure Detection
- Anti-Entropy & Read Repair
- Hinted Handoff
- SEDA
- Memtables
- SSTables
- Commit logs
- Compaction
- Installation & Administrative operation of a Cassandra Cluster
- The different distributions of Cassandra
- Apache Cassandra
- DataStax Cassandra
- LABS
- A simple Cassandra installation
- The CQL shell (help, configuration, keyspaces and tables, moving data with cqlsh, roles, permissions, users, consistency levels)
- Deployment of a Cassandra cluster on a single data centre
- Operations on Memtables
- Backup and Recovery
- Deployment of a Cassandra cluster on 2 data centres
- Adding a node to and removing a node from a cluster
- Use of different snitches
- Tuning Cassandra
- Tracing to analyse performance
- Bloom filter performance
- Caching (configuring and monitoring)
- Data Modelling with Cassandra
- LABS
- Clustering columns
- Counters and TTL
- The design of row keys and column names
- Compound and composite keys
- Skinny rows
- Wide rows
- Secondary Indexes
- Operations on tuples, maps, sets and UDTs
- Time series