Apache Cassandra – Development and Administration



The “Apache Cassandra – Development and Administration” class focuses on imparting the core concepts, architecture and design of Apache Cassandra and develops its students to become successful Cassandra professionals.

The class starts with explaining how and where Cassandra fits in a business ecosystem, and how does it impact the Total Cost of Operations (TCO) of a Big Data analytical solution.

Class Highlights:

  • The Cassandra Architecture is discussed in detail to prepare students to fathom & follow the rest of the class.
  • Data Structures are explained along with Data Partitioning and the various choices available.
  • Scheme design is taught with focus on De-normalization techniques & Eventual Consistency.
  • Indexing mechanisms are also explained.
  • The Read and Write paths are explored & Deletes covered.
  • The various Consistency levels are explained.
  • Cassandra’s core strengths in Availability are Failure Handling are described in great details.
  • Compaction & Ring Management concepts are also explained.

From an Administration perspective, the class:

  • Details the process of installing a Cassandra cluster,
  • How to manage the cluster’s data,
  • Explains the various tools, utilities & client APIs available in the industry today, along with what to use when and why.

The Live demos showcase:

  • How to setup a Cassandra cluster
  • Run a 4 node Cassandra cluster,
  • How to use Opscenter,
  • Run Nodetool commands,
  • Use the CLI and CQL shell.

The hands-on labs include:

  • Setting up a cluster, creating the schema & manipulating data in it.
  • Dynamically extending the cluster, running  a constant load on the cluster and observing  the impact of nodes going down.
  • Running “Repair” and monitoring the cluster using Opscenter.

At the end of this two day class, students will be ready to start engaging themselves in a productive and meaningful way with Cassandra for their professional journeys beyond.

About Apache Cassandra:

Apache Cassandra is a highly scalable, high performance and fault tolerant distributed data infrastructure. Cassandra solves both real time and analytical big data problems, from write intensive workloads to sub millisecond caching layer reads to analytical workloads involving petabytes of data using MapReduce. Offering distribution of data across multiple data centers and incremental scalability with no single points of failure, Cassandra is the logical choice when you need reliability without compromising performance.

 

Prerequisites


Developers with basic understanding of database and ACID transactions

Audience


Developers,  Database administrators, Data Analytics professionals, Data architects, Managers

Recommended Readings


  1. Cassandra wiki
  2. Cassandra documentation

Course Dates


10/19/2013 from 9:00 AM to 10/20/2013 5:00 PM
11/16/2013 from 9:00 AM to 11/17/2013 5:00 PM
12/07/2013 from 9:00 AM to 12/08/2013 5:00 PM

Course Duration


Two days Class:
9:00 am till 5:00 pm,
Saturday & Sunday

Location


3200 Coronado Drive,
Santa Clara, CA 95054

The class will also be accessible online & live.

Registration:


 

Contact Information


Training Department
training@thirdeyecss.com
(408) 290-9949 – Ext 3