Apache Cassandra Lunch #80: Using Cassandra for a Content Management System

7/1/2022

Reading time:4

Apache Cassandra Lunch #80: Using Cassandra for a Content Management System - Business Platform Team

This resource is based on an article originally published here.

In Cassandra Lunch #80, we discussed how DataStax Astra can be used to create and track a content management system. We will demo a small application that is a clone of Tik Tok using Astra’s Document API. The live recording of Cassandra Lunch, which includes a more in-depth discussion and a demo, is embedded below in case you were not able to attend live. If you would like to attend Apache Cassandra Lunch live, it is hosted every Wednesday at 12 PM EST. Register here now!

Any discussion involving Cassandra needs to start with the data model. NoSQL systems provide fast read and write operations because tables tend to be created specifically to query certain data. Unlike traditional SQL schemas, joins are not allowed. This requires thinking about the data that is going to be stored and retrieved before creating a table. Special consideration needs to be given for what your Primary, Partition, and Clustering keys. The primary/partition key determines what columns a table can be queried. The clustering key determines how that data can be sorted.

The following Chebotko diagram demonstrates a Cassandra data model for a video application. Primary keys are denoted with a “K” and clustering columns are indicated with a “C” and an arrow up or down.

Using Cassandra for a Content Management System Chetbotko Diagram — Image from: https://scotch.io/tutorials/five-steps-to-an-awesome-data-model-in-apache-cassandra

DataStax Astra

For the demonstration in Cassandra Lunch #80: Using Cassandra for a Content Management System, we used DataStax Astra. Astra is a Cassandra-as-a-Service in the cloud. Some great things to point out about Astra are that it is free to start, database and infrastructure administration are optional, and there are multiple options to choose from when connecting to Cassandra via API.

Document API: schemaless storage of JSON documents
Stargate REST API: perform CRUD operations on data using a cross-language interface.
Stargate GraphQL API: query and mutate tables in your keyspace.
Stargate gRPC API: create CQL queries in Rust, Go, Node.js, or Java.

Stargate Document API

In our demonstration, we used Astra’s Stargate Document API. As mentioned above, this allows us to query and modify data stored as unstructured JSON documents in a collection and does not require the aforementioned data modeling typical of Cassandra. Once a namespace is created we can start adding data by connecting to the API via a URL containing the database id, region, keyspace name, and an authorization token (a Cassandra token) in the authorization header.

https://<ASTRA_DB_ID>-<ASTRA_DB_REGION>-apps.astra.datastax.com/api/rest/v2/namespaces/<ASTRA_DB_KEYSPACE>/collections/<ASTRA_DB_TABLE>

Multiple collections can be stored in a namespace, but a collection can only be stored in a single namespace. Collections are specified once a document is inserted. Once a document is inserted, each value in the JSON object is stored as a cell in the table. Below is an image of how a table is created when a document is submitted to Stargate’s Document API.

Using Cassandra for a Content Management System: image of how a table is created when a JSON object is submitted to Stargate's Document API — Image from: https://stargate.io/2020/10/19/the-stargate-cassandra-documents-api.html

One thing to note is that writes are a batch and will contain inserts and deletes. This can cause document rows to show two different states for a JSON field. If this happens, the document API will resolve by accepting the data with the later write time. To learn more about the Stargate Document API and details surrounding how deletes are handled and API performance checkout Stargate’s blog post.

To follow along in leveraging this information in order to create a Tik Tok clone be sure to check out the video and repository linked below. If you missed Cassandra Lunch #78: Cass Operator, it is embedded below! Additionally, all of our live events can be rewatched on our YouTube channel, so be sure to subscribe and turn on your notifications!

Resources:

Cassandra.Link

Cassandra.Link is a knowledge base that we created for all things Apache Cassandra. Our goal with Cassandra.Link was to not only fill the gap of Planet Cassandra but to bring the Cassandra community together. Feel free to reach out if you wish to collaborate with us on this project in any capacity.

We are a technology company that specializes in building business platforms. If you have any questions about the tools discussed in this post or about any of our services, feel free to send us an email!

Related Articles

astra

cassandra

datastax.astra

Vector Databases Compared - Evaluating DataStax Astra DB Serverless (Vector) and Pinecone Vector Database

2/4/2024

datastax

cassandra

langchain

Super Charge AI Assistants with Superagent and DataStax | DataStax

11/30/2023

migration

datastax

astra

GitHub - datastax/dsbulk-migrator

11/29/2023

graph.visualization

streaming

datastax

Home | Quine, Open Source Streaming Graph for Event-Driven Applications

11/10/2022

mlops

datastax

cassandra

Lift your MLOps pipeline to the cloud with Feast and Astra DB | DataStax

11/9/2022

stargate

cassandra.lunch

cassandra

Apache Cassandra Lunch #87: Cassandra.api, Astra, and Stargate - Business Platform Team

7/8/2022

terraform

datastax

cassandra

Apache Cassandra Lunch #86: DataStax Astra Terraform Provider - Business Platform Team

7/7/2022

cqlsh

cassandra.lunch

cassandra

Apache Cassandra Lunch #77: Connect to DataStax Astra via Standalone CQLSH - Business Platform Team

7/2/2022

datastax

cassandra

spark

Apache Cassandra Lunch #72: Databricks and Cassandra - Business Platform Team

6/28/2022

cassandra.lunch

etl

cassandra

Apache Cassandra Lunch #53: Cassandra ETL with Airflow and Spark - Business Platform Team

6/17/2022

Explore Further

cassandra

acid

open.source

cassandra

GitHub - pmcfadin/awesome-accord: Repository of all kinds of things to help you get up and running with ACID transactions on Apache Cassandra®

1/16/2025

mongo

nocode

elasticsearch

GitHub - ibagroup-eu/Visual-Flow: Visual-Flow main repository

12/2/2024

mongo

nocode

elasticsearch

GitHub - ibagroup-eu/Visual-Flow: Visual-Flow main repository

12/2/2024

migration

proxy

cassandra

GitHub - datastax/cql-proxy: A client-side CQL proxy/sidecar.

11/1/2024

datastax.astra

astra

cassandra

datastax.astra

Vector Databases Compared - Evaluating DataStax Astra DB Serverless (Vector) and Pinecone Vector Database

2/4/2024

datastax

cassandra

langchain

Super Charge AI Assistants with Superagent and DataStax | DataStax

11/30/2023

migration

datastax

astra

GitHub - datastax/dsbulk-migrator

11/29/2023

graph.visualization

streaming

datastax

Home | Quine, Open Source Streaming Graph for Event-Driven Applications

11/10/2022

stargate.document.api

sstable

cassandra

spark

Spark and Cassandra’s SSTable loader

11/1/2024

analytics

cassandra

spark

GitHub - apache/cassandra-analytics: Apache cassandra

9/4/2024

cassandra

event.driven

spark

Build an Event-Driven Architecture with Apache Kafka, Apache Spark, and Apache Cassandra

8/3/2024

python

cassandra

spark

GitHub - andreia-negreira/Data_streaming_project: Data streaming project with robust end-to-end pipeline, combining tools such as Airflow, Kafka, Spark, Cassandra and containerized solution to easy deployment.

12/2/2023

DataStax Astra

Stargate Document API

Resources:

Cassandra.Link

Become part of our

growing community!

Planet Cassandra is a service for the Apache Cassandra® user community to share with each other. From tutorials and guides, to discussions and updates, we're here to help you get the most out of Cassandra. Connect with us and become part of our growing community today.

Get Involved with Planet Cassandra!

We believe that the power of the Planet Cassandra community lies in the contributions of its members. Do you have content, articles, videos, or use cases you want to share with the world?