Apache Cassandra

4/25/2022

Reading time:4

Apache Cassandra

This resource is based on an article originally published here.

Cassandra DataSource for Grafana

Apache Cassandra Datasource for Grafana. This datasource is to visualise time-series data stored in Cassandra/DSE, if you are looking for Cassandra metrics, you may need datastax/metric-collector-for-apache-cassandra instead.

To see the datasource in action, please follow the Quick Demo steps. Documentation is available here

Supports:

Grafana 5.x, 6.x, 7.x (4.x not tested, 8.x WiP not supported yet)
Cassandra 3.x, 4.x (2.x not tested)
DataStax Enterprise 6.x
DataStax Astra (docs)
AWS Keyspaces (limited support) (docs)
Linux, OSX (incl. M1), Windows

Contacts:

Usage

You can find more detailed instructions in the datasource wiki.

Installation

Download the plugin using latest release, please download cassandra-datasource-VERSION.zip and uncompress a file into the Grafana plugins directory (grafana/plugins).
Add the Cassandra DataSource as a datasource at the datasource configuration page.
Configure the datasource specifying contact point and port like "10.11.12.13:9042", username and password. It's recommended to use a dedicated user with read-only permissions only to the table you have to access.
Push the "Save and Test" button, if there is an error message, check the credentials and connection.

Datasource Configuration

Panel Setup

There are two ways to query data from Cassandra/DSE, Query Configurator and Query Editor. Configurator is easier to use but has limited capabilities, Editor is more powerful but requires understanding of CQL.

Query Configurator

Query Configurator is the easiest way to query data. At first, enter the keyspace and table name, then pick proper columns. If keyspace and table names are given correctly, the datasource will suggest the column names automatically.

Time Column - the column storing the timestamp value, it's used to answer "when" question.
Value Column - the column storing the value you'd like to show. It can be the value, temperature or whatever property you need.
ID Column - the column to uniquely identify the source of the data, e.g. sensor_id, shop_id or whatever allows you to identify the origin of data.

After that, you have to specify the ID Value, the particular ID of the data origin you want to show. You may need to enable "ALLOW FILTERING" although we recommend to avoid it.

Example Imagine you want to visualise reports of a temperature sensor installed in your smart home. Given the sensor reports its ID, time, location and temperature every minute, we create a table to store the data and put some values there:

CREATE TABLE IF NOT EXISTS temperature (
    sensor_id uuid,
    registered_at timestamp,
    temperature int,
    location text,
    PRIMARY KEY ((sensor_id), registered_at)
);
insert into temperature (sensor_id, registered_at, temperature, location) values (99051fe9-6a9c-46c2-b949-38ef78858dd0, 2020-04-01T11:21:59.001+0000, 18, "kitchen");
insert into temperature (sensor_id, registered_at, temperature, location) values (99051fe9-6a9c-46c2-b949-38ef78858dd0, 2020-04-01T11:22:59.001+0000, 19, "kitchen");
insert into temperature (sensor_id, registered_at, temperature, location) values (99051fe9-6a9c-46c2-b949-38ef78858dd0, 2020-04-01T11:23:59.001+0000, 20, "kitchen");

In this case, we have to fill the configurator fields the following way to get the results:

Keyspace - smarthome (keyspace name)
Table - temperature (table name)
Time Column - registered_at (occurence)
Value Column - temperature (value to show)
ID Column - sensor_id (ID of the data origin)
ID Value - 99051fe9-6a9c-46c2-b949-38ef78858dd0 ID of the sensor
ALLOW FILTERING - FALSE (not required, so we are happy to avoid)

In case of a few origins (multiple sensors) you will need to add more rows. If your case is as simple as that, query configurator will be a good choice, otherwise please proceed to the query editor.

Query Editor

Query Editor is more powerful way to query data. To enable query editor, press "toggle text edit mode" button.

102781863-a8bd4b80-4398-11eb-8c28-4d06a1f29279

Query Editor unlocks all possibilities of CQL including Used-Defined Functions, aggregations etc.

Example using test_data.cql:

SELECT id, CAST(value as double), created_at FROM test.test WHERE id IN (99051fe9-6a9c-46c2-b949-38ef78858dd1, 99051fe9-6a9c-46c2-b949-38ef78858dd0) AND created_at > $__timeFrom and created_at < $__timeTo

Follow the order of the SELECT expressions, it's important!

Identifier - the first property in the SELECT expression must be the ID, something that uniquely identifies the data (e.g. sensor_id)
Value - The second property must be the value what you are going to show
Timestamp - The third value must be timestamp of the value. All other properties will be ignored

To filter data by time, use $__timeFrom and $__timeTo placeholders as in the example. The datasource will replace them with time values from the panel. Notice It's important to add the placeholders otherwise query will try to fetch data for the whole period of time. Don't try to specify the timeframe on your own, just put the placeholders. It's grafana's job to specify time limits.

103153625-1fd85280-4792-11eb-9c00-085297802117

Development

Developer documentation

Related Articles

kubernetes

datastax

cassandra

DataStax Mission Control :: DataStax Project Mission Control

4/15/2023

cassandra.lunch

grafana.dashboard

cassandra

Apache Cassandra Lunch #62: Grafana Dashboard for Apache Cassandra - Business Platform Team

6/25/2022

cluster

troubleshooting

datastax

GitHub - arodrime/Montecristo: Datastax Cluster Health Check Tooling

4/3/2024

cassandra

langchain

llamaindex

GitHub - michelderu/chat-with-your-data-in-cassandra: Chat with your data stored in DataStax Enterprise, Astra DB and Apache Cassandra - In Natural Language!

3/26/2024

spring

angular

rest

GitHub - jhipster/jhipster-sample-app-cassandra: This is a sample application created with JHipster, with the Cassandra option

3/7/2024

api.management

node

api

LoopBack

3/7/2024

api.management

cassandra

api

GitHub - dreamfactorysoftware/df-cassandra: The DreamFactory Cassandra service

3/7/2024

api.management

snowflake

oracle

GitHub - dreamfactorysoftware/dreamfactory: DreamFactory API Management Platform

3/7/2024

mongo

rest

cassandra

GitHub - stargate/stargate: An open source data gateway

3/7/2024

para

cassandra

api

an open-source, general-purpose, backend framework for the cloud.

3/7/2024

Explore Further

cassandra

cluster

troubleshooting

datastax

GitHub - arodrime/Montecristo: Datastax Cluster Health Check Tooling

4/3/2024

cassandra

langchain

llamaindex

GitHub - michelderu/chat-with-your-data-in-cassandra: Chat with your data stored in DataStax Enterprise, Astra DB and Apache Cassandra - In Natural Language!

3/26/2024

spring

angular

rest

GitHub - jhipster/jhipster-sample-app-cassandra: This is a sample application created with JHipster, with the Cassandra option

3/7/2024

api.management

node

api

LoopBack

3/7/2024

grafana

kubernetes

datastax

cassandra

DataStax Mission Control :: DataStax Project Mission Control

4/15/2023

cassandra.lunch

grafana.dashboard

cassandra

Apache Cassandra Lunch #62: Grafana Dashboard for Apache Cassandra - Business Platform Team

6/25/2022

python

cassandra

spark

GitHub - andreia-negreira/Data_streaming_project: Data streaming project with robust end-to-end pipeline, combining tools such as Airflow, Kafka, Spark, Cassandra and containerized solution to easy deployment.

12/2/2023

python

cassandra

spark

GitHub - airscholar/e2e-data-engineering: An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Kafka, Apache Zookeeper, Apache Spark, and Cassandra. All components are containerized with Docker for easy deployment and scalability.

12/2/2023

spark

python

cassandra

spark

GitHub - andreia-negreira/Data_streaming_project: Data streaming project with robust end-to-end pipeline, combining tools such as Airflow, Kafka, Spark, Cassandra and containerized solution to easy deployment.

12/2/2023

python

cassandra

spark

GitHub - airscholar/e2e-data-engineering: An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Kafka, Apache Zookeeper, Apache Spark, and Cassandra. All components are containerized with Docker for easy deployment and scalability.

12/2/2023

flink

beam

dataflow

• Google Dataflow - Awesome-Astra

5/10/2023

data.modeling

cassandra

spark

Dealing with Large Spark Partitions

2/17/2023

Cassandra DataSource for Grafana

Usage

Installation

Panel Setup

Query Configurator

Query Editor

Development

Become part of our

growing community!

Planet Cassandra is a service for the Apache Cassandra® user community to share with each other. From tutorials and guides, to discussions and updates, we're here to help you get the most out of Cassandra. Connect with us and become part of our growing community today.

Get Involved with Planet Cassandra!

We believe that the power of the Planet Cassandra community lies in the contributions of its members. Do you have content, articles, videos, or use cases you want to share with the world?