July 2nd, 2014

Spark and Cassandra

Apache Cassandra is the leading distributed database in use at thousands of sites with the world’s most demanding scalability and availability requirements. This talk gives a brief overview of Cassandra, the current state of the DataStax/Databricks partnership, and an update on the integration work the two companies have been working on to provide the best experience for using Spark on Cassandra.

About Martin Van Ryswyk

Martin is responsible for the worldwide software engineering, product development and continued advancement of our integrated enterprise big data platform. He has more than 22 years of experience managing software teams at both small startups and large corporations. During that time, he’s brought products to market in a wide variety of areas such as cloud computing, application lifecycle management, database performance analysis, storage management and systems management. Before joining DataStax, he held numerous senior engineering roles, leading the development and go-to-market strategy for enterprise level technology products at Tidal Software, Luminate, EMC and most recently at Electric Cloud. Martin earned a bachelor of science degree in computer science from the University of California, Davis.

Learn more about Apache Cassandra + Spark Integration: http://planetcassandra.org/getting-started-with-apache-spark-and-cassandra/

Download DataStax’s Spark Driver for Apache Cassandra: https://github.com/datastax/cassandra-driver-spark