Whether running load tests or migrating historic data, loading data directly into Cassandra can be very useful to bypass the system’s write path.
In this webinar, we will look at how data is stored on disk in sstables, how to generate these structures directly, and how to load this data rapidly into your cluster using sstableloader. We’ll also review different use cases for when you should and shouldn’t use this method.
Speaker: Patricia Gorla, Apache Cassandra Architect at The Last Pickle
Patricia Gorla is an Apache Cassandra Architect at The Last Pickle, a Cassandra consultancy. She has been involved in all aspects of software development, from server administration to application development, and from data analysis to data storage. She has worked with companies and governmental entities on all aspects of data migration to non-relational data stores, and training the technical teams on the new architecture. She helped the US Patent & Trademark Office ingest more than 6 million patent documents and images; architect secure search systems for a large mortgage insurer; and introduce Cassandra to a digital marketing firm’s data pipeline. Prior to architecting databases Patricia focused on the analysis and visualization of data. Patricia speaks often at conferences and meetups such as O’Reilly’s StrataConf + Hadoop World, the Datastax Cassandra Summits, and local user groups.