Cassandra Technology
From the Apache Cassandra website:
The database is the right choice when you need scalability and high availability without compromising performance. Linear scalability and proven fault-tolerance on commodity hardware or cloud infrastructure make it the perfect platform for mission-critical data. Cassandra’s support for replicating across multiple datacenters is best-in-class, providing lower latency for your users and the peace of mind of knowing that you can survive regional outages.
From Wikipedia:
Apache Cassandra is a free and open-source, distributed, wide-column store, NoSQL database management system designed to handle large amounts of data across many commodity servers, providing high availability with no single point of failure. Cassandra offers support for clusters spanning multiple datacenters,[2] with asynchronous masterless replication allowing low latency operations for all clients. Cassandra was designed to implement a combination of Amazon’s Dynamo distributed storage and replication techniques combined with Google’s Bigtable data and storage engine model.[3]
Cassandra Support in Hop
Hop supports Cassandra in the following metadata objects:
Metadata Types
- Cassandra Connection: Create a connection to your Cassandra database cluster.
Workflow Actions
- Cassandra Exec CQL: Execute Cassandra CQL
Pipeline Transforms
Cassandra Input: Reads from a Cassandra cluster through a CQL query.
Cassandra Output: Write data to a table in a Cassandra cluster.
SSTable Output: Write data to a filesystem directory as a Cassandra SSTable.