2

Considering we have a backend written in NodeJS and uses MySQL and Cassandra as it's databases, if we want to add Spark to the system to do some data analyzing stuff like recommendation, can we do it with Cassandra( I mean using Spark + Cassandra) and reach the same result as we could reach with the Hadoop( Spark + Hadoop)?

I want to know what Hadoop can do that Cassandra can not to? Or what would make it essential to use Hadoop alongside with the Spark?

1 Answers1

1

You can definitely use Spark directly to perform real-time analytics on your Cassandra database using the Spark Cassandra connector.

Most tech giants use this combination for use cases that include recommender engines, personalisation, show what's trending, and fraud detection all in real-time.

Note that we recommend running analytics on a different Cassandra data centre so online user transactions are not affected by the analytics queries. For example, route app traffic to DC1 while Spark apps read/write to DC2 to isolate analytics and OLTP workloads in their own DCs. Cheers!

Erick Ramirez
  • 4,590
  • 1
  • 8
  • 30