Hadoop is a non-relational data storage and processing. Hadoop has shown rapid adoption in the enterprise, and is undoubtedly the leading technology in "big data" data processing. Also, Cassandra is another non-relational system that uses the BigTable data model, but employs Amazon's Dynamo scheme for data distribution and clustering.
If I have the option, what shall i use, Cassandra or Hadoop? or both of them?
Best Regards, Mohamed El-Refaey
Alex Holmes wrote:Given that Cassandra is a real-time system, and Hadoop is batch-based, so they end up complimenting each other.
Perfect then!, that means that if your app needs both requirements(real-time and batch processing), then Hadoop and Cassandra will make a perfect match ..
Thanks Alex for this! are you aware of a project that use both of them (open source or even commercial) ?