There are many distributed platforms or frameworks coming out when the cloud became hot. Do you have a suggestion on which one to choose? Have you ever tried Spark? It's said it has a better performance than Hadoop. And it supports Scala, which will make the program concise and easier with concurrency.