As an independent developer with average sysadmin skills, one problem I've faced while exploring big data products is deploying them cost effectively at modest scale on infra services like AWS and Linode.
Often, the steps to do this the right way are not there, and discovering them becomes a matter of (costly) trial and error.
Does your book cover *deploying* Storm at scale on such services? What are your opinions on its ease of deployment?
We don't cover any specific service such as AWS etc, we do however go through how to deploy Storm.
Deploying Storm is relatively easy. We strongly advise that you use a tool like Puppet to allow for recreateable deploys.
From there, you can easily deploy new nodes to the Cluster and what not.
My #1 piece of advise if you are setting up a cluster is to pay close attention to Zookeeper. Its a vital part of a Storm cluster.
Make sure that you are running Zookeeper machines with fast disk as its IO intensive and a lack of stability with Zookeeper
will impact on your entire cluster.