amit punekar wrote:Hello,
Please note the exception message -
Exception in thread "main"
org.apache.hadoop.mapred.FileAlreadyExistsException: Output directory
hdfs://localhost:54310/user/ubuntu/wordcount/input/vij.txt already
exists
You can delete the file and then try running. I cannot recollect but there is an option to overwrite the files if they exists.
Regards,
Amit
chris webster wrote:If you are just starting out and you just want to explore Hadoop and related tools like Hive, Pig etc, then you might find it easier to use one of the pre-packaged virtual machines from Hortonworks or Cloudera.
For example, I've been using the Hortonworks Sandbox. This gives you an integrated single-node Hadoop installation with tools like Hive, Pig, HCatalog and Hue, plus links to lots of well structured tutorials. The sandbox runs as a virtual machine e.g. inside Virtualbox or VMWare Player, and you can access a lot of the functionality very easily via the browser-based Hue interface. It's a lot easier than installing all these components by hand, and it's a great resource for learning about Hadoop, even if you plan to use a different Hadoop distribution for your project.
amit punekar wrote:Check if this file has any clue why jobtracker would not have started "/home/kumar/hadoop-0.20.2-cdh3u4/logs/hadoop-kumar-jobtracker-kumar.hadoop.out " .
Regards,
Amit
chris webster wrote:If you are just starting out and you just want to explore Hadoop and related tools like Hive, Pig etc, then you might find it easier to use one of the pre-packaged virtual machines from Hortonworks or Cloudera.
For example, I've been using the Hortonworks Sandbox. This gives you an integrated single-node Hadoop installation with tools like Hive, Pig, HCatalog and Hue, plus links to lots of well structured tutorials. The sandbox runs as a virtual machine e.g. inside Virtualbox or VMWare Player, and you can access a lot of the functionality very easily via the browser-based Hue interface. It's a lot easier than installing all these components by hand, and it's a great resource for learning about Hadoop, even if you plan to use a different Hadoop distribution for your project.
chris webster wrote:If you are just starting out and you just want to explore Hadoop and related tools like Hive, Pig etc, then you might find it easier to use one of the pre-packaged virtual machines from Hortonworks or Cloudera.
For example, I've been using the Hortonworks Sandbox. This gives you an integrated single-node Hadoop installation with tools like Hive, Pig, HCatalog and Hue, plus links to lots of well structured tutorials. The sandbox runs as a virtual machine e.g. inside Virtualbox or VMWare Player, and you can access a lot of the functionality very easily via the browser-based Hue interface. It's a lot easier than installing all these components by hand, and it's a great resource for learning about Hadoop, even if you plan to use a different Hadoop distribution for your project.
amit punekar wrote:Check if this file has any clue why jobtracker would not have started "/home/kumar/hadoop-0.20.2-cdh3u4/logs/hadoop-kumar-jobtracker-kumar.hadoop.out " .
Regards,
Amit