IntelliJ Java IDE
The moose likes Distributed Java and the fly likes Hadoop MapReduce Big Moose Saloon
  Search | Java FAQ | Recent Topics
Register / Login
JavaRanch » Java Forums » Java » Distributed Java
Reply Bookmark "Hadoop MapReduce" Watch "Hadoop MapReduce" New topic
Author

Hadoop MapReduce

Ralph Hoch
Greenhorn

Joined: Jun 04, 2011
Posts: 4
Hi,

I'm new to Hadoop and I'm trying to figure out how it works. As for an exercise I should implement something similar to the WordCount-Example. The task is to read in several files, do the WordCount and write and output file for each input file.
Hadoop uses a combiner and shuffles the output of the map-part as an input for the reducer. Then writes one output file (I guess for each instance that is running). I was wondering if it is possible to write one output file for each input file (so keep the words of inputfile1 and write result to outputfile1 and so on). Is it possible to overwrite the Combiner-Class or is there another solution for this (I'm not sure if this should even be solved in a Hadoop-Task but this is the exercise).

Thanks...
 
 
subject: Hadoop MapReduce
 
Threads others viewed
Mapreduce using Java
Trying to list the directory contents in a <fileset>
Formatter question(File)
XSLT newbie problem
Help:
WebSphere development made easy
without the weight of IBM tools
http://www.myeclipseide.com

cast iron skillet 49er

more from paul wheaton's glorious empire of web junk: cast iron skillet diatomaceous earth rocket mass heater sepp holzer raised garden beds raising chickens lawn care CFL flea control missoula heat permaculture