File APIs for Java Developers
Manipulate DOC, XLS, PPT, PDF and many others from your application.
The moose likes Hadoop and the fly likes Hadoop(Beginner Level Question) Big Moose Saloon
  Search | Java FAQ | Recent Topics | Flagged Topics | Hot Topics | Zero Replies
Register / Login
JavaRanch » Java Forums » Databases » Hadoop
Bookmark "Hadoop(Beginner Level Question)" Watch "Hadoop(Beginner Level Question)" New topic

Hadoop(Beginner Level Question)

Supraja Jayakumar

Joined: Jul 12, 2011
Posts: 7

I wrote this piece of hadoop to preprocess files and write the files again to the output directory. I see files by name part-000, 0001 and so on being created but they all are empty. I use NullWritable for key. But set Text for value. I am not sure if its because of that.

The following is my code:

Alan Gates

Joined: Nov 29, 2011
Posts: 7
TextInputFormat already splits your input by line and only hands your map function one line at a time. So you don't need the while loop. Also, String.contains() takes a CharSequence, not a regular expression. Unless you are looking for the literal character sequence "[A-Za-z]" you want to use String.matches().
I agree. Here's the link:
subject: Hadoop(Beginner Level Question)
It's not a secret anymore!