aspose file tools
The moose likes Hadoop and the fly likes Hadoop and compression Big Moose Saloon
  Search | Java FAQ | Recent Topics
Register / Login


Win a copy of The Mikado Method this week in the Agile and other Processes forum!
JavaRanch » Java Forums » Databases » Hadoop
Reply Bookmark "Hadoop and compression" Watch "Hadoop and compression" New topic
Author

Hadoop and compression

Andrew Mcmurray
Ranch Hand

Joined: Sep 24, 2005
Posts: 188
Hi all

I am pretty new to the HDFS and was looking for some opinions on some conflicting answers I have recently gotten.

1. Is it a good idea to compress the stream to write the file out to hadoop. One person told me they had got 10x benefit from doing this. Another told me that it was bad to compress b\c the map reduces that ran on the file could not be distributed using compressed files.

2. I read that map reduces running on hadoop works best with file sizes between 500gb and tb size files. Someone told me that the it works better with smaller files.

Any thoughts?

Thanks,

AMD
 
I agree. Here's the link: http://aspose.com/file-tools
 
subject: Hadoop and compression
 
Similar Threads
Beyond Hello World
ant against build.xml
Hadoop MapReduce
how to read a unix compressed file?
Hadoop - One Map and many Reduces