Christopher Webster wrote:
If you're working with Hive tables, then presumably the data will be read from/written to your Hive directories.
Your Hive processing will almost certainly involve several shuffles, and when you write the data to your target table, it will need to be moved again, so you have to assume there will be a lot of data moving around the cluster at certain stages of your process.
Paul Clapham wrote:You seem to have the idea that the input has to be changed in some way to qualify as "transforming".
Paul Clapham wrote:
Doing nothing should be an option.
Monica Shiralkar wrote:Yes, but then why not call it something else than ETL (Extract Transform Load).
Paul Clapham wrote:
Because usually it does a non-trivial transform.
might sometimes not do anything anyway.
Paul Clapham wrote:Because usually it does a non-trivial transform. Occasionally it may do a null transform. There's no need to think of a special word for that when "Transform" describes it perfectly well. People in the computer world are quite comfortable with the idea of operations which might sometimes not do anything anyway.
Monica Shiralkar wrote:I am still trying to understand it. I mean Aeroplanes do flying and we do not call a bus as aeroplane saying it does "NULL Flying".
Ron McLeod wrote:
Paul Clapham wrote:
Java supports the concept of an Identity Function where the value returned is the same as the value passed in to the function. Does that mean that it not a real function?
Thanks. Now, I am first trying to understand that why is Identify Function used. I mean why would we use a function which returns exactly what it receives.
Ron McLeod wrote:
Java supports the concept of an Identity Function where the value returned is the same as the value passed in to the function. Does that mean that it not a real function?
Ron McLeod wrote:
Java supports the concept of an Identity Function where the value returned is the same as the value passed in to the function. Does that mean that it not a real function?
Monica Shiralkar wrote:Thanks. Now I am first trying to (still) understand that why is Identify function used. I mean why would one use a function which would return exactly the same output as the input it receives.
Paul Clapham wrote:
Monica Shiralkar wrote:Thanks. Now I am first trying to (still) understand that why is Identify function used. I mean why would one use a function which would return exactly the same output as the input it receives.
If people can only use things which you personally understand, then the work of programmers is going to come to a standstill .
consider that people can write code which you don't understand, and accept it as valid. Then you will be able to move on.
moose poop looks like football shaped elk poop. About the size of this tiny ad:
Gift giving made easy with the permaculture playing cards
https://coderanch.com/t/777758/Gift-giving-easy-permaculture-playing
|