Question on "Hadoop Real World Solutions Cookbook"

 
Ranch Hand
Hi,

I was working on a project in the business intelligence domain. We used a data warehouse as the back end, with the ETL implemented in PL/SQL. A new architect suggested that we either move to a tool like Informatica or use Hadoop. The only problem was that we didn't know whether Hadoop would work as an alternative to the data warehouse. So I am curious: when the book says "real world solutions", does it cover cases like this, where we have a legacy data warehouse implementation and are facing performance issues?

Regards,
Pawan.
 
author
Hi Pawan. Thanks for considering the book.

With regard to your problem, Hadoop Real-World Solutions Cookbook is designed to get people up and running with Hadoop once they've decided it's a good fit for their system. No single recipe covers a full transition from a legacy warehouse to Hadoop, but many of the recipes independently help with a critical piece. For example, we cover using Sqoop to migrate data from an RDBMS (like Oracle) into HDFS. That would be a good place to start.
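To make the Sqoop step concrete, here is a minimal sketch (not taken from the book) that assembles a `sqoop import` command for copying one Oracle table into HDFS. The host, service name, user, password file, table, and target directory are all hypothetical placeholders, and the helper `build_sqoop_import` is an illustrative name, not part of Sqoop. The sketch only builds and prints the command; it does not run it.

```python
import shlex


def build_sqoop_import(jdbc_url, username, password_file, table,
                       target_dir, num_mappers=4):
    """Return the argument list for a `sqoop import` of one table into HDFS.

    Hypothetical helper for illustration; every value passed in is a
    placeholder that you would replace with your own settings.
    """
    return [
        "sqoop", "import",
        "--connect", jdbc_url,              # JDBC URL of the source database
        "--username", username,
        "--password-file", password_file,   # file holding the password
        "--table", table,                   # source table to copy
        "--target-dir", target_dir,         # HDFS directory for the output
        "--num-mappers", str(num_mappers),  # number of parallel map tasks
    ]


# All values below are assumed placeholders, not real systems.
cmd = build_sqoop_import(
    jdbc_url="jdbc:oracle:thin:@//dbhost.example.com:1521/ORCL",
    username="etl_user",
    password_file="/user/etl/.oracle_password",
    table="SALES_FACT",
    target_dir="/data/warehouse/sales_fact",
)

# Print the command as a single shell-safe line for review.
print(shlex.join(cmd))
```

Once you have checked the printed command against your own environment, you could run it from a shell on a machine with Sqoop and the Oracle JDBC driver installed.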

Hive and Pig are data warehousing and processing tools designed to run over Hadoop, and we cover how to use them in detail. You can follow the sample analytics to get an idea of how you would port your PL/SQL implementations to a language like Pig Latin or HiveQL.

So, to answer your question: if you've decided Hadoop is the right platform moving forward, many of the recipes in Hadoop Real-World Solutions Cookbook can be combined to help with the overall data migration and ETL transition.

Hope this helps!
 
pawan chopra
Ranch Hand
Thanks, Brian.

Looking forward to reading this book.

 