• Post Reply Bookmark Topic Watch Topic
  • New Topic
programming forums Java Mobile Certification Databases Caching Books Engineering Micro Controllers OS Languages Paradigms IDEs Build Tools Frameworks Application Servers Open Source This Site Careers Other Pie Elite all forums
this forum made possible by our volunteer staff, including ...
Marshals:
  • Campbell Ritchie
  • Jeanne Boyarsky
  • Ron McLeod
  • Paul Clapham
  • Liutauras Vilda
Sheriffs:
  • paul wheaton
  • Rob Spoor
  • Devaka Cooray
Saloon Keepers:
  • Stephan van Hulst
  • Tim Holloway
  • Carey Brown
  • Frits Walraven
  • Tim Moores
Bartenders:
  • Mikalai Zaikin

Pentaho compared to other (open source) ETL products

 
Greenhorn
Posts: 12
  • Mark post as helpful
  • send pies
    Number of slices to send:
    Optional 'thank-you' note:
  • Quote
  • Report post to moderator
As the title says: What are the strengths of Pentaho Data Integration compared to other ETL platforms (Open Source or not), for example Apatar or Talend?
I have not yet started to look into this but probably will in a couple of months so any pointers will be appreciated.
--
Örjan
 
Saloon Keeper
Posts: 27752
196
Android Eclipse IDE Tomcat Server Redhat Java Linux
  • Mark post as helpful
  • send pies
    Number of slices to send:
    Optional 'thank-you' note:
  • Quote
  • Report post to moderator
I've been working with Pentaho for around 2 years now. The actual ETL engine is both performant and capable. I've struggled occasionally with with the UI designer (Spoon), but this is, after all open-source. So I made some enhancements to the Excel Input component and got them committed. They've been part of the system for the last year or so.
 
Greenhorn
Posts: 28
  • Mark post as helpful
  • send pies
    Number of slices to send:
    Optional 'thank-you' note:
  • Quote
  • Report post to moderator
I've used Kettle, then PDI :-), for many years as well.

---
The strengths compared to previous ETL solutions most companies have in place (i.e. DTS/SSIS is the most common I've seen if you have MS SQL installed anywhere) includes:

*Strong declarative approach to ETL design.

*Database agnostic approach - don't have to worry about a particular ETL solution working 'great' for one database, and poorly for others.

*JDBC driver access - This is gonna sound odd to the non-JDBC users, but ODBC and ADO.NET providers I keep running into sporadic/unusual issues in driver configuration/server configuration/something else unknown on many different database setups. JDBC has been consistent and reliable, which for ETL is very important. Yes, someone will inevitabely say you loose some performance, well, loosing 1%-5% performance for rock solid reliability is an easy sell for me.

*Built-in warehousing support (dimensions), included, free, in the open source version.

*Customize/create your transformation step using Java (SSIS you can do this with .NET as well). I've created an X12-style EDI parser in about two weeks to solve a particular business need (it was very specific and not contributed unfortunately).

---
Compared to other open source ETL solutions (talend, clover, and several others I've reviewed in the past).

*LGPL license. You can use it in your business without worry.

*Commercial support. You can use it in your business without worry (Talend has this as well).

*Full-featured (Talend has this as well).

*It's got Matt Casters! :-)



 
author
Posts: 19
  • Mark post as helpful
  • send pies
    Number of slices to send:
    Optional 'thank-you' note:
  • Quote
  • Report post to moderator
I'm not very experienced in the use of other ETL tools. What I can assure you is that PDI (aka Kettle) will meet all your expectations. I've been used it for more than 3 years in different kinds of project (both DW and non DW related) and the tool was always capable of doing what I needed to do,
reply
    Bookmark Topic Watch Topic
  • New Topic