Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Common complaints I've heard about Oozie is that it has a high learning curve, not a great UI, and people hate the fact that it is XML based. This is a pretty decent comparison of Oozie vs Luigi (and Azkaban):

http://www.slideshare.net/jcrobak/data-engineermeetup-201309



That presentation was pretty good with the good and bad takes. Do you think frameworks like Casacading or spark make things a lot easier as a higher abstraction on hadoop / different compute model?


I haven't tried Cascading, but I've started doing some stuff with Spark and really like it. I feel like it is usually an easier abstraction to work with and it is a lot easier to prototype and experiment with.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: