Category Archives: Hadoop

More Oozie Learnings

I just finished moving the code from my last post from our dev and QA boxes to production. While it was running in QA, I should have tested, but did not, running the process as a cron … Continue reading

Posted in Data Warehouse, Hadoop

Using Oozie to Process Daily Logs

At Edmunds, we are working to move our existing data warehouse system to a new system based on Hadoop and Netezza. Initially, our data warehouse team focused on delivering ad impression data from DoubleClick DART as the first production … Continue reading

Posted in Data Warehouse, Hadoop, Java | 5 Comments