I just finished moving the code from my last post from our dev and qa boxes to production. During the time I was running in qa I did not, but should have, done testing running the process as a cron style job. Running the process as a cron style job has taught me a lot about how Oozie handles time. I learned three key lessons:
- All the times entered in Oozie regardless of the time zone specified in the configuration file are in GMT
- ${coord:current(0)} was putting the prior day in because of when it was running with relationship to GMT. Once I figured out the GMT thing I had to change to ${coord:current(-1)}.
- Manipulating the initial-instance variable for the data sets allowed me to back fill prior days. In oder to give our DWH team more test data I set it a week prior to the first actual run I wanted to do.
Tonight I will have another test run, fingers crossed everything should work well.