#5600 Taskd should have a way to handle large % of tasks failing

unreleased
wont-fix
nobody
migration (63)
General
nobody
2015-02-13
2013-01-11
No

taskd should have a way to determine if a large number of tasks have failed. I am thinking this would be most useful to count by task type, across all taskd instances. (Does each taskd instance query mongo for that occasionally? Or a separate script on cron?)

When a large % of errors have occurred, it'll depend on the type of task and the deployment situation to determine what should happen. So needs to be flexible. Some default behaviors that would be useful: email somebody, or stop processing more events of that type.

For upgrades, we'd want to stop the processing of upgrades if there too many failures.

Discussion

  • Dave Brondsema

    Dave Brondsema - 2013-02-25
    • status: open --> wont-fix
    • milestone: forge-backlog --> forge-mar-08
     
  • Dave Brondsema

    Dave Brondsema - 2013-02-25

    Don't think we really need this

     

Log in to post a comment.