Improve reliability of async job execution. ## Acceptance Criteria - Production ready implementation - Includes tests - Documentation updated