Degradation in processing queues identified

Incident Report for signageOS

Postmortem

Date

September 10, 2025

Authors

signageOS Engineering Team

Summary

On September 10, the signageOS platform experienced a temporary disruption that affected the processing of some backend data. While the issue impacted the accuracy and timeliness of device reporting within the platform, it did not affect deployed devices or their content playback.

Impact

  • No impact on deployed devices or playback.
  • Some delays and inconsistencies in device data reporting.
  • Temporary reduced availability of certain backend services.
  • Engineering teams applied manual interventions to restore service stability.

Detection

The issue was identified through internal monitoring and confirmed by engineering investigation.

Contributing Factors

  • A sudden increase in backend workload exceeded system thresholds.
  • Some alerts did not surface quickly enough due to notification settings.
  • Noise from other alerts delayed immediate recognition of the core issue.

Mitigation & Resolution

The team stabilized the platform through service restarts, resource adjustments, and clearing of affected backlogs. Alert thresholds were updated, and system safeguards strengthened to prevent similar situations.

Next Steps

  • Enhance monitoring and alerting for earlier detection.
  • Optimize handling of background workloads to reduce resource strain.
  • Improve processes to ensure critical alerts receive immediate attention.
Posted Sep 10, 2025 - 22:54 CEST

Resolved

This incident has been resolved.
Posted Sep 10, 2025 - 04:55 CEST

Monitoring

A fix has been implemented and we are monitoring the results.
Posted Sep 10, 2025 - 04:48 CEST

Update

We are continuing to work on a fix for this issue.
Posted Sep 10, 2025 - 04:30 CEST

Identified

The issue has been identified and a fix is being implemented.
Posted Sep 10, 2025 - 04:18 CEST

Investigating

We are currently investigating this issue.
Posted Sep 10, 2025 - 04:15 CEST
This incident affected: API, Box, and Platform.