Summary
On Monday, July 14th, between 21:50 UTC and 22:55 UTC, a subset of users experienced delays across several platform features. These included slower email and SMS delivery, delayed automation processing, and temporary API errors.
The incident was caused by a disruption in external network resolution services, which affected how our platform communicated with certain internet resources.
What Was Impacted
During the affected timeframe, users may have noticed:
Delays in email and SMS delivery
Slower automation workflow execution
Temporary issues while accessing API V3
Webhook processing delays
Occasional errors when using redirection links
All services were restored by 22:55 UTC, and performance returned to normal.
Root Cause
The issue was caused by a slowdown in an external service our platform relies on to communicate with systems on the internet. While we have backup mechanisms in place, the slowdown did not immediately trigger a switchover, which led to delays in how certain features responded. Once the external service stabilized, normal performance resumed.
Resolution
Our engineering team quickly identified the issue and closely monitored system performance until the external disruption was resolved. Once normal network resolution was restored, affected services resumed full functionality.
Next Steps
To help prevent similar issues in the future, we are taking the following actions:
Improving how our systems detect and respond to slowdowns in external services
Adjusting fallback behavior to ensure faster recovery during partial disruptions
Enhancing internal monitoring to identify and resolve these types of issues more quickly
We sincerely apologize for the inconvenience caused and appreciate your understanding. Ensuring platform stability and reliability remains our top priority.
Thank you for your continued trust.