Better Failure Detection in Serverless Apps
Too Long; Didn't Read
Error-handling libraries that run inside your code are blind to Lambda-specific failures such as timeouts, misconfigured packages, and out-of-memory errors, because in those cases the function is terminated (or never starts) before in-process code can report anything. The only prerequisite for log-based error detection, and for visibility in general, is that logs are pushed to CloudWatch (in most cases that is the default). From there, we can apply pattern matching and deduction to detect failure scenarios. The ability to detect failures across all functions, connect them with specific invocations, view their logs, and pull X-Ray traces for them significantly reduces the mean time to resolution in failure scenarios!
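To make the pattern-matching idea concrete, here is a minimal sketch in Python. It is an illustration rather than a description of any particular tool: the function name `classify_log_line` and the `FAILURE_PATTERNS` table are hypothetical, and the message strings are assumptions about how Lambda typically words these failures in CloudWatch. The exact text differs between runtimes and has changed over time, so treat the patterns as a starting point to verify against your own logs.

```python
import re
from typing import Optional

# Assumed message shapes. Lambda's exact wording varies by runtime and
# version, so check these against real log lines before relying on them.
FAILURE_PATTERNS = {
    "timeout": re.compile(r"Task timed out after [\d.]+ seconds"),
    "out_of_memory": re.compile(r"Runtime exited with error: signal: killed"),
    "import_error": re.compile(r"Unable to import module|Runtime\.ImportModuleError"),
}

# Lambda request IDs are UUIDs; finding one ties a failure to an invocation.
REQUEST_ID = re.compile(
    r"\b[0-9a-f]{8}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{12}\b"
)


def classify_log_line(line: str) -> Optional[dict]:
    """Return the failure kind and request ID for a log line, or None."""
    for kind, pattern in FAILURE_PATTERNS.items():
        if pattern.search(line):
            match = REQUEST_ID.search(line)
            return {
                "kind": kind,
                "request_id": match.group(0) if match else None,
            }
    return None
```

Running each incoming log line through a classifier like this yields a failure type plus, when the line carries one, the request ID needed to look up the rest of that invocation's logs and its trace. Some failures (an import error, for example) may be logged without a request ID, which is why the sketch returns `None` for that field instead of failing.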