This topic provides answers to frequently asked questions about Deep Learning Containers (DLC) of Platform for AI (PAI).
What do I do if the "exited with code 137" error message appears?
If the "exited with code 137" error message appears, you can use instances that have a larger memory size, increase the number of worker nodes, or modify the reserved memory size in your code.
In Linux, the error code 137 indicates that the process is forcibly aborted by the SIGKILL signal. The most common reason is high memory usage, also known as the Out of memory (OOM) error. You can identify the cause of insufficient memory based on the memory usage of the worker nodes in the task details and increase the available memory.