Application Real-Time Monitoring Service (ARMS) Application Monitoring is an application performance management (APM) service. Install an ARMS agent to monitor your application without code changes. You can track application health, locate slow or failing interfaces, identify performance bottlenecks, and capture request parameters for faster root-cause analysis.
| You want to... | Use... |
|---|---|
| View application health and topology at a glance | Monitoring and observability |
| Trace calls to databases, external services, and message queues | Interface and dependency tracking |
| Follow a request across services and investigate errors | Tracing and error analysis |
| Profile CPU, memory, or threads at the code level | Advanced diagnostics |
| Set up alerts and customize agent behavior | Events, alerts, and configuration |
Monitoring and observability
Real-time visibility into application health, from high-level topology to host and JVM metrics.
| Feature | Description |
|---|---|
| Application overview | Displays key metrics, upstream and downstream dependent components, and the application topology in a single view. |
| Application details | Displays topology, number of requests, response time, number of slow calls, and HTTP status codes by application and instance. |
| Host monitoring | Tracks host-level CPU, memory, disk, load, network traffic, and network packets. |
| Container monitoring | Tracks pod-level CPU, physical memory, network traffic, and network packets. |
| JVM monitoring | Tracks Java virtual machine (JVM) metrics: instantaneous garbage collections (GCs), heap memory, non-heap memory, metaspace, direct buffer, and JVM threads. |
| Thread pool and connection pool monitoring | Tracks thread pool metrics: core threads, existing threads, maximum allowed threads, active threads, and task queue capacity. |
Interface and dependency tracking
Track how your application communicates with databases, external services, message queues, and scheduled tasks.
| Feature | Description |
|---|---|
| Interface monitoring | Monitors interface call details, including SQL calls, NoSQL calls, exceptions, errors, upstream and downstream services, and traces. |
| Database call monitoring | Tracks database call details: overview, SQL calls, exceptions, call sources, and traces. |
| NoSQL call monitoring | Tracks NoSQL call details: call overview, exceptions, and traces. |
| External call monitoring | Locates slow or faulty external calls from your application. |
| Message queue monitoring | Displays message publishing and topic subscription activity in ApsaraMQ for RocketMQ. |
| Scheduled task monitoring | Tracks scheduled task execution, including overview, SQL calls, NoSQL calls, exceptions, errors, downstream services, and traces. |
Tracing and error analysis
Trace requests across services and drill into exceptions and errors to accelerate root-cause analysis.
| Feature | Description |
|---|---|
| Trace query | Displays details of each interface call, including duration, status, and timestamp. |
| Trace query (by trace ID) | Looks up trace details by a specific trace ID, with support for multiple filter conditions. |
| Trace Explorer | Filters and aggregates stored full trace data in real time. Supports custom diagnostic workflows with flexible query conditions. |
| Exception analysis | Lists application exceptions with detail views for quick triage. |
| Error analysis | Lists application errors with detail views for quick triage. |
| Log analysis | Analyzes logs to pinpoint application exceptions. |
Advanced diagnostics
Profile CPU usage, analyze memory, and inspect threads at the code level with minimal performance overhead.
| Feature | Description |
|---|---|
| Continuous profiling | Diagnoses CPU utilization and memory usage with minimal overhead, categorized by method, class, and line number. Helps optimize programs, reduce latency, increase throughput, and lower costs. |
| Thread profiling | Displays thread-level CPU time consumption and thread counts by type to simulate the code execution process. Use this to locate CPU-intensive threads or slow methods. |
| Memory snapshot | Creates and analyzes memory snapshots to troubleshoot memory leaks and memory waste. |
| Arthas diagnostics | Uses bytecode enhancement to inspect application state without restarting Java processes. |
Events, alerts, and configuration
Set up alerts, review events from Alibaba Cloud services, and customize agent behavior.
| Feature | Description |
|---|---|
| Event center | Centralizes, stores, analyzes, and displays event data from Alibaba Cloud services. Automatically analyzes and displays events from supported services in a unified manner. |
| Alert rule | Creates custom alert rules for your application. When triggered, notifications are sent to specified contacts or a DingTalk group chat. |
| Custom configurations | Adjusts the trace sampling rate, agent switch, and slow SQL query threshold. |