Logstash processes data through a pipeline of inputs, filters, and outputs. By connecting an ApsaraMQ for Kafka instance as an input, Logstash consumes messages from Kafka topics and forwards them to destinations such as Elasticsearch, file storage, or stdout.
This architecture provides two benefits:
Asynchronous processing: Kafka buffers messages so that Logstash processes them at its own pace, preventing data loss during traffic spikes.
Decoupling: If a downstream system such as Elasticsearch goes offline, Kafka retains messages until the system recovers. Upstream producers remain unaffected.
This topic describes how to configure Logstash to consume messages from an ApsaraMQ for Kafka instance over a virtual private cloud (VPC) connection.
Prerequisites
Before you begin, make sure that you have:
An ApsaraMQ for Kafka instance purchased and deployed. For more information, see Purchase and deploy a VPC-connected instance.
Logstash installed. For more information, see Download Logstash.
Java Development Kit (JDK) 8 installed. For more information, see the Java 8 download page.
The Logstash Kafka input plugin installed. Run the following command to verify that the plugin is listed:
bin/logstash-plugin list | grep logstash-input-kafka
If the plugin is not listed, install it:
bin/logstash-plugin install logstash-input-kafka
Step 1: Get the endpoint
Logstash connects to ApsaraMQ for Kafka through a VPC endpoint. ApsaraMQ for Kafka provides two types of VPC endpoints:
| Endpoint type | Port | When to use |
|---|---|---|
| Default endpoint | 9092 | Standard access without authentication |
| Simple Authentication and Security Layer (SASL) endpoint | 9094 | Authenticated access. Requires the access control list (ACL) feature to be enabled. For more information, see Enable the ACL feature. |
For more information about endpoint differences, see Comparison among endpoints.
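If you plan to connect through the SASL endpoint on port 9094, the Logstash Kafka input plugin must be configured to authenticate as well. The following is a minimal sketch, not a verified configuration: the file path and placeholder values are assumptions, while security_protocol, sasl_mechanism, and jaas_path are standard options of the Kafka input plugin.

```
input {
  kafka {
    bootstrap_servers => "<your-sasl-endpoint>:9094"             # SASL endpoint from the console
    security_protocol => "SASL_PLAINTEXT"                        # PLAIN authentication over the VPC
    sasl_mechanism    => "PLAIN"
    jaas_path         => "/etc/logstash/kafka_client_jaas.conf"  # assumed path to a JAAS file
  }
}
```

The referenced JAAS file holds the instance username and password from the Configuration Information section, in the standard Kafka client format:

```
KafkaClient {
  org.apache.kafka.common.security.plain.PlainLoginModule required
  username="<instance-username>"
  password="<instance-password>";
};
```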
To get the endpoint:
Log on to the ApsaraMQ for Kafka console.
In the Resource Distribution section of the Overview page, select the region where your instance is deployed.
On the Instances page, click the name of the target instance.
On the Instance Details page, find the endpoint in the Endpoint Information section. If you plan to use the SASL endpoint, note the Username and Password values in the Configuration Information section.

Step 2: Create a topic
Create a topic to hold the messages that Logstash will consume.
Create the topic in the same region as your Elastic Compute Service (ECS) instance. Topics cannot be used across regions. For example, if the producers and consumers run on an ECS instance in the China (Beijing) region, the topic must also be in the China (Beijing) region.
Log on to the ApsaraMQ for Kafka console.
In the Resource Distribution section of the Overview page, select the region where your instance is deployed.
On the Instances page, click the name of the target instance.
In the left-side navigation pane, click Topics.
On the Topics page, click Create Topic.
In the Create Topic panel, configure the following parameters and click OK.
| Parameter | Description | Example |
|---|---|---|
| Name | The topic name. | demo |
| Description | The topic description. | demo test |
| Partitions | The number of partitions. | 12 |
| Storage Engine | The storage engine type. Available only for Professional Edition instances. Standard Edition instances use Cloud Storage by default. Options: Cloud Storage -- Uses Alibaba Cloud disks with three replicas in distributed mode. Provides low latency, high performance, and high reliability. Required when the instance edition is Standard (High Write). Local Storage -- Uses the in-sync replicas (ISR) algorithm of open source Apache Kafka with three replicas in distributed mode. | Cloud Storage |
| Message Type | The message ordering type. Normal Message -- Messages with the same key are stored in the same partition in send order. Partition ordering may not be preserved during broker failures. Automatically selected when Storage Engine is Cloud Storage. Partitionally Ordered Message -- Messages with the same key remain ordered in the same partition even during broker failures. Some partitions may become temporarily unavailable. Automatically selected when Storage Engine is Local Storage. | Normal Message |
| Log Cleanup Policy | The log cleanup policy. Available only when Storage Engine is Local Storage (Professional Edition only). Delete -- Default policy. Retains messages based on the maximum retention period. Deletes the earliest messages when storage usage exceeds 85%. Compact -- Retains only the latest value for each message key. For more information, see aliware-kafka-demos. Important: You can use log-compacted topics only in specific cloud-native components, such as Kafka Connect and Confluent Schema Registry. | Compact |
| Tag | Tags to attach to the topic. | demo |
After the topic is created, it appears on the Topics page.
Step 3: Send test messages
Send a test message to verify that the topic is ready for consumption.
Log on to the ApsaraMQ for Kafka console.
Navigate to the target instance and click Topics in the left-side navigation pane.
Click the topic name, then click Send Message in the upper-right corner of the Topic Details page.
In the Start to Send and Consume Message panel, select a Sending Method (Console, Docker, or SDK) and send a test message. If you want to send the message to a specific partition, you can specify the partition ID. For information about how to query partition IDs, see View partition status.
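Alternatively, if the command-line tools that ship with open source Apache Kafka are available on a machine in the same VPC, you can send a test message from the shell. A sketch only: the endpoint below is a placeholder, and the topic name assumes the demo topic created in Step 2.

```shell
# Send one message to the topic from stdin.
# Replace the endpoint with your instance's VPC endpoint.
echo "hello logstash" | bin/kafka-console-producer.sh \
  --bootstrap-server alikafka-pre-cn-zv****-1-vpc.alikafka.aliyuncs.com:9092 \
  --topic demo
```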
Step 4: Create a consumer group
Create a consumer group for Logstash to track its consumption offset.
Log on to the ApsaraMQ for Kafka console.
Navigate to the target instance and click Groups in the left-side navigation pane.
On the Groups page, click Create Group.
In the Create Group panel, enter a Group ID and Description, attach tags if needed, and click OK.
After the consumer group is created, it appears on the Groups page.
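After Logstash starts consuming in Step 5, you can confirm that this group is committing offsets by using the consumer group tool from open source Apache Kafka. A sketch, assuming the tool runs on a machine in the same VPC; the endpoint and group ID are placeholders.

```shell
# Describe the group's partitions, committed offsets, and consumer lag.
# Replace the endpoint and group ID with your own values.
bin/kafka-consumer-groups.sh \
  --bootstrap-server alikafka-pre-cn-zv****-1-vpc.alikafka.aliyuncs.com:9092 \
  --describe --group logstash_group
```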
Step 5: Configure and run Logstash
Navigate to the Logstash installation directory.
Create a configuration file named input.conf in the bin directory:
input {
kafka {
bootstrap_servers => "<your-kafka-endpoint>" # VPC endpoint, e.g., alikafka-pre-cn-zv****-1-vpc.alikafka.aliyuncs.com:9092
group_id => "<your-consumer-group-id>" # Consumer group created in Step 4
topics => ["<your-topic-name>"] # Topic created in Step 2
consumer_threads => 12 # Match this to the number of topic partitions
auto_offset_reset => "earliest" # Start from the earliest message
}
}
output {
stdout { codec => rubydebug }
}
Run Logstash:
./logstash -f input.conf
Configuration parameters
| Parameter | Description | Example |
|---|---|---|
| bootstrap_servers | Comma-separated list of VPC endpoints for the ApsaraMQ for Kafka instance. Default endpoint or SASL endpoint. | alikafka-pre-cn-zv\*\*\*\*-1-vpc.alikafka.aliyuncs.com:9092 |
| group_id | Consumer group identifier. | logstash_group |
| topics | Topic name or names to consume from. | logstash_test |
| consumer_threads | Number of consumer threads. We recommend that you set this to the number of topic partitions. | 12 |
| auto_offset_reset | Offset reset strategy. earliest: consume from the first available message. latest: consume only new messages. | earliest |
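The stdout output is useful for verification; in production you would typically forward messages to a destination such as Elasticsearch instead. A minimal sketch using the standard Logstash Elasticsearch output plugin, where the host and index name are placeholder assumptions:

```
output {
  elasticsearch {
    hosts => ["http://localhost:9200"]  # placeholder Elasticsearch address
    index => "kafka-%{+YYYY.MM.dd}"     # hypothetical daily index name
  }
}
```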
Verify message consumption
After Logstash starts, consumed messages print to stdout in Ruby debug format:
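The exact fields depend on the message and codec; the rubydebug codec renders each event as a Ruby-style hash. An illustrative sketch only, with placeholder values rather than captured output:

```
{
       "message" => "hello logstash",
    "@timestamp" => 2024-01-01T00:00:00.000Z,
      "@version" => "1"
}
```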

See also
Kafka input plugin -- Full parameter reference for the Logstash Kafka input plugin.
Comparison among endpoints -- Differences between ApsaraMQ for Kafka endpoint types.