All Products
Search
Document Center

Object Storage Service:Use Big Data Tools to connect JetBrains IDEs to OSS

Last Updated:Nov 15, 2023

The Big Data Tools plug-in lets you manage Object Storage Service (OSS) buckets and objects directly from JetBrains IDEs.

What is Big Data Tools?

Big Data Tools is a JetBrains IDE plug-in that you can use to:

  • Connect to remote file systems and storage systems such as OSS.

  • Manage files in a way similar to using a file manager, for example, copy, move, rename, delete, and download objects.

  • Preview files and switch between the Text view and Table view of CSV files.

  • Obtain information about files, such as modification time.

  • Preview binary files, such as Parquet.

  • Open storage buckets and directories in separate tabs.

For more information, see Big Data Tools.

Step 1: Install the Big Data Tools plug-in

  1. Start one of the following JetBrains IDEs:

    • IntelliJ IDEA Ultimate

    • PyCharm Professional

    • DataGrip

    • DataSpell

  2. Open the IDE settings.

    • On macOS, choose IntelliJ IDEA > Preferences from the main menu, or press ⌘,.

    • On Windows or Linux, choose File > Settings, or press Ctrl+Alt+S.

  3. In the Preferences window, install the Big Data Tools plug-in.

    1. In the left-side navigation pane, choose Plugins.

    2. Click the Marketplace tab.

    3. In the User Data search box, enter Big Data Tools.

    4. Click Install next to the Big Data Tools result.

    5. Click OK.

    install-big-data-tools

    After the Big Data Tools plug-in is installed, Big Data Tools appears on the right side of the IDE.

Step 2: Connect to OSS

You can use the Big Data Tools plug-in to connect to supported monitoring services and storage services, such as OSS.

  1. In the right-side bar of the IDE, choose Big Data Tools > Alibaba OSS.open-alibaba-oss

  2. In the Big Data Tools dialog box, complete the following configurations:

    1. Specify a name for the connection to distinguish it from other connections.

    2. Select a region in which the buckets you want to connect are located.

      For more information about all regions in which OSS is available, see Regions and endpoints.

    3. Select a method to obtain buckets.

      • Select Custom roots, and in the Roots field, specify the name of the bucket or the path to a directory in the bucket. If you specify multiple names or paths, separate them with commas (,).

      • Select All buckets in the account. You can then use the bucket filter to show only buckets with particular names.

    4. From the Authentication type list, select Explicit access key and secret key. In the Access key field, enter the AccessKey ID of the RAM user. In the Secret key field, enter the AccessKey secret.

      For more information about how to create an AccessKey pair, see Create an AccessKey pair.

    5. Click OK.

    create-a-big-data-tools-connection

Step 3: Manage OSS buckets and objects

After the connection is established, you can then use and manage OSS buckets and objects in the IDE.

Preview a bucket on a separate tab

You can preview a bucket on a separate tab of the editor.

  1. To preview a bucket on a separate tab, select the bucket and click Open in Separate Viewer. preview-in-a-separate-viewer

    You can view buckets on separate tabs, quickly switch between tabs for different buckets, and expand directories and structured objects such as CSV, Parquet, ORC, and Avro to view data columns and data types.

    preview-result
  2. Optionally, perform the following operations:

    • Click the image.png icon in the tabs at the top of the editor to customize the view (showing and hiding specific columns).

    • Click the image.png icon to upload data from your disk or click the image.png icon to copy an object.

    • Copy and move an object or a directory by dragging it to another bucket or directory.

Query object information

You can use the Big Data Tools plug-in to query object information, such as the modification time, object type, and storage class.

Right-click the object and select Show Info.

view-file-info

A box appears to display information about the object.

view-file-info-result

Preview binary objects

You can use the Big Data Tools plug-in to preview binary objects such as Avro, Parquet, and ORC.

  1. Double-click the object.

    The object is opened on a separate tab of the editor.

  2. Optionally, perform the following operations:

    • Use the search box at the top of the tab to search for a specific string.

      The following example uses a regular expression to search for data entries that contain a specified string.

      preview-a-binary-file-result
    • Sort the table by clicking its headers.

    • Right-click a cell to copy the cell, row, or column, or download the entire table.download