All Products
Search
Document Center

Data Lake Formation:Storage overview

Last Updated:May 14, 2024

Data Lake Formation (DLF) allows you to view information such as storage usage, metadata objects, storage trends, storage class distribution, storage format distribution, and file distribution. The information helps you quickly understand storage resource usage, identify issues, and perform optimization accordingly.

Prerequisites

  • Object Storage Service (OSS) is activated.

  • Location hosting is complete in DLF.

Enable storage overview

  1. Log on to the DLF console, choose Lake Management > Storage Overview in the left-side navigation pane, and then click Enable to enable the Storage Overview feature.

Important

  1. If you enable this feature, OSS buckets of metadatabases are written to statistical files. You are charged for the storage of these files.

  2. No statistical data is generated on the day you enable storage overview. You can view statistical data on the next day.

开通存储概览-立即启用

Feature description

Metadata analysis

Summary of resources

  • Total storage space used and monthly and daily changes: the total OSS storage space used for storing tables that are displayed on the Metadata page.

  • Total number of tables and monthly and daily changes: the total number of tables that are displayed on the Metadata page.

  • Total number of databases and monthly and daily changes: the total number of databases that are displayed on the Metadata page.

  • Monthly and daily API visits: the number of API visits of the current month (calendar month).

资源总计

Trend change

This section displays the trend charts of the storage capacity, table quantity, database quantity, and API visits.

You can select a time period for the query.

趋势变化

Rankings of table and database storage

This section displays the rankings of the OSS storage space used for tables and databases. You can optimize the top-ranked tables and databases based on your business requirements.

表/库存储排名

Storage class distribution

This section displays the distribution of OSS storage classes. OSS provides the following storage classes: Standard, Infrequent Access (IA), Archive, and Cold Archive. You can select storage classes that are suitable for different business data based on your needs to optimize storage costs.

DLF also provides the lifecycle management feature to allow automatically archiving data in data lakes.

存储分层分布

Storage format distribution

This section displays the storage format distribution of tables.

存储格式分布

File distribution and rankings of small files (including ultra-small files)

This section displays the distribution of files at different size levels and rankings of small files (including ultra-small files). This helps you optimize the tables with a large number of small files based on your business requirements to improve query performance.

大小文件分布和排名

Location analysis

Location storage trend analysis

image

Location request trend analysis

image

Location storage ranking

image