MaxFrame APIs are classified into compatibility APIs and MaxFrame-specific APIs. Compatibility APIs integrate with various standard libraries, such as Pandas, to facilitate data processing. MaxFrame-specific APIs are introduced for distributed execution of tasks. When you use MaxFrame APIs to develop jobs, you can obtain data manipulation experience similar to that of standard databases and efficiently run large-scale data processing tasks on the MaxFrame platform.
MaxFrame-specific APIs
API type | API name | Description |
Task session | Starts a MaxFrame job session. The new_session method is called at the beginning of the script to initialize the execution of the entire task. Subsequent data processing is performed based on the interaction between the session object constructed during initialization and the backend service. | |
Input/Output | Builds a DataFrame object based on the data of an entire MaxCompute table, specific partitions of the table, or specific columns of the table. | |
Builds a DataFrame object based on the query results of MaxCompute SQL statements. | ||
Writes DataFrame data to a MaxCompute table. | ||
Task execution | Executes a task. MaxFrame uses a delayed computing model. Task execution is triggered only when the execute method is called. This API allows data conversions to be delayed to the end of the pipeline. This way, multiple operations can be executed at the same time in a single task. This reduces the amount of data transmitted between the client and the database and improves performance. | |
Acquisition of task results | Obtains the computing result of a task. The computing result is not directly transmitted to the client. This can reduce the amount of transmitted data and the workload on the client. To check part of the result data, you can call the fetch method after the execute method to retrieve the result data. |
MaxFrame for Pandas
DataFrame
API type
Reference
Constructor
Attributes
Mathematical calculation
Filtering, projection, and sampling
Sorting
Combining, joining, and merging
User-defined functions (UDFs), aggregates, and windows
Conversion
Series
API type
Reference
Constructor
Mathematical calculation
Filtering, projection, and sampling
Sorting
UDFs, aggregates, and windows
Index
API type
Reference
Constructor
Filtering, projection, and sampling
Conversion
Pandas APIs supported by MaxFrame
DataFrame
APIs that support all parameters:
DataFrame, add, radd, round, sub, rsub, mul, rmul, floordiv, rfloordiv, truediv, rtruediv, div, rdiv, mod, rmod, pow, rpow, eq, ne, lt, gt, le, head, reset_index, drop_duplicates, rename, sort_values, sort_index, join, merge, sample, transform, and agg
APIs that support specific parameters:
Series
APIs that support all parameters:
add, radd, round, sub, rsub, mul, rmul, floordiv, rfloordiv, truediv, rtruediv, div, rdiv, mod, rmod, pow, rpow, eq, ne, lt, gt, ge, le, head, drop, reset_index, drop_duplicates, sort_values, sort_index, sample, transform, and agg
APIs that support specific parameters: None
Index
APIs that support all parameters: drop_duplicates
APIs that support specific parameters:
astype: does not support astype('category').