Data development is the process of generating, collecting, storing, analyzing, computing, extracting, presenting, and sharing data.
Note In the preceding figure, you can perform the steps in the dashed-line boxes in DataWorks.
The data development process involves the following steps:
- Generate data: Each business system generates a large amount of structured data every day and stores the data in databases such as MySQL, Oracle, and ApsaraDB RDS databases.
- Collect and store data: You can synchronize data from business systems to MaxCompute. Then, you can use
the data storage and processing capabilities of MaxCompute to analyze the data.
The Data Integration service of DataWorks supports various data sources. You can use Data Integration to synchronize data from business systems to MaxCompute based on configured scheduling properties.
- Analyze and compute data: After data synchronization, you can use ODPS SQL and ODPS MR nodes to process data in MaxCompute, analyze data, and mine the data for value.
- Extract data: You can export data processing and analysis results to business systems for further processing.
- Present and share data: After data is extracted, you can present the big data processing and analysis results in multiple ways such as reports or a geographic information system (GIS). You can also share the results with other users.