By Ranjith Udayakumar, Alibaba Cloud Tech Share Author. Tech Share is Alibaba Cloud's incentive program to encourage the sharing of technical knowledge and best practices within the cloud community.
"The best vision is insight – Malcolm Forbes". When it comes to data analytics for enterprises, nothing is more important than making accurate and reliable inferences from data. It is no surprise that enterprises are investing heavily on big data analytics as they can reap larger profits with accurate insights. However, this is often easier said than done. Data collected from real-world applications is affected by many variables, making data prediction challenging. Regardless, data analytics remain essential for many, if not all, businesses around the world.
In this article, I will walk you through the process of deciphering data to uncovering hidden insights from this data.
This article is meant for everyone! This includes students who just want to familiarize with general concepts, professional data analysts who want to learn new ways to analyze data, and business decision makers who want to know how to get better insights from business data. If you are not familiar with big data and analytics, you should browse for our free e-learning classes on this subject, and try the Alibaba Cloud Apsara Cloud Certifications to consolidate your knowledge.
This article covers the overall process of deciphering data from conceptual, practical, and best practice perspectives. Anyone with valid data can use this article as a guide to get insights from data with the help of open-source technologies. However, if you are doing data analytics for business intelligence, I strongly recommend using Alibaba Cloud QuickBI.
To use Alibaba Cloud QuickBI, you need to do the following:
For this article, we are going to be looking at:
We will be covering the entire process of deciphering data. The overall process involves:
This multi-part article talks about how to collect data, wrangle the data, ingest the data, model the data, and visualize the data from three viewpoints (conceptual, practical, and best practice).
In the first article in this series, we are going to see how to understand the data better.
When it comes to big data, more data isn't necessarily better. Your data is only as good as your ability to understand and communicate it, which is why understanding the data is so essential.
Once you've got your data, you need to consider the following problems:
You will need to address these questions for your data analysis to be effective. We will provide some generalized answers for the above questions in this article.
We should analyze the data to understand the domain it belongs to. With the domain in mind we should ask right questions against the data to get insights out of it. For example, if the data shows ATM location details, transaction type, number of transactions, and transaction amount, it clearly depicts the data belongs to the BFSI domain.
After we determine the domain, it's now our turn to decide what type of insights that we can infer out of it from the given data. We will do this in our practical section.
We should look for some "interesting" insights. As we discussed earlier, we need to ask right questions against the data to understand it better and decipher insights.
For example, let's assume you have some understanding about the BFSI domain. Then, we should able to differentiate the Facts (Measures) and Dimensions (Other than Measures) from the data to get a clear idea about the data.
It's now our turn to understand what are the facts and dimensions available, what are the right questions that we need to ask to the given data. We can do this in our practical section.
We need to choose the right tool to wrangle, process, visualize the data effectively. There are lot of tools available in market, all of them with their own unique strengths.
When deploying on the cloud, I prefer using Alibaba Cloud Quick BI, which covers the majority of tasks needed to be done in ease at an affordable price.
In this article we are going to utilize Alibaba Cloud QuickBI as a tool to decipher the data to get the insights out of it. We will explore how to do this in our practical section.
As we discussed earlier, we are going to understand the data better with real use cases.
Here we will use the data from ATM Dataset.
What Do You Do with It?
As mentioned previously, we know that this data belongs to the BFSI domain. Specifically, this data talks about ATM Transactions. Now before digging deeper, we need to understand the domain basics and how the business users will see it to proceed with next question.
What Should You Look For?
As we discussed earlier we need to ask right questions to understand the data better. We need to differentiate the Facts (Measures) and Dimensions (Other than the Measures).
The Facts include:
The Dimensions include:
After separating the facts and dimensions, we can now ask questions about the data. Questions may include:
These questions are key to deriving insights from the data. Without the right questions, we can't derive the value we need from the data.
Here we will use the data from Customer360.
What Do You Do with It?
Similar to the previous use case, we know the data belongs to the BFSI domain, specifically on bank customer details. Now before digging deeper, we need to understand the domain basics and how the business users will see it to proceed with next question.
What Should You Look For?
Similarly, we need to differentiate the Facts (Measures) and Dimensions (Other than the Measures).
The Facts are:
The Dimensions are:
After separating the facts and dimensions, we can ask questions such as:
These questions are key to deriving insights from the data. Let's now look at the best practices of understanding data.
Here are some of the best practices when trying to make sense out of data, particularly data relating to the two use cases above.
I hope that this article gives you a better grasp of the basic principles on data analytics, specifically on understanding your data. If you want to know more about big data and analytics, I highly recommend the Alibaba Cloud Apsara Cloud Certifications. You can advance your skills by learning, and even earn official Alibaba Cloud certifications to demonstrate your professional competency.
In the next article of this series, we will be exploring how to wrangle the data. Please ensure that you have registered on Alibaba Cloud because we will be using QuickBI for other articles in this series. Stay tuned.
"Torture the data, and it will confess to anything – Ronald Coase"
Deep Dive into Computer Vision with Neural Networks – Part 2
5 Best Practices for Different Web Application Hosting Scenarios
2,599 posts | 762 followers
FollowAlibaba Clouder - March 1, 2019
Alibaba Clouder - March 1, 2019
Alibaba Clouder - October 15, 2018
Alibaba Clouder - October 9, 2018
Alibaba Clouder - July 15, 2020
Shane Duggan - March 8, 2023
2,599 posts | 762 followers
FollowA new generation of business Intelligence services on the cloud
Learn MoreA powerful and accessible data visualization tool
Learn MoreConduct large-scale data warehousing with MaxCompute
Learn MoreMore Posts by Alibaba Clouder