Characteristics of Data – Central Tendency and Dispersion
February 12, 2025
The Transformations Wrought About by the Digital Age This is the age of Smart Machines that can perform tasks that hitherto were being done by humans. Right from manufacturing automation where robots work on the factory floor to make products, to Artificial Intelligence driven algorithms that have predictive capabilities, and Big Data enabled Search Engines, […]
Demographic Dividend We often hear the term, demographic dividend being bandied about in the media and in books. However, not many are aware what exactly it means and the implications of the same for business and society. The term refers to the advantages that countries have if they have youthful populations. This is because there […]
Business Intelligence (BI) – An ERP product provides hundreds of standard reports, generated weekly or monthly or year end basis which acted as a hardcopy dashboards within the organization. In order to improve report generation functionality, an ERP system provides various report writing tools (such as crystal report writer, safari report writer) as well as […]
Cross Border Mergers and Acquisitions Cross border Mergers and Acquisitions or M&A are deals between foreign companies and domestic firms in the target country. The trend of increasing cross border M&A has accelerated with the globalization of the world economy. Indeed, the 1990s were a “golden decade” for cross border M&A with a nearly 200 […]
Shift in the Process: A shift in the process is defined as a pattern in which seven consecutive data points appear on only one side of the mean. Statistical studies have shown that if the number of points on one side of the mean is less than 7 then it is not a non-random pattern […]
The shape of the data determines the type of tools that can be used to draw conclusions from it. Here is how to graphically plot out the data to find its shape:
Step 1: Plot Data into Categories
To begin with, the data must be divided into equal categories. The categories must have equal intervals to make the data meaningful. Then a frequency table must be prepared from the available data set and the number of times an item occurs within an interval category must be noted down.
Step 2: Draw a Histogram
The next step is to plot the data intervals on a graph paper and create a histogram. A histogram is nothing but a bar chart of a continuous set of data with equal intervals.
Step 3: Join the Midpoints to Find the Shape
The next step is to plot the midpoints of the bars of the histogram. These midpoints must then be joined to develop the curve of the data that is also called the shape of the data.
Amongst the many characteristics of the shape of the data that are important, perhaps the prime category is symmetry. The reasons for the same have been listed below.
The shape of the data is of such prime importance because statistical techniques have been developed which can make decisions about the probability of data based on its shape. The details of the same are as follows:
Symmetrical Data: Symmetrical data sis the easiest type of data to work with. This is because many statistical techniques have been developed for the same. In fact symmetrical data is so common that it is called the normal curve. It also has other names like the bell curve. There are standard measurements available which can tell the probability of a data point occurring based on the number of standard deviations it is away from the mean. From a six sigma point of view it helps understand how the results of a process are likely to be distributed.
Most things which are measured continuously in nature as well as in operations have the normal distribution. It is for this reason that the applications of symmetrical data are enormous.
Skewed Data: Many times the data is not symmetrical i.e it is skewed towards one side. Data can be either positively or negatively skewed. There are statistical techniques available which help us find out the probability distributions of skewed data too. However such techniques are not very well developed. This is because most of the sample data being collected usually follows the normal distribution. Statistical analysis of skewed data is therefore not often performed.
Your email address will not be published. Required fields are marked *