Central Tendency Computation

Machine Learning Patterns, Mechanisms > Data Exploration Patterns > Central Tendency Computation


Central Tendency Computation (Khattak)

How can the make-up of a dataset be determined in terms of the normal set of values?


Before solving a machine learning problem, a preliminary understanding of the input data is required. However, not knowing which techniques to start with can negatively impact the subsequent model development.


The dataset is analyzed and the values that normally occur around the center of a distribution as well the most occurring values are calculated via established statistical techniques.


The dataset is arranged in ascending or descending order, and the measures of central tendency (mean, median, and mode) are calculated.


Query Engine, Processing Engine, Resource Manager, Storage Device
















Variable x belongs to a dataset. An understanding of the value of variable x is required (1). To develop this understanding, the measures of central tendency are found (2). Based on the values of the mean, median and mode, it is determined that the most common value is 5 (3).

Module 12: Fundamental Service API Design & Management

This pattern is covered in Machine  Learning Module 2: Advanced Machine Learning.

For more information regarding the Machine  Learning Specialist curriculum, visit www.arcitura.com/machinelearning.