Associativity Computation


Machine Learning Patterns, Mechanisms > Data Exploration Patterns > Associativity Computation

 

Associativity Computation (Khattak)

How can the existence of relationship(s) between variables in a dataset be determined?

Problem

Gaining an understanding of a dataset and the subsequent model development requires finding connections between variables. Failure to do so results in ineffective models comprising irrelevant variables as predictors.

Solution

The connection between variables is expressed in the form of relationship between variables and is quantified via the application of proven statistical techniques.

Application

Numerical values present in the dataset are taken in pairs and the measures of association (correlation and covariance) are calculated.

Mechanisms

Query Engine, Analytics Engine, Processing Engine, Resource Manager, Storage Device

 

 

 

 

 

 

 

 

 

A dataset contains values of ice cream sold for different temperature readings recorded over three days (1). The measures of association are found in order to determine whether the number of ice creams sold is related to the temperature readings (2). Based on the value of correlation, it is concluded that there is a strong positive relationship between the number of ice creams sold and the temperature readings, which means that as the temperature increases, more ice cream is sold and vice versa (3).

Module 12: Fundamental Service API Design & Management

This pattern is covered in Machine  Learning Module 2: Advanced Machine Learning.

For more information regarding the Machine  Learning Specialist curriculum, visit www.arcitura.com/machinelearning.