5 Association
Discover association rules using the unsupervised machine learning technique of association to find co-occurrences in data.
- About Association
Identify the probability of co-occurring items in a collection using Association.
- Transactional Data
Understand transactional data, where a case includes a collection of items like a market basket at checkout.
- Association Algorithm
Use the Apriori algorithm to calculate association rules for frequent itemsets.
5.1 About Association
Identify the probability of co-occurring items in a collection using Association.
The relationships between co-occurring items are expressed as Association Rules.
- Association Rules
Identify the probability of co-occurring items in a collection within the data.
- Market-Basket Analysis
Use association rules to analyze sales transactions, such as customers frequently buying cereal and milk together.
- Association Rules and eCommerce
Apply association rules in eCommerce to personalize web pages by predicting user behavior based on page visits.
5.1.1 Association Rules
Identify the probability of co-occurring items in a collection within the data.
The results of an association model are the rules that identify patterns of association within the data. Oracle Machine Learning for SQL does not support the scoring operation for association modeling.
Association rules are assessed using the following measures:
- Support: How often do these items occur together in the data?
- Confidence: How frequently does the consequent occur in transactions that contain the antecedent?
- Value: How much business value is connected to item associations?
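Support and confidence are commonly formalized as shown below. This is a sketch in notation that does not appear in the original text: T is assumed to be the set of all transactions, A the antecedent itemset, B the consequent itemset, and X any itemset.

```latex
\[
\operatorname{support}(X) = \frac{\lvert \{\, t \in T : X \subseteq t \,\} \rvert}{\lvert T \rvert},
\qquad
\operatorname{support}(A \Rightarrow B) = \operatorname{support}(A \cup B),
\]
\[
\operatorname{confidence}(A \Rightarrow B) = \frac{\operatorname{support}(A \cup B)}{\operatorname{support}(A)}.
\]
```

Under these definitions, confidence is the fraction of transactions containing the antecedent that also contain the consequent.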
5.1.2 Market-Basket Analysis
Use association rules to analyze sales transactions, such as customers frequently buying cereal and milk together.
Association rules are often used to analyze sales transactions. For example, customers who buy cereal at the grocery store often buy milk at the same time. Suppose association analysis finds that 85% of the checkout sessions that include cereal also include milk. This relationship can be formulated as the following rule:
Cereal implies milk with 85% confidence
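The confidence figure in a rule like this one can be checked by direct counting. The following is a minimal sketch, not Oracle Machine Learning code: the basket data is hypothetical and was chosen so that 17 of the 20 cereal baskets also contain milk, and the helper names `support` and `confidence` are illustrative.

```python
from fractions import Fraction

# Hypothetical checkout sessions: 20 contain cereal, and 17 of those also contain milk.
baskets = (
    [{"cereal", "milk"}] * 17
    + [{"cereal", "bread"}] * 3
    + [{"milk", "eggs"}] * 5
)


def support(baskets, items):
    """Fraction of all baskets that contain every item in `items`."""
    return Fraction(sum(1 for b in baskets if items <= b), len(baskets))


def confidence(baskets, antecedent, consequent):
    """Fraction of baskets containing the antecedent that also contain the consequent."""
    with_antecedent = [b for b in baskets if antecedent <= b]
    if not with_antecedent:
        raise ValueError("antecedent never occurs in the data")
    with_both = [b for b in with_antecedent if consequent <= b]
    return Fraction(len(with_both), len(with_antecedent))


rule_confidence = confidence(baskets, {"cereal"}, {"milk"})
rule_support = support(baskets, {"cereal", "milk"})
print(f"cereal => milk: confidence {float(rule_confidence):.0%}, "
      f"support {float(rule_support):.0%}")
```

With this hypothetical data the rule has 85% confidence and 68% support, which shows that the two measures answer different questions about the same pair of items.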
This application of association modeling is called market-basket analysis. It is valuable for direct marketing, sales promotions, and for discovering business trends. Market-basket analysis can also be used effectively for store layout, catalog design, and cross-sell.
5.1.3 Association Rules and eCommerce
Apply association rules in eCommerce to personalize web pages by predicting user behavior based on page visits.
Association modeling has important applications in other domains as well. For example, in e-commerce applications, association rules may be used for Web page personalization. An association model might find that a user who visits pages A and B is 70% likely to also visit page C in the same session. Based on this rule, a dynamic link can be created for users who are likely to be interested in page C. The association rule is expressed as follows:
A and B imply C with 70% confidence
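One way such a rule might drive a dynamic link is sketched below. This is an illustration under stated assumptions, not an Oracle API: the `Rule` class, the `suggest_links` function, the `min_confidence` threshold, and the second rule in `RULES` are all hypothetical. Only the rule "A and B imply C" at 70% confidence comes from the text.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rule:
    antecedent: frozenset  # pages that must already have been visited
    consequent: str        # page the rule predicts
    confidence: float


# Hypothetical rule set; only the first rule is taken from the text.
RULES = [
    Rule(frozenset({"A", "B"}), "C", 0.70),
    Rule(frozenset({"A"}), "D", 0.40),
]


def suggest_links(visited, rules, min_confidence=0.5):
    """Return pages to link to, strongest rule first, for the current session."""
    visited = set(visited)
    matches = [
        rule for rule in rules
        if rule.antecedent <= visited           # whole antecedent was visited
        and rule.consequent not in visited      # do not suggest a page already seen
        and rule.confidence >= min_confidence   # ignore weak rules
    ]
    matches.sort(key=lambda rule: rule.confidence, reverse=True)
    return [rule.consequent for rule in matches]


print(suggest_links(["A", "B"], RULES))
```

A session that has visited pages A and B gets a link to C, while a session that has visited only A gets nothing because the hypothetical A-to-D rule falls below the threshold.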
5.2 Transactional Data
Understand transactional data, where a case includes a collection of items like a market basket at checkout.
Unlike other machine learning functions, association is transaction-based. In transaction processing, a case includes a collection of items such as the contents of a market basket at the checkout counter. The collection of items in the transaction is an attribute of the transaction. Other attributes might be a timestamp or user ID associated with the transaction.
Transactional data, also known as market-basket data, is said to be in multi-record case format because a set of records (rows) constitutes a case. For example, in the following figure, case 11 is made up of three rows while cases 12 and 13 are each made up of four rows.
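The multi-record case format can be illustrated with a short sketch. The row counts per case match the description above (three rows for case 11, four rows each for cases 12 and 13), but the item names are hypothetical, as is the `group_cases` helper.

```python
from collections import defaultdict

# Hypothetical rows in multi-record case format: (case_id, item).
rows = [
    (11, "cereal"), (11, "milk"), (11, "bread"),
    (12, "milk"), (12, "eggs"), (12, "butter"), (12, "juice"),
    (13, "cereal"), (13, "milk"), (13, "eggs"), (13, "coffee"),
]


def group_cases(rows):
    """Collect the rows that share a case ID into one item set per case."""
    cases = defaultdict(set)
    for case_id, item in rows:
        cases[case_id].add(item)
    return dict(cases)


cases = group_cases(rows)
for case_id, items in sorted(cases.items()):
    print(case_id, len(items), sorted(items))
```

Each case ID maps to the full collection of items in that transaction, which is the unit an association model analyzes.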
Nontransactional data is said to be in single-record case format because a single record (row) constitutes a case. In Oracle Machine Learning, association models can be built on either transactional data or nontransactional (two-dimensional) data. Transactional format is the usual format, but the association rules model also accepts two-dimensional input. Nontransactional data can be transformed into a nested column to make it transactional before association machine learning activities are performed. For nontransactional input, each distinct combination of the content in all columns other than the case ID column is treated as a unique item.
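The sketch below shows one possible reading of how single-record data relates to transactional data. It assumes, as an interpretation rather than a documented behavior, that each column name and value pair becomes one item. The column names, the data, and the `to_transactional` helper are hypothetical, and the code does not reproduce the nested-column transformation itself.

```python
# Hypothetical single-record rows: one row per case, one column per attribute.
records = [
    {"case_id": 1, "age_band": "30-39", "region": "west"},
    {"case_id": 2, "age_band": "30-39", "region": "east"},
]


def to_transactional(records, case_id_column="case_id"):
    """Unpivot single-record rows into (case_id, item) rows.

    Assumption: an item is the text "column=value"; missing values are skipped.
    """
    rows = []
    for record in records:
        case_id = record[case_id_column]
        for column, value in record.items():
            if column == case_id_column or value is None:
                continue
            rows.append((case_id, f"{column}={value}"))
    return rows


transactional_rows = to_transactional(records)
for row in transactional_rows:
    print(row)
```

Under this assumption, both cases share the item `age_band=30-39` and differ only in their region item.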
5.3 Association Algorithm
Use the Apriori algorithm to calculate association rules for frequent itemsets.
Oracle Machine Learning uses the Apriori algorithm to calculate association rules for items in frequent itemsets.
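The general idea behind Apriori can be sketched in a few lines. This is a textbook-style illustration, not Oracle's implementation: the function names `apriori` and `generate_rules`, the thresholds, and the transactions are all hypothetical. The sketch finds frequent itemsets level by level, pruning any candidate that has an infrequent subset, and then derives rules from the frequent itemsets.

```python
from fractions import Fraction
from itertools import combinations


def apriori(transactions, min_support):
    """Return {itemset: support} for every itemset whose support >= min_support."""
    transactions = [frozenset(t) for t in transactions]
    n = len(transactions)

    def support(itemset):
        return Fraction(sum(1 for t in transactions if itemset <= t), n)

    # Level 1: frequent single items.
    singles = {frozenset([item]) for t in transactions for item in t}
    current = {s: support(s) for s in singles if support(s) >= min_support}
    frequent = dict(current)

    k = 2
    while current:
        # Join step: combine frequent (k-1)-itemsets into k-item candidates.
        candidates = {a | b for a, b in combinations(current, 2) if len(a | b) == k}
        # Prune step: every (k-1)-subset of a candidate must itself be frequent.
        candidates = {
            c for c in candidates
            if all(frozenset(sub) in current for sub in combinations(c, k - 1))
        }
        current = {c: support(c) for c in candidates if support(c) >= min_support}
        frequent.update(current)
        k += 1
    return frequent


def generate_rules(frequent, min_confidence):
    """Return (antecedent, consequent, support, confidence) for qualifying rules."""
    rules = []
    for itemset, itemset_support in frequent.items():
        for size in range(1, len(itemset)):
            for antecedent in map(frozenset, combinations(itemset, size)):
                # Every subset of a frequent itemset is frequent, so the lookup succeeds.
                rule_confidence = itemset_support / frequent[antecedent]
                if rule_confidence >= min_confidence:
                    rules.append((antecedent, itemset - antecedent,
                                  itemset_support, rule_confidence))
    return rules


# Hypothetical transactions.
transactions = [
    {"cereal", "milk", "bread"},
    {"cereal", "milk"},
    {"milk", "eggs"},
    {"cereal", "milk", "eggs"},
    {"bread", "eggs"},
]
frequent = apriori(transactions, min_support=Fraction(2, 5))
rules = generate_rules(frequent, min_confidence=Fraction(4, 5))
for antecedent, consequent, s, c in rules:
    print(sorted(antecedent), "=>", sorted(consequent), f"support={s} confidence={c}")
```

The prune step is what keeps the search manageable: a candidate is counted against the data only if all of its smaller subsets were already found to be frequent.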