- Consider the data set of market basket transactions shown in the following table:
| Customer ID | Transaction ID | Items Bought |
|---|---|---|
| 1 | 0001 | {a,d,e,f} |
| 1 | 0024 | {a,b,c} |
| 2 | 0012 | {b,d,e,f} |
| 2 | 0031 | {a,c,e} |
| 3 | 0015 | {b,d,f} |
| 3 | 0022 | {a,b} |
| 4 | 0029 | {a,b,c} |
| 4 | 0040 | {a,b,d,e} |
| 5 | 0033 | {e,b,d} |
| 5 | 0038 | {f,c,e} |
- (a) Compute the support (%) for the itemsets {e}, {b, d}, and {b, d, e}, treating each transaction ID as a market basket.
- (b) Use the results from part (a) to compute the confidence of the association rules {b, d} → {e} and {e} → {b, d}.
- (c) Is confidence a symmetric or an asymmetric measure?
- (d) Repeat part (a), treating each customer ID as a market basket. Treat each item as a binary variable (1 if the item appears in at least one transaction bought by the customer, and 0 otherwise).
- (e) Use the results from part (d) to compute the confidence of the association rules {b, d} → {e} and {e} → {b, d}.
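
The two basket definitions can be compared with a short Python sketch. It assumes the standard definitions: the support of an itemset is the fraction of baskets that contain it, and the confidence of X → Y is s(X ∪ Y) / s(X). The names `ROWS`, `support`, `confidence`, `transaction_baskets`, and `customer_baskets` are illustrative choices, not part of the exercise.

```python
from collections import defaultdict

# (customer ID, transaction ID, items bought), copied from the table above.
ROWS = [
    (1, "0001", {"a", "d", "e", "f"}),
    (1, "0024", {"a", "b", "c"}),
    (2, "0012", {"b", "d", "e", "f"}),
    (2, "0031", {"a", "c", "e"}),
    (3, "0015", {"b", "d", "f"}),
    (3, "0022", {"a", "b"}),
    (4, "0029", {"a", "b", "c"}),
    (4, "0040", {"a", "b", "d", "e"}),
    (5, "0033", {"e", "b", "d"}),
    (5, "0038", {"f", "c", "e"}),
]


def support(baskets, itemset):
    """Fraction of baskets that contain every item in `itemset`."""
    itemset = set(itemset)
    return sum(itemset <= basket for basket in baskets) / len(baskets)


def confidence(baskets, antecedent, consequent):
    """Confidence of antecedent -> consequent, i.e. s(X u Y) / s(X).

    Assumes the antecedent occurs in at least one basket.
    """
    x = set(antecedent)
    xy = x | set(consequent)
    n_x = sum(x <= basket for basket in baskets)
    n_xy = sum(xy <= basket for basket in baskets)
    return n_xy / n_x


# Each transaction ID is its own basket.
transaction_baskets = [items for _, _, items in ROWS]

# Each customer ID is one basket: the union of that customer's transactions,
# so an item counts as present if it appears in at least one of them.
merged = defaultdict(set)
for customer_id, _, items in ROWS:
    merged[customer_id] |= items
customer_baskets = list(merged.values())

for label, baskets in [("per transaction", transaction_baskets),
                       ("per customer", customer_baskets)]:
    print(label)
    for itemset in ({"e"}, {"b", "d"}, {"b", "d", "e"}):
        print(f"  support {sorted(itemset)} = {support(baskets, itemset):.0%}")
    print(f"  conf {{b,d}} -> {{e}} = {confidence(baskets, {'b', 'd'}, {'e'}):.0%}")
    print(f"  conf {{e}} -> {{b,d}} = {confidence(baskets, {'e'}, {'b', 'd'}):.0%}")
```

Swapping the antecedent and consequent changes only the denominator of the confidence ratio, which is the point to check when answering the symmetry question.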