Heterogeneity

]

# Week 12: .fancy[Heterogeneity]

### <svg aria-hidden="true" role="img" viewBox="0 0 512 512" style="height:1em;width:1em;vertical-align:-0.125em;margin-left:auto;margin-right:auto;font-size:inherit;fill:white;overflow:visible;position:relative;"><path d="M243.4 2.6l-224 96c-14 6-21.8 21-18.7 35.8S16.8 160 32 160v8c0 13.3 10.7 24 24 24H456c13.3 0 24-10.7 24-24v-8c15.2 0 28.3-10.7 31.3-25.6s-4.8-29.9-18.7-35.8l-224-96c-8.1-3.4-17.2-3.4-25.2 0zM128 224H64V420.3c-.6 .3-1.2 .7-1.8 1.1l-48 32c-11.7 7.8-17 22.4-12.9 35.9S17.9 512 32 512H480c14.1 0 26.5-9.2 30.6-22.7s-1.1-28.1-12.9-35.9l-48-32c-.6-.4-1.2-.7-1.8-1.1V224H384V416H344V224H280V416H232V224H168V416H128V224zm128-96c-17.7 0-32-14.3-32-32s14.3-32 32-32s32 14.3 32 32s-14.3 32-32 32z"/></svg> EMSE 6035: Marketing Analytics for Design Decisions
### <svg aria-hidden="true" role="img" viewBox="0 0 448 512" style="height:1em;width:0.88em;vertical-align:-0.125em;margin-left:auto;margin-right:auto;font-size:inherit;fill:white;overflow:visible;position:relative;"><path d="M272 304h-96C78.8 304 0 382.8 0 480c0 17.67 14.33 32 32 32h384c17.67 0 32-14.33 32-32C448 382.8 369.2 304 272 304zM48.99 464C56.89 400.9 110.8 352 176 352h96c65.16 0 119.1 48.95 127 112H48.99zM224 256c70.69 0 128-57.31 128-128c0-70.69-57.31-128-128-128S96 57.31 96 128C96 198.7 153.3 256 224 256zM224 48c44.11 0 80 35.89 80 80c0 44.11-35.89 80-80 80S144 172.1 144 128C144 83.89 179.9 48 224 48z"/></svg> John Paul Helveston
### <svg aria-hidden="true" role="img" viewBox="0 0 448 512" style="height:1em;width:0.88em;vertical-align:-0.125em;margin-left:auto;margin-right:auto;font-size:inherit;fill:white;overflow:visible;position:relative;"><path d="M152 64H296V24C296 10.75 306.7 0 320 0C333.3 0 344 10.75 344 24V64H384C419.3 64 448 92.65 448 128V448C448 483.3 419.3 512 384 512H64C28.65 512 0 483.3 0 448V128C0 92.65 28.65 64 64 64H104V24C104 10.75 114.7 0 128 0C141.3 0 152 10.75 152 24V64zM48 448C48 456.8 55.16 464 64 464H384C392.8 464 400 456.8 400 448V192H48V448z"/></svg> November 16, 2022

]

---

# Houskeeping items

- **Final presentations** will be on 12/14 during normal class hours. You can pre-record a video presentation if you want, or you can do it live.

- **Final reports** (due 12/11) will also be an html page report.

- I am planning on posting all reports (without grades) to the course site as a showcase for future students - **please DM me if you would NOT like your report posted**. ([examples](https://madd.seas.gwu.edu/showcase-2021-fall.html) from last year)

---

# Quiz 5 - last one!

### Link is in the #class channel

]

]

---

# Week 12: .fancy[Heterogeneity]

### 1. Mixed logit (unobserved heterogeneity)
### 2. Sub-group modeling (observed heterogeneity)

---

# Week 12: .fancy[Heterogeneity]

### 1. .orange[Mixed logit (unobserved heterogeneity)]
### 2. Sub-group modeling (observed heterogeneity)

---

background-color: #EEEDEE
class: center

# Two ways of modeling heterogeneity

]

]

---

background-color: #EEEDEE

## .center[Mixed logit]

### Preference parameters follow a distribution<br>**across sample population**

]

## `$$\tilde{u}_j = \beta_1 x_j + \varepsilon_j$$`

## `$$\beta_1 \sim \mathrm{N} (\mu_1, \sigma_1)$$`

Parameter | Estimate | Standard Error 
----------|----------|-----------------
`$\mu_1$`   | 0.1      | 0.01
`$\sigma_1$` | 0.1      | 0.01

]

---

## Which distribution should I use?

**Normal distribution**

When preferences can be positive or negative

e.g. `brand = "n"`

]

**Log-normal distribution**

When preferences should be strictly positive

e.g. `-1*price = "ln"`

]

**Fixed parameter**

When preferences appear to be homogeneous

(e.g. `$\sigma$` is very small)

]

---

### Mixed logits are not equivalent in Preference vs. WTP space

### Preference space

### `$$\tilde{u}_j = \alpha p_j + \beta x_j + \varepsilon_j$$`

### `$$\alpha \sim \ln\mathrm{N} (\mu_1, \sigma_1)$$`

### `$$\beta \sim \mathrm{N} (\mu_2, \sigma_2)$$`

]

---

### Mixed logits are not equivalent in Preference vs. WTP space

### Preference space

### `$$\tilde{u}_j = \alpha p_j + \beta x_j + \varepsilon_j$$`

### `$$\alpha \sim \ln\mathrm{N} (\mu_1, \sigma_1)$$`

### `$$\beta \sim \mathrm{N} (\mu_2, \sigma_2)$$`

### `$$\omega = \frac{\beta}{-\alpha} = \frac{\mathrm{N} (\mu_2, \sigma_2)}{- \ln\mathrm{N} (\mu_1, \sigma_1)}$$`

]

### WTP space

### `$$\tilde{u}_j = \lambda(\omega_1 x_j - p_j) + \varepsilon_j$$`

### `$$\omega_1 \sim \mathrm{N} (\mu_1, \sigma_1)$$`

]

---

# Practice Question 3

a) Use the `logitr` package to estimate the following homogeneous model:

$$
\tilde{u}_j = \beta_1 x_j^{\mathrm{price}} + \beta_2 \delta_j^{\mathrm{feat}} + \beta_3 \delta_j^{\mathrm{dannon}} + \beta_4 \delta_j^{\mathrm{hiland}} + 
\beta_5 \delta_j^{\mathrm{weight}} + \varepsilon_j
$$

where the three `$\delta$` coefficients are dummy variables for Dannon, Hiland, and Weight Watchers brands (Yoplait is the reference level).

b) Use the `logitr` package to estimate the same model but with the following mixing distributions:

- `$\beta_1 \sim \mathrm{N} (\mu_1, \sigma_1)$`
- `$\beta_2 \sim \mathrm{N} (\mu_2, \sigma_2)$`

---

# .center[Estimating mixed logit models with `logitr`]

<br>

## 1. Open `logitr-cars`

## 2. Open `code/8.1-model-mxl.R`

]

---

## Your Turn

As a team, re-estimate the main model you used in your pilot analysis report, but now using a mixed logit model.

Carefully consider which distributions to use (i.e., normal or log-normal) for different variables.

]]

---

# Week 12: .fancy[Heterogeneity]

### 1. Mixed logit (unobserved heterogeneity)
### 2. .orange[Sub-group modeling (observed heterogeneity)]

---

# Two ways of modeling heterogeneity

]

]

---

### .center[Use interactions to model preferences for multiple groups]

### Homogenous model:

### `$\tilde{u}_j = \beta_1 x_j + \varepsilon_j$`

### Two groups: A & B

### `$\tilde{u}_j = \beta_1 x_j + \beta_2 x_j \delta^\mathrm{B} + \varepsilon_j$`

### `$\quad = (\beta_1 + \beta_2 \delta^\mathrm{B}) x_j + \varepsilon_j$`

]

Par.| Meaning 
----|--------
`$\beta_1$` | Effect of `$x_j$` for group A
`$\beta_2$` | _Difference_ in effect of `$x_j$` between groups

]

---

# What's the difference?

## Separate models <svg aria-hidden="true" role="img" viewBox="0 0 512 512" style="height:1em;width:1em;vertical-align:-0.125em;margin-left:auto;margin-right:auto;font-size:inherit;fill:red;overflow:visible;position:relative;"><path d="M175 175C184.4 165.7 199.6 165.7 208.1 175L255.1 222.1L303 175C312.4 165.7 327.6 165.7 336.1 175C346.3 184.4 346.3 199.6 336.1 208.1L289.9 255.1L336.1 303C346.3 312.4 346.3 327.6 336.1 336.1C327.6 346.3 312.4 346.3 303 336.1L255.1 289.9L208.1 336.1C199.6 346.3 184.4 346.3 175 336.1C165.7 327.6 165.7 312.4 175 303L222.1 255.1L175 208.1C165.7 199.6 165.7 184.4 175 175V175zM512 256C512 397.4 397.4 512 256 512C114.6 512 0 397.4 0 256C0 114.6 114.6 0 256 0C397.4 0 512 114.6 512 256zM256 48C141.1 48 48 141.1 48 256C48 370.9 141.1 464 256 464C370.9 464 464 370.9 464 256C464 141.1 370.9 48 256 48z"/></svg>

$$
\tilde{u}_j^\mathrm{A} = \beta_1^\mathrm{A} x_j + \varepsilon_j^\mathrm{A}
$$

$$
\tilde{u}_j^\mathrm{B} = \beta_1^\mathrm{B} x_j + \varepsilon_j^\mathrm{B}
$$

]

## Single model <svg aria-hidden="true" role="img" viewBox="0 0 512 512" style="height:1em;width:1em;vertical-align:-0.125em;margin-left:auto;margin-right:auto;font-size:inherit;fill:green;overflow:visible;position:relative;"><path d="M470.6 105.4c12.5 12.5 12.5 32.8 0 45.3l-256 256c-12.5 12.5-32.8 12.5-45.3 0l-128-128c-12.5-12.5-12.5-32.8 0-45.3s32.8-12.5 45.3 0L192 338.7 425.4 105.4c12.5-12.5 32.8-12.5 45.3 0z"/></svg>

`$$\tilde{u}_j = \beta_1 x_j + \beta_2 x_j \delta^\mathrm{B} + \varepsilon_j$$`

]

---

# .center[Accounting for scale differences]

## .center[Separate models <svg aria-hidden="true" role="img" viewBox="0 0 512 512" style="height:1em;width:1em;vertical-align:-0.125em;margin-left:auto;margin-right:auto;font-size:inherit;fill:red;overflow:visible;position:relative;"><path d="M175 175C184.4 165.7 199.6 165.7 208.1 175L255.1 222.1L303 175C312.4 165.7 327.6 165.7 336.1 175C346.3 184.4 346.3 199.6 336.1 208.1L289.9 255.1L336.1 303C346.3 312.4 346.3 327.6 336.1 336.1C327.6 346.3 312.4 346.3 303 336.1L255.1 289.9L208.1 336.1C199.6 346.3 184.4 346.3 175 336.1C165.7 327.6 165.7 312.4 175 303L222.1 255.1L175 208.1C165.7 199.6 165.7 184.4 175 175V175zM512 256C512 397.4 397.4 512 256 512C114.6 512 0 397.4 0 256C0 114.6 114.6 0 256 0C397.4 0 512 114.6 512 256zM256 48C141.1 48 48 141.1 48 256C48 370.9 141.1 464 256 464C370.9 464 464 370.9 464 256C464 141.1 370.9 48 256 48z"/></svg>]

$$
\tilde{u}_j^\mathrm{A} = \alpha^\mathrm{A} p_j + \beta_1^\mathrm{A} x_j + \varepsilon_j^\mathrm{A}
$$

$$
\tilde{u}_j^\mathrm{B} = \alpha^\mathrm{B} p_j + \beta_1^\mathrm{B} x_j + \varepsilon_j^\mathrm{B}
$$

Imagine you got the following results

- `$\hat{\alpha}^\mathrm{A}$` = 100
- `$\hat{\beta}^\mathrm{A}$` = 200
- `$\hat{\alpha}^\mathrm{B}$` = 1
- `$\hat{\beta}^\mathrm{B}$` = 2

]

## .center[Single model <svg aria-hidden="true" role="img" viewBox="0 0 512 512" style="height:1em;width:1em;vertical-align:-0.125em;margin-left:auto;margin-right:auto;font-size:inherit;fill:green;overflow:visible;position:relative;"><path d="M470.6 105.4c12.5 12.5 12.5 32.8 0 45.3l-256 256c-12.5 12.5-32.8 12.5-45.3 0l-128-128c-12.5-12.5-12.5-32.8 0-45.3s32.8-12.5 45.3 0L192 338.7 425.4 105.4c12.5-12.5 32.8-12.5 45.3 0z"/></svg>]

`$$\tilde{u}_j = \alpha_1 p_j + \alpha_2 p_j \delta^\mathrm{B} + \beta_1 x_j + \beta_2 x_j \delta^\mathrm{B} + \varepsilon_j$$`

`$$\quad = (\alpha_1 + \alpha_2 \delta^\mathrm{B}) p_j + (\beta_1 + \beta_2 \delta^\mathrm{B}) x_j + \varepsilon_j$$`

]

---

# .center[Accounting for scale differences]

## .center[Preference Space <svg aria-hidden="true" role="img" viewBox="0 0 512 512" style="height:1em;width:1em;vertical-align:-0.125em;margin-left:auto;margin-right:auto;font-size:inherit;fill:red;overflow:visible;position:relative;"><path d="M175 175C184.4 165.7 199.6 165.7 208.1 175L255.1 222.1L303 175C312.4 165.7 327.6 165.7 336.1 175C346.3 184.4 346.3 199.6 336.1 208.1L289.9 255.1L336.1 303C346.3 312.4 346.3 327.6 336.1 336.1C327.6 346.3 312.4 346.3 303 336.1L255.1 289.9L208.1 336.1C199.6 346.3 184.4 346.3 175 336.1C165.7 327.6 165.7 312.4 175 303L222.1 255.1L175 208.1C165.7 199.6 165.7 184.4 175 175V175zM512 256C512 397.4 397.4 512 256 512C114.6 512 0 397.4 0 256C0 114.6 114.6 0 256 0C397.4 0 512 114.6 512 256zM256 48C141.1 48 48 141.1 48 256C48 370.9 141.1 464 256 464C370.9 464 464 370.9 464 256C464 141.1 370.9 48 256 48z"/></svg>]

$$
\tilde{u}_j^\mathrm{A} = \alpha^\mathrm{A} p_j + \beta_1^\mathrm{A} x_j + \varepsilon_j^\mathrm{A}
$$

$$
\tilde{u}_j^\mathrm{B} = \alpha^\mathrm{B} p_j + \beta_1^\mathrm{B} x_j + \varepsilon_j^\mathrm{B}
$$

Imagine you got the following results

- `$\hat{\alpha}^\mathrm{A}$` = 100
- `$\hat{\beta}^\mathrm{A}$` = 200
- `$\hat{\alpha}^\mathrm{B}$` = 1
- `$\hat{\beta}^\mathrm{B}$` = 2

]

## .center[WTP Space <svg aria-hidden="true" role="img" viewBox="0 0 512 512" style="height:1em;width:1em;vertical-align:-0.125em;margin-left:auto;margin-right:auto;font-size:inherit;fill:green;overflow:visible;position:relative;"><path d="M470.6 105.4c12.5 12.5 12.5 32.8 0 45.3l-256 256c-12.5 12.5-32.8 12.5-45.3 0l-128-128c-12.5-12.5-12.5-32.8 0-45.3s32.8-12.5 45.3 0L192 338.7 425.4 105.4c12.5-12.5 32.8-12.5 45.3 0z"/></svg>]

$$
\tilde{u}_j^\mathrm{A} = \lambda^\mathrm{A}(\omega_1^\mathrm{A} x_j - p) + \varepsilon_j^\mathrm{A}
$$

$$
\tilde{u}_j^\mathrm{B} = \lambda^\mathrm{B}(\omega_1^\mathrm{B} x_j - p) + \varepsilon_j^\mathrm{B}
$$

$$
\omega = \frac{\beta}{- \alpha}
$$

- `$\hat{\omega}^\mathrm{A} = 200 / (-100) = -2$`
- `$\hat{\omega}^\mathrm{B} = 2 / (-1) = -2$`

]

---

# Practice Question 1

Suppose we estimate the following utility model describing preferences for cars:

$$
\tilde{u}_j = \beta_1 x_j^{\mathrm{price}} + \beta_2 x_j^{\mathrm{mpg}} + \beta_3 x_j^{\mathrm{elec}} + \varepsilon_j
$$

a) Using interactions, write out a model that accounts for differences in the effects of `$x_j^{\mathrm{price}}$`, `$x_j^{\mathrm{mpg}}$`, and `$x_j^{\mathrm{elec}}$` between two groups: A and B.

b) Write out the effects of `$x_j^{\mathrm{price}}$`, `$x_j^{\mathrm{mpg}}$`, and `$x_j^{\mathrm{elec}}$` for each group.

---

# Practice Question 2

Suppose we estimate the following utility model describing preferences for chocolate bars between two groups: A & B

$$
\tilde{u}_j = \beta_1 x_j^{\mathrm{price}} + \beta_2 x_j^{\mathrm{caco}} + \beta_3 x_j^{\mathrm{price}}\delta_j^{\mathrm{B}} + \beta_4 x_j^{\mathrm{cacao}}\delta_j^{\mathrm{B}} + \varepsilon_j
$$

The estimated model produces the following coefficients and hessian:

`$\beta$` = [-0.7, 0.1, 0.2, 0.8]

$$
H =
`\begin{bmatrix}
-6000 & 50 & 60 & 70
\\
50 & -700 & 50 & 100
\\
60 & 50 & -300 & 20
\\
70 & 100 & 20 & -6000
\end{bmatrix}`
$$

]

a) Use the `mvrnorm()` function from the `MASS` library to generate 10,000 draws of the model coefficients.

b) Use the draws to compute the mean WTP and 95% confidence intervals of the effects of `$x_j^{\mathrm{price}}$` and `$x_j^{\mathrm{cacao}}$` for each group (A & B).

]

---

# .center[Estimating mixed logit models with `logitr`]

<br>

## 1. Open `logitr-cars`

## 2. Open `code/8.2-model-mnl-groups.R`

---

## Your Turn

### Do this individually, and compare with your teammates:

- Examine the demographic and other variables in your pilot data and specify a model that estimates differences between different groups.
- Write code to estimate that model (or multiple models, e.g. WTP space models).
- Compute and compare WTP across the different groups.

]]