N-Grams Smoothing
Q1. Add-one smoothing works horribly in practice because of giving too much probability mass to unseen n-grams. Prove using an example.
Q2. In Add-δ smoothing, we add a small value 'δ' to the counts instead of one. Apply Add-δ smoothing to the below bigram count table where δ=0.02.
(eos) | John | Read | Fountainhead | Mary | a | Different | Book | She | By | Dickens | |
---|---|---|---|---|---|---|---|---|---|---|---|
(eos) | 0 | 300 | 0 | 0 | 300 | 0 | 0 | 0 | 300 | 0 | 0 |
John | 0 | 0 | 300 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
Read | 0 | 0 | 0 | 300 | 0 | 600 | 0 | 0 | 0 | 0 | 0 |
Fountainhead | 300 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
Mary | 0 | 0 | 300 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
a | 0 | 0 | 0 | 0 | 0 | 0 | 300 | 300 | 0 | 0 | 0 |
Different | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 300 | 0 | 0 | 0 |
Book | 300 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 300 | 0 |
She | 0 | 0 | 0 | 300 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
By | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
Dickens | 300 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
N = 5100 V = 11
Q3. Given S = Dickens read a book, find P(S)
(a) Using unsmoothed probability
(b) Applying Add-One smoothing.
(c) Applying Add-δ smoothing