N-Grams

What is the primary purpose of smoothing in N-Gram models?
Explanation

Explanation

Explanation

Explanation

Explanation

Explanation

Explanation

Explanation

Which of the following is a unigram?
Explanation

Explanation

Explanation

Explanation

Explanation

Explanation

Explanation

Explanation

Which of the following best describes a limitation of N-Gram models?
Explanation

Explanation

Explanation

Explanation

Explanation

Explanation

Explanation

Explanation

What is the main advantage of using trigrams over bigrams?
Explanation

Explanation

Explanation

Explanation

Explanation

Explanation

Explanation

Explanation

A trigram model assigns P(C|A B) = 0.2, P(B|START A) = 0.3, and P(A|START) = 0.4. What is the probability of the sequence START A B C?
Explanation

Explanation

Explanation

Explanation

Explanation

Explanation

Explanation

Explanation

Which of the following is NOT a use case for N-Gram models?
Explanation

Explanation

Explanation

Explanation

Explanation

Explanation

Explanation

Explanation

Which of the following best describes the chain rule in the context of N-Gram models?
Explanation

Explanation

Explanation

Explanation

Explanation

Explanation

Explanation

Explanation

A bigram model assigns P(D|C) = 0.6 and P(C|START) = 0.3. What is the probability of the sequence START C D?
Explanation

Explanation

Explanation

Explanation

Explanation

Explanation

Explanation

Explanation

Which of the following is a reason to use N-Gram models in NLP?
Explanation

Explanation

Explanation

Explanation

Explanation

Explanation

Explanation

Explanation

What is a context window in the context of N-Gram models?
Explanation

Explanation

Explanation

Explanation

Explanation

Explanation

Explanation

Explanation