Building POS Tagger

Textbooks

1. Speech and Language Processing - An Introduction to Natural Language Processing, Computational Linguistics and Speech Recognition
BY: Daniel Jurafsky and James H. Martin - Chapter 8
Comprehensive coverage of Part-of-Speech tagging algorithms and Hidden Markov Models.

2. Foundations of Statistical Natural Language Processing
BY: Christopher D. Manning and Hinrich Schütze - Chapter 10
Statistical methods in NLP with detailed coverage of POS tagging techniques.

3. Natural Language Processing with Python
BY: Steven Bird, Ewan Klein and Edward Loper - Chapter 5
Practical implementation of POS taggers using the NLTK library.

4. Introduction to Information Retrieval
BY: Christopher Manning, Prabhakar Raghavan, and Hinrich Schütze
Comprehensive coverage of text processing including POS tagging applications.

Video Lectures and Online Courses

NPTEL Courses:

Stanford University:

YouTube EDU Channels:

Research Papers and Articles

1. A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition
BY: Lawrence R. Rabiner
Proceedings of the IEEE, Vol. 77, No. 2, 1989

2. Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data
BY: John Lafferty, Andrew McCallum, Fernando Pereira
ICML 2001

3. TnT: A Statistical Part-of-Speech Tagger
BY: Thorsten Brants
ANLP 2000

Online Resources and Tools

Academic Resources:

Interactive Tools:

Language Learning Resources:

Additional Reading

For Advanced Study:

  • "Natural Language Understanding" by James Allen - Comprehensive NLP textbook
  • "Handbook of Natural Language Processing" edited by Nitin Indurkhya and Fred J. Damerau
  • "Introduction to Modern Information Retrieval" by G.G. Chowdhury

For Indian Languages:

  • "Computational Processing of the Hindi Language" by Akshar Bharati
  • "Natural Language Processing - A Paninian Perspective" by Akshar Bharati, Vineet Chaitanya and Rajeev Sangal
  • "Hindi Computational Linguistics" by various authors from LTRC, IIIT Hyderabad

Practice Resources

Corpora and Datasets:

Assessment and Practice: