Building POS Tagger
Textbooks
1. Speech and Language Processing - An Introduction to Natural Language Processing, Computational Linguistics and Speech Recognition
BY: Daniel Jurafsky and James H. Martin - Chapter 8
Comprehensive coverage of Part-of-Speech tagging algorithms and Hidden Markov Models.
2. Foundations of Statistical Natural Language Processing
BY: Christopher D. Manning and Hinrich Schütze - Chapter 10
Statistical methods in NLP with detailed coverage of POS tagging techniques.
3. Natural Language Processing with Python
BY: Steven Bird, Ewan Klein and Edward Loper - Chapter 5
Practical implementation of POS taggers using NLTK library.
4. An Introduction to Information Retrieval
BY: Christopher Manning, Prabhakar Raghavan, and Hinrich Schütze
Comprehensive coverage of text processing including POS tagging applications.
Video Lectures and Online Courses
NPTEL Courses:
- Natural Language Processing - IIT Bombay
- Machine Learning for Natural Language Processing - IIT Kharagpur
Stanford University:
- CS224N: Natural Language Processing - Stanford
- CS229: Machine Learning - Stanford (for HMM and CRF fundamentals)
YouTube EDU Channels:
- Dan Jurafsky NLP Lectures - POS tagging episodes
- Natural Language Processing - University of Michigan
Research Papers and Articles
1. A Tutorial on Hidden Markov Models and Selected Applications
BY: Lawrence R. Rabiner
Proceedings of the IEEE, Vol. 77, No. 2, 1989
2. Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data
BY: John Lafferty, Andrew McCallum, Fernando Pereira
ICML 2001
3. TnT: A Statistical Part-of-Speech Tagger
BY: Thorsten Brants
ANLP 2000
Online Resources and Tools
Academic Resources:
- ACL Anthology - Research papers in computational linguistics
- Universal Dependencies - Multi-language POS annotation standards
- Penn Treebank - Standard POS tagging dataset
Interactive Tools:
- spaCy Online Demo - Interactive POS tagging visualization
- Stanford CoreNLP - Online POS tagger demo
- NLTK POS Tagger - Python library for POS tagging
Language Learning Resources:
- Hindi POS Tagset - Hindi linguistic resources
- English Grammar Guide - Parts of speech reference
Additional Reading
For Advanced Study:
- "Natural Language Understanding" by James Allen - Comprehensive NLP textbook
- "Handbook of Natural Language Processing" edited by Nitin Indurkhya and Fred J. Damerau
- "Introduction to Modern Information Retrieval" by G.G. Chowdhury
For Indian Languages:
- "Computational Processing of the Hindi Language" by Akshar Bharati
- "Natural Language Processing - A Paninian Perspective" by Akshar Bharti, Vineet Chaitanya and Rajeev Sangal
- "Hindi Computational Linguistics" by various authors from LTRC, IIIT Hyderabad
Practice Resources
Corpora and Datasets:
- Hindi WordNet - Hindi semantic network
- English WordNet - English semantic database
- CoNLL Shared Task Data - Standard evaluation datasets
Assessment and Practice:
- NLTK Book Exercises - Hands-on POS tagging exercises
- spaCy Tutorials - Practical implementation guides
- Kaggle NLP Competitions - Real-world POS tagging challenges