Back close

Course Detail

Course Name Natural Language Processing
Course Code 23CSE471
Program B. Tech. in Computer Science and Engineering (CSE)
Credits 3
Campus Amritapuri ,Coimbatore,Bengaluru, Amaravati, Chennai

Syllabus

PROFESSIONAL ELECTIVES

Electives in Artificial Intelligence

Unit I

Introduction- History of NLP, Study of Human languages, ambiguity, Phases in natural language processing, applications. Textual sources and Formats. Linguistics resources- Introduction to the corpus, elements in the balanced corpus, (examples -TreeBank, PropBank, WordNet, VerbNet, etc.) Word Level analysis – Regular expressions, Morphological parsing, Types of Morphemes. Tokenization, N-grams, Stemming, Lemmatization, Spell checking. Management of linguistic data with NLTK.

Unit II

Syntactic Analysis – Lexeme, phonemes, phrases and idioms, word order, agreement, tense, aspect and mood and agreement, Context Free Grammar, and spoken language syntax. Parsing- Unification, probabilistic parsing. Part of Speech tagging- Rule-based POS tagging, Stochastic POS tagging, Transformation-based tagging (TBL), Handling of unknown words, named entities, and multi-word expressions.

Semantics Analysis- Meaning representation, semantic analysis, lexical semantics, WordNet -WordNet similarity measures., Synsets and Hypernyms, Word Sense Disambiguation- Selectional restriction, machine learning approaches, dictionary-based approaches.

Unit III

Discourse- Reference resolution, constraints on co-reference, an algorithm for pronoun resolution, text coherence, discourse structure. Information Retrieval-Types of an information retrieval model, Boolean Model, Vector space model-Word2Vec, BERT, Improving user queries. Machine Translation – EM algorithm – Discriminative learning – Deep representation learning – Generative learning.

Applications of NLP- Machine translation, Document Summarization, sentiment Analysis, ChatGPT4

Objectives and Outcomes

Course Objectives

  • This course is devoted to the study of phonological, morphological, and syntactic processing. These areas will be approached from both a linguistic and an algorithmic perspective.
  • The course will focus on the computational properties of natural languages and the algorithms used to process them, as well as the match between grammar formalisms and the linguistic data that needs to be covered.

Course Outcomes

CO1: Understand the models, methods, and algorithms of statistical Natural Language Processing (NLP) for

common NLP tasks.

CO2: Understand mathematical and statistical models for NLP.

CO3: Understand linguistic phenomena and linguistic features relevant to each NLP task.

CO4: Develop probabilistic models for NLP.

CO5: Apply learning models to NLP tasks such as document summarization, machine translation, sentiment

analysis and spell checking

CO-PO Mapping

PO/PSO PO1 PO2 PO3 PO4 PO5 PO6 PO7 PO8

</td rowspan=”2″>

PO9 PO10 PO11 PO12 PSO1 PSO2
CO
CO1 3 2 2 3 3 2
CO2 3 2 3 2 3 2
CO3 3 2 3 2 3 2
CO4 3 1 2 2 3 3 2
CO5 3 1 2 2 3 3 2

Evaluation Pattern

Evaluation Pattern: 70:30

Assessment Internal End Semester
Midterm 20
Continuous Assessment – Theory (*CAT) 10
Continuous Assessment – Lab (*CAL) 40
**End Semester 30 (50 Marks; 2 hours exam)

*CAT – Can be Quizzes, Assignments, and Reports

*CAL – Can be Lab Assessments, Project, and Report

**End Semester can be theory examination/ lab-based examination/ project presentation

Text Books / References

Textbook(s)

Martin JH, Jurafsky D. “Speech and language processing: An introduction to natural language processing, computational linguistics, and speech recognition”. Pearson Publication, Second Edition; 2013.

Reference(s)

James A. “Natural language Understanding”, Second Edition, Pearson Education; 2002.

Bharati A., Sangal R., Chaitanya V.“Natural language processing: a Paninian perspective”, PHI; 2000.

Tiwary U S, Siddiqui T. “Natural language processing and information retrieval”. Oxford University Press, Inc.; 2008.

Steven Bird, Ewan Klein, Edward Loper, “Natural Language Processing with Python – Analyzing Text with the Natural Language Toolkit” (O’Reilly 2009, website 2018).

DISCLAIMER: The appearance of external links on this web site does not constitute endorsement by the School of Biotechnology/Amrita Vishwa Vidyapeetham or the information, products or services contained therein. For other than authorized activities, the Amrita Vishwa Vidyapeetham does not exercise any editorial control over the information you may find at these locations. These links are provided consistent with the stated purpose of this web site.

Admissions Apply Now