Professor |
Lingpeng Kong
|
Teaching assistant |
Yiheng Xu
|
Syllabus |
Natural language processing (NLP) is the study of human language from a
computational perspective. The course will be focusing on machine learning
and corpus-based methods and algorithms. We will cover syntactic, semantic
and discourse processing models. We will describe the use of these methods
and models in applications including syntactic parsing, information
extraction, statistical machine translation, dialogue systems, and
summarization. This course starts with language models (LMs), which are both
front and center in natural language processing (NLP), and then introduces
key machine learning (ML) ideas that students should grasp (e.g.
feature-based models, log-linear models and then the neural models). We will
land on modern generic meaning representation methods (e.g. BERT/GPT-3) and
the idea of pretraining / finetuning. |
Introduction by Professor |
Natural language processing (NLP) is the
study of human language from a computational perspective. The course will
investigate modern NLP algorithms from the machine learning perspective. We
will cover syntactic, semantic and other useful models for language. We will
describe the use of these methods and models in applications including
syntactic parsing, information extraction, statistical machine translation,
dialogue systems, and summarization. This course starts with language models
(LMs), which are both front and center in natural language processing (NLP),
and then introduces key machine learning (ML) ideas that students should
grasp (e.g., feature-based models, log-linear models and then the neural
models). We will land on modern generic meaning representation methods
(e.g., BERT/GPT-3) and the idea of pretraining / finetuning / prompt-based
learning. |
Learning Outcomes |
|
Pre-requisites |
- |
Compatibility |
Nil |
Prior knowledge expected |
Basic knowledge about Machine Learning,
Probability, Statistics, and Programming |
Topics covered |
|
Assessment |
|
Course materials |
Prescribed textbook:
- Jurafsky, Daniel, and James H. Martin. "Speech and Language
Processing."
|
Session dates |
|
Add/drop |
1 September, 2022 - 19 September, 2022 |
Maximum class size |
150 |