Monday 10.30-12.00 and Tuesday 10.30-12.00 both room 24.21.01.82
Start: 04.04.2022. Last session: 12.07.2022.
The course will be taught in person, at the university. Further information on the course can be found here on the Moodle course page.
Course description
Parsing is a central task in natural language processing. Its goal is to compute the syntactic structures of sentences. Such a syntactic structure could either be a constituency structure or a dependency structure. The former is in many cases taken to be generated by a context-free grammar (CFG). Consequently, constituency parsing amounts to a) implementing/inducing a context-free grammar and b) using this grammar for parsing. Dependency parsing, in contrast to this, is mostly grammar-less parsing using machine-learning techniques.
In this course, we will mainly concentrate on step b) of CFG-based constituency parsing. We will revise various symbolic parsing algorithms that yield, given a CFG and an input sentence, the set of all parse trees for this sentence. In the second half of the course, we will move on to probabilistic parsing, covering Viterbi parsing and weighted deductive parsing with A* estimates.
For references see the slides of the individual sessions.
A useful tool for trying out different parsing alorithms can be found here.
Schedule and Slides
(under construction. Some of the slides are from previous years.)
19.04.22 Unger’s Parser.
Example of a trace for Unger’s parser.
An implementation of Unger’s Parser (by Simon Petitjean) can be found here.
25.04.22 Top-down Parsing (LL-Parsing).
You can try out top-down parsing with example grammars using the CL Taskbox here (Samya Daleh).
26.04.22 Parsing as Deduction. An example of agenda-based parsing can be found here.
Furthermore, see CL Taskbox for a variety of deduction-based parsing algorithms.
There are weekly exercises for the course. These exercises are obligatory, they have to be handed in via Moodle. The solutions of the exercises will be discussed in the course.
Sowohl für einen BN als auch für eine AP müssen mindestens 9 der 12 Hausaufgabenblätter bearbeitet und sinnvoll gelöst werden. Gruppenarbeit (max. Gruppengröße ist 3) ist erlaubt, bitte alle Namen auf die Abgabe. Eventuell muss auch noch ein Kurzreferat für einen BN gemacht werden.
Für eine AP kann zusätzlich eine individuelle Prüfung vereinbart werden, zum Beispiel eine mündliche Prüfung.
(Für Studierende des BA Computerlinguistik integrativ ist Parsing eine Grundveranstaltung in CL5, hier ist keine AP vorgesehen.)