Go to Main Content

Georgetown University


Detailed Course Information


Fall 2017
May 24, 2019
Transparent Image
Information Select the desired Level or Schedule Type to find available classes for the course.

LING 264 - Multilingual & Parallel Corpor
Parallel and multilingual corpora are collections of natural language data in several languages, constructed using principled design criteria, which contain either aligned translations of the same texts, or distinct but comparable texts. As such, they are vital resources for comparative linguistics, translation studies, multilingual lexicography and machine translation. This course sets out to explore the theoretical problems raised by translated language, as well as practical issues and empirical patterns found in multilingual data. This includes questions such as exploring similarities and differences between closely related languages, such as different Romance or Slavic languages, or very distant ones, such as English and Japanese. The focus of the course is on the study of actual examples of parallel and comparable corpora using computational methods. Students will have access to search engines indexing aligned translations and will learn some of the basics of building a parallel corpus. The course introduces some fundamental computational methodology, but does not have a programming requirement. 

3.000 Credit hours
3.000 Lecture hours

Levels: Undergraduate
Schedule Types: Lecture, Seminar

Linguistics Department

Course Attributes:
Mean Grade is Calculated

Return to Previous New Search
Transparent Image
Skip to top of page