CS 505 -- Natural Language Processing

Wayne Snyder

Associate Professor of Computer Science

Cell: 617 966 (2^10+41) Email: waysnyder@gmail.com

www.cs.bu.edu/fac/snyder/cs505/

 

Prerequisites

CS 131, CS 132, CS 237 or equivalent (discuss with me). It is also expected that you have experience in Python programming equivalent to CS 111. Note that, contrary to the Bulletin description, CS 365 is not required this semester.

Description

Natural language processing (NLP) is a field of Artificial Intelligence which aims to equip computers with the ability to process natural (human) language. The course will explore modern quantitative techniques for the automatic analysis of natural language data using large corpora and statistical, machine learning, and deep learning models. Although techniques from linguistics (e.g., phonetics, morphology, grammars, semantics) are clearly relevant to this subject, we do not assume any prior background in linguistics, and we shall focus on topics which do not involve a significant linguistics component. The emphasis will be on textual corpora in English, but I hope to present an overview of speech processing as the final topic in the course.

Course Materials and Handouts

Assignments

Tests

Late Policy

Homeworks are due at midnight on Sunday in Gradescope. You can submit up to 24 hours late for a 10% penalty. Each of these deadllines has a 6-hour "grace period" so you may submit up to Monday morning 6am for full credit, and Tuesday morning 6am with the late penalty.

There will be no extensions to individuals except for "acts of God" (Covid diagnosis, death in the family etc. -- your laptop breaking is not an act of God, nor is a job interview). I let the grading process run as it will, and I make decisions about exceptions to policy ONLY at the end of term. So my apologies in advance when you ask me for an extension in a moment of crisis, I will unfortunately have to refuse you and remind you that the lowest homework is dropped for *exactly* this reason.

Grades

These percentages are tentative and may be changed at my discretion at any time. Class participation, coming to office hours and wanting to pursue the material beyond the scope of the lecture, Piazza posts with interesting links about the course material, etc. are wonderful and much appreciated, and surely will help your performance in the class. 

Miscellaneous

Collaboration and Academic Conduct