6.835 Intelligent Multimodal Interfaces
Spring 2013
Instructors: Randall Davis, Jeremy Kenneth Scott
TAs: Orit Giguzinsky, Yale Song
Lecture: TR11-12.30 (1-135)
Information:
Prerequisites:
Some serious programming experience is important. You will need
to be able to use Matlab, but at a level that you can probably
learn during the course.
This course is distinct in subject matter from 6.831 (User
Interface Design and Implementation) and is an appropriate
follow-on.
Units: 3-0-9
Open to and appropriate for advanced undergraduates and graduate
students.
This course will cover the design, implementation, and evaluation
of intelligent and multimodal user interfaces. It will
involve a combination of papers from the original literature and
hands-on exercises to familiarize students with the basic
technologies for handling speech, vision, and pen-based
interaction, and enable them to explore architectures for combining
modalities in a variety of ways.
Each lecture is accompanied by substantial readings from the original
literature. Students will have the opportunity to implement
example interfaces in a series of course mini-projects, as well as
a final project of their own design.
Topics will include:
• Gaze tracking and gaze-aware interfaces
• Gesture recognition
• Mobile image-based interfaces
• Multimodal conversational models
• Audiovisual speech recognition
• Pen-based interaction
• Multimodal fusion and adaptation algorithms
• Methods for combining channels and/or mutual disambiguation