Pixeltone: A Multimodal Interface for Image Editing

G. Laput, M. Dontcheva, G. Wilensky, W. Chang, A. Agarwala, J. Linder, and E. Adar (CHI 2013)

Abstract

Photo editing can be a challenging task, and it becomes even more difficult on the small, portable screens such as camera phones that are now frequently used to edit images. To address this problem we present PixelTone, a multimodal photo editing interface that combines speech and direct manipulation. We observe existing image editing practices and derive a set of principles that guide our design. In particular, we use natural language for expressing desired changes to an image, and sketching to localize these changes to specific regions. To support the language commonly used in photo-editing we develop a customized natural language interpreter that maps user phrases to specific image processing operations. Finally, we perform a user study that evaluates and demonstrates the effectiveness of our interface.

This work will be presented at the CHI Conference (CHI 2013) in Paris, France

 Download the paper (PDF)

This work was completed during an internship at Adobe Research in San Francisco.

Masters Projects

Pixeltone: A Multimodal Interface for Image Editing

Pixeltone: A Multimodal Interface for Image Editing

Undergraduate Projects

Labs and Fun Stuff