Modeling the Relationships Between Tasks, Descriptions and Features


Users often describe what they want to accomplish with an application in language that is very different from the application's domain language. To address this gap between system and human language, we propose modeling an application's domain language by mining a large corpus of Web documents about the application using deep learning techniques. A high-dimensional vector space representation can model the relationships between user tasks, system commands, and natural language descriptions, and it supports mapping operations between them. We demonstrate the feasibility of this approach with a system, CommandSpace, for the popular photo-editing application Adobe Photoshop.


Eytan Adar, Mira Dontcheva, and Gierad Laput. 2014. CommandSpace: modeling the relationships between tasks, descriptions and features. In Proceedings of the 27th annual ACM symposium on User interface software and technology (UIST '14). ACM, New York, NY, USA, 167-176. DOI=10.1145/2642918.2647395


  • Photoshop.dat.bin.gz (46 MB). This is a version of our vector dataset (vector length 200). It should be suitable for most applications, but you'll need word2vec (or another implementation) to use it.
  • Indexedcommands.dat has a list of the indexed system (Photoshop) features. These will appear in the vector file above. Note that some features are also standard terms (e.g., "nudge").
  • Commonwords.tar.gz has pairs of commands and the most common verbs, nouns, etc. between them. There is some noise due to multiple word meanings. We selected the most common form of the word based on POS tagging in the dataset.