Multimodal Input
Combining Voice, Gesture, Gaze, and Other Signals.
Multimodal Input means accepting and blending two or more input types, such as speech, touch, gaze, or sensor data. By fusing these signals a system gains a clearer picture of user intent and can respond more accurately in real time.