We present a novel approach to dialogue state tracking and referring
expression resolution tasks. Successful contextual understanding of multi-turn
spoken dialogues requires resolving referring expressions across turns and
tracking the entities relevant to the conversation across turns