This article is an engaging read. I tend to bounce back and forth between natural language and GUI or CLI-based modalities – of course, in the last few years I have leaned towards the natural language one due to LLM usage. 1
The article argues (somewhat reasonably) that more structured, inline GUI elements are preferable and reduce interaction latency. I do tend to get frustrated sometimes when I have to copy-paste text in an LLM web interface, or am only given “pick one out of a numbered list” as the option. An additional modality I would like to see in LLMs is a way to refer back to chunks of text or output – something like pilcrows
-
I remember constructing natural-language like interfaces of my own the hard way, and giving up after about 10 or so scripts. I don’t think I could have imagined then that I could use trivially use natural language commands in under a decade! ↩︎