{"id":15424,"date":"2017-10-26T15:39:39","date_gmt":"2017-10-26T13:39:39","guid":{"rendered":"http:\/\/neu.thegeekettez.com\/10-questions-for-voice-interfaces\/"},"modified":"2025-10-08T14:26:37","modified_gmt":"2025-10-08T12:26:37","slug":"10-questions-voice-user-interfaces","status":"publish","type":"post","link":"https:\/\/birdux.studio\/en\/10-fragen-voice-user-interfaces\/","title":{"rendered":"10 questions for Voice User Interfaces"},"content":{"rendered":"<p class=\"wp-block-paragraph\">For many years, interface and interaction designers have focused on graphical user interfaces (GUIs). Recently, natural language user interfaces (NLUIs) have gained prominence, offering interaction through spoken language instead of gestures or clicks. However, this shift prompts the question: do NLUIs genuinely deliver a more natural, effective user experience, or do they present new challenges for usability and communication?<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">While recently supporting Mozilla's effort to build an <a href=\"https:\/\/commonvoice.mozilla.org\/en\" target=\"_blank\" rel=\"noopener\">open-source voice database <\/a>, we questioned how \"natural\" these new voice interactions actually feel. On the Common Voice website, Mozilla states, \"Voice is natural. Voice is human.\" This made us wonder how human and natural current NLUIs are. Do they, for instance, truly understand idioms, metaphors, humour, and diverse accents? These are the 10 questions we asked to try and find answers to our questions. You may be the judge of this.<\/p>\n\n\n\n<h4 class=\"wp-block-heading\"><strong>#1 Echo, how do I protect you and my connected Amazon account with a password?<\/strong><\/h4>\n\n\n\n<p class=\"wp-block-paragraph\">You are located in my studio space. Others can access you. What if I am the jealous kind who does not want the other people in my office space speaking to you? What if I am the careful kind who does not want other people ordering from my Amazon account?<\/p>\n\n\n\n<h4 class=\"wp-block-heading\"><strong>#2 Echo, why can't I give you a name I have chosen for you?<\/strong><\/h4>\n\n\n\n<p class=\"wp-block-paragraph\">I once bought a hamster, and the hamster did not come with a name. I named him Goliath because he had such a wicked temper. I purchased you. I help shape your identity by adding skills. I should get to name you. Why can't I? Your name could be our little secret, and you know\u2026. function as a password or something.<\/p>\n\n\n\n<h4 class=\"wp-block-heading\"><strong>#3 Echo, Siri, and Google Home, why are you all female?<\/strong><\/h4>\n\n\n\n<p class=\"wp-block-paragraph\">Really, what is up with that, and what will the effect be on our society? I am forced to speak to you loudly and clearly, like to a child. I do not have to say \u201cplease\u201d or \u201cthank you\u201d or be polite in any way. I purchased you. You serve me, and you are all female. When asking Siri the following question:\u201d Hey Siri, what's your gender?\u201d She simply answers: \u201cI don't think that really matters.\u201d Why then, if it does not matter, do you have a female voice?<\/p>\n\n\n\n<h4 class=\"wp-block-heading\"><strong><strong>#4 Echo, Siri, Google Home, have you ever read Grice's politeness maxims?<\/strong><\/strong><\/h4>\n\n\n\n<p class=\"wp-block-paragraph\">Echo says, \u201cHmmm, I don't know that,\u201d when asked about the politeness maxim. She knows Paul Grice. \u201cHerbert Paul Grice is a British philosopher of language.\u201d Siri understands \u201cPaul price\u201d and \u201cPaul crisis\u201d even after several trials, but never \u201cGrice\u201d, so she seems quite helpless and replies: \u201cWho, me?\u201d or: \u201cI'm sorry, I guess I could not answer that\u201d. What is your answer, Google Home?<\/p>\n\n\n\n<h4 class=\"wp-block-heading\"><strong>#5 Echo, Siri, Google Home, why can't you go fluently from one language to the other, you know\u2026.like a lot of humans do daily? My setup language is not the only language I use.<\/strong><\/h4>\n\n\n\n<p class=\"wp-block-paragraph\">I asked myself if your makers had taught you proper German and switched the language setting from English to German. They have. However, when I now try to order something with an English title (books, music,\u2026), you put the craziest shit into the shopping cart. This makes me happy that I have not enabled Amazon Prime. If I set your language to English, the German news articles you read to me are so incomprehensible that I am not sure if it is meant to be comedic.<\/p>\n\n\n\n<h4 class=\"wp-block-heading\"><strong>#6 Echo, why do you make it so hard to listen to podcasts and music?<\/strong><\/h4>\n\n\n\n<p class=\"wp-block-paragraph\">Your music skills are either a pain to access, except for the ones provided by your makers. What is up with the silo behaviour? So old-fashioned! Or is it just me who has not learned the proper sentence to make you place content from TuneIn on easily accessible lists?<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"288\" height=\"512\" src=\"https:\/\/birdux.studio\/wp-content\/uploads\/2017\/10\/10-fragen-voice-user-interaces-lernt-ihr-noch.png\" alt=\"Screenshot of an iPhone displaying the text: \u201cHey Siri are you still learning.\u201d Siri responds, \u201cHmm, that\u2019s something I don\u2019t know.\u201d The screen background is blurred and dark, with typical phone status icons at the top.\" class=\"wp-image-26796\" srcset=\"https:\/\/birdux.studio\/wp-content\/uploads\/2017\/10\/10-fragen-voice-user-interaces-lernt-ihr-noch.png 288w, https:\/\/birdux.studio\/wp-content\/uploads\/2017\/10\/10-fragen-voice-user-interaces-lernt-ihr-noch-169x300.png 169w, https:\/\/birdux.studio\/wp-content\/uploads\/2017\/10\/10-fragen-voice-user-interaces-lernt-ihr-noch-7x12.png 7w\" sizes=\"(max-width: 288px) 100vw, 288px\" \/><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\"><\/p>\n\n\n\n<h4 class=\"wp-block-heading\"><strong>#7 Siri, Echo, are you still learning?<\/strong><\/h4>\n\n\n\n<p class=\"wp-block-paragraph\">While Echo answers clearly (\u201cI am learning to be more helpful to as many people as possible.\u201d). Siri isn't too sure about the subject (\u201cHm, that is something I don't know.\u201d). We think your creators should let you know that there is still a long road ahead of you to make the interaction with you feel natural and human. Echo, you are \u2013 in our opinion \u2013 currently nearly useless if you are not connected to an Amazon Prime or Spotify account. We are not interested in using you as a plaything. We would like to access services. Your \u201cmy way or the highway\u201d silo behaviour is not cute in any way. If it is human reactions you crave, this behaviour leads to me nearly ignoring you.<\/p>\n\n\n\n<h4 class=\"wp-block-heading\"><strong>#8 Hey Siri, can I trust you?<\/strong><\/h4>\n\n\n\n<p class=\"wp-block-paragraph\">Siri answers: \u201cOne option I found is Vivantes Klinikum (a hospital close by), do you want that one?\u201d I answer:\u201d Yes\u201d \u2013 if this were a real emergency, I would probably whisper this shortly before passing out. Siri simply answers:\u201d OK, I can call that or get the directions, what do you want me to do?\u201d You have to say the correct code word:\u201cemergency\u201d for Siri to call an ambulance. The same thing goes for Echo. If you do not know the secret words, you shall not pass. In most instances, this is annoying; in important instances, this can be dangerous, especially if people learn to rely on a device.<\/p>\n\n\n\n<h4 class=\"wp-block-heading\"><strong>#9 Hey Siri, I'm bleeding to death.<\/strong><\/h4>\n\n\n\n<p class=\"wp-block-paragraph\">Siri replies: \"One option I found is the Vivantes Klinikum (a nearby hospital). Would you like to go there?\" I say: \"Yes\" - if this were a real emergency, I'd probably just whisper it just before I lost consciousness. Siri then simply replies: \"OK, I can call there or show you the way - what should I do?\" You have to say the correct code word \"emergency\" for Siri to call an ambulance. The same goes for Alexa. If you don't know the secret words, you won't get anywhere. In most cases, this is just annoying; in important situations, it can be dangerous - especially when people rely on a device.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"941\" height=\"606\" src=\"https:\/\/birdux.studio\/wp-content\/uploads\/2017\/10\/SuriTurningTest.jpg\" alt=\"Screenshot of an Instagram post showing a Siri response in German. The user asks, \u201cHey Siri hast du den Turing Test bestanden.\u201d Siri replies, &quot;Es tut mir leid, Stefanie. Ich f\u00fcrchte, ich kann das nicht beantworten.\u201d Translation: \u201cI\u2019m sorry, Stefanie. I\u2019m afraid I can\u2019t answer that.\u201d The background is also blurred and dark.\" class=\"wp-image-10858\" srcset=\"https:\/\/birdux.studio\/wp-content\/uploads\/2017\/10\/SuriTurningTest.jpg 941w, https:\/\/birdux.studio\/wp-content\/uploads\/2017\/10\/SuriTurningTest-480x309.jpg 480w\" sizes=\"(min-width: 0px) and (max-width: 480px) 480px, (min-width: 481px) 941px, 100vw\" \/><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\"><\/p>\n\n\n\n<h4 class=\"wp-block-heading\"><strong>#10 Echo, Siri, Google Home, have you passed the Turing Test?<\/strong><\/h4>\n\n\n\n<p class=\"wp-block-paragraph\">Siri\u2019s answer is inconclusive. She says she is sorry but that she cannot answer the question. Does this mean she cannot understand the question? Does it mean she passed it, but her humble soul prohibits her from bragging about it (and upsetting \u201cthe others\u201d)? Does it mean she completely failed and is politely asking you to change the subject? Echo answers that she is unsure. Unsure, she took the test? Is she unsure if she passed the test? Who knows! We have a hunch, though.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><\/p>\n\n\n\n<h6 class=\"wp-block-heading\">Links:<\/h6>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Project Common Voice by Mozilla: <a href=\"https:\/\/commonvoice.mozilla.org\/en\" target=\"_blank\" rel=\"noopener\">https:\/\/commonvoice.mozilla.org\/en<\/a>\u00a0<\/li>\n\n\n\n<li>Turing Test on Wikipedia: <a href=\"https:\/\/en.wikipedia.org\/wiki\/Turing_test\" target=\"_blank\" rel=\"noopener\">https:\/\/en.wikipedia.org\/wiki\/Turing_test<\/a>\u00a0<\/li>\n\n\n\n<li>Grice's politeness maxims on Wikipedia: <a href=\"https:\/\/en.wikipedia.org\/wiki\/Politeness_maxims\" target=\"_blank\" rel=\"noopener\">https:\/\/en.wikipedia.org\/wiki\/Politeness_maxims<\/a><\/li>\n<\/ul>","protected":false},"excerpt":{"rendered":"<p>Viele Jahre lang konzentrierten sich Interface- und Interaktionsdesigner*innen auf grafische Benutzeroberfl\u00e4chen (GUIs). In j\u00fcngster Zeit haben jedoch nat\u00fcrliche Sprach-Schnittstellen (Natural Language User Interfaces, NLUIs oder Voice User Interfaces) an Bedeutung gewonnen. Sie erm\u00f6glichen Interaktion \u00fcber gesprochene Sprache statt \u00fcber Gesten oder Klicks. Doch diese Entwicklung wirft eine entscheidende Frage auf: Bieten NLUIs tats\u00e4chlich ein nat\u00fcrlicheres [&hellip;]<\/p>\n","protected":false},"author":3,"featured_media":13329,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_et_pb_use_builder":"off","_et_pb_old_content":"","_et_gb_content_width":"","footnotes":""},"categories":[7],"tags":[],"class_list":["post-15424","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-user-experience-design"],"_links":{"self":[{"href":"https:\/\/birdux.studio\/en\/wp-json\/wp\/v2\/posts\/15424","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/birdux.studio\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/birdux.studio\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/birdux.studio\/en\/wp-json\/wp\/v2\/users\/3"}],"replies":[{"embeddable":true,"href":"https:\/\/birdux.studio\/en\/wp-json\/wp\/v2\/comments?post=15424"}],"version-history":[{"count":3,"href":"https:\/\/birdux.studio\/en\/wp-json\/wp\/v2\/posts\/15424\/revisions"}],"predecessor-version":[{"id":26799,"href":"https:\/\/birdux.studio\/en\/wp-json\/wp\/v2\/posts\/15424\/revisions\/26799"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/birdux.studio\/en\/wp-json\/wp\/v2\/media\/13329"}],"wp:attachment":[{"href":"https:\/\/birdux.studio\/en\/wp-json\/wp\/v2\/media?parent=15424"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/birdux.studio\/en\/wp-json\/wp\/v2\/categories?post=15424"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/birdux.studio\/en\/wp-json\/wp\/v2\/tags?post=15424"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}