skip to main content
10.1145/3290605.3300326acmconferencesArticle/Chapter ViewAbstractPublication PageschiConference Proceedingsconference-collections
research-article

StreetWise: Smart Speakers vs Human Help in Public Slum Settings

Published: 02 May 2019 Publication History

Abstract

This paper explores the use of conversational speech question and answer systems in the challenging context of public spaces in slums. A major part of this work is a comparison of the source and speed of the given responses; that is, either machine-powered and instant or human-powered and delayed. We examine these dimensions via a two-stage, multi-sited deployment. We report on a pilot deployment that helped refine the system, and a second deployment involving the installation of nine of each type of system within a large Mumbai slum for a 40-day period, resulting in over 12,000 queries. We present the findings from a detailed analysis and comparison of the two question-answer corpora; discuss how these insights might help improve machine-powered smart speakers; and, highlight the potential benefits of multi-sited public speech installations within slum environments.

Supplementary Material

MP4 File (pn6275.mp4)
Supplemental video

References

[1]
Bruce Balentine. 2007. It's Better to Be a Good Machine Than a Bad Person: Speech Recognition and Other Exotic User Interfaces in the Twilight of the Jetsonian Age. ICMI Press, Annapolis, MD, USA.
[2]
Frank Bentley, Chris Luvogt, Max Silverman, Rushani Wirasinghe, Brooke White, and Danielle Lottridge. 2018. Understanding the LongTerm Use of Smart Speaker Assistants. Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies 2, 3, Article 91 (Sept. 2018), 24 pages.
[3]
Devanuj and Anirudha Joshi. 2013. Technology Adoption by 'Emergent' Users: The User-usage Model. In Proceedings of the 11th Asia Pacific Conference on Computer Human Interaction (APCHI '13). ACM, New York, NY, USA, 28--38.
[4]
Paul Dourish. 2004. What We Talk about When We Talk about Context. Personal and Ubiquitous Computing 8, 1 (Feb. 2004), 19--30.
[5]
Nick Fox. 2018. The Google Assistant is going global. (2018). Retrieved 3rd September 2018 from https://www.blog.google/products/assistant/ google-assistant-going-global/
[6]
Google. 2018. Neighbourly: Ask Local Questions & Get Answers. (2018). Retrieved 3rd September 2018 from https://play.google.com/ store/apps/details?id=com.google.android.apps.nbu.society
[7]
Richard Harper. 2010. Texture: Human Expression in the Age of Communications Overload. MIT Press, Cambridge, MA.
[8]
Lucy Hattersley. 2018. AIY Voice Essentials. (2018). Retrieved 5th March 2018 from https://www.raspberrypi.org/magpi/issues/ essentials-aiy-v1/
[9]
Anirudha Joshi, Girish Dalvi, Manjiri Joshi, Prasad Rashinkar, and Aniket Sarangdhar. 2011. Design and Evaluation of Devanagari Virtual Keyboards for Touch Screen Mobile Phones. In Proceedings of the 13th International Conference on Human Computer Interaction with Mobile Devices and Services (MobileHCI '11). ACM, New York, NY, USA, 323-- 332.
[10]
Arun Kumar, Nitendra Rajput, Dipanjan Chakraborty, Sheetal K. Agarwal, and Amit A. Nanavati. 2007. WWTW: The World Wide Telecom Web. In Proceedings of the 2007 Workshop on Networked Systems for Developing Regions (NSDR '07). ACM, New York, NY, USA, Article 7, 6 pages.
[11]
Ewa Luger and Abigail Sellen. 2016. "Like Having a Really Bad PA": The Gulf Between User Expectation and Experience of Conversational Agents. In Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems (CHI '16). ACM, New York, NY, USA, 5286--5297.
[12]
Audrey Mbogho and Michelle Katz. 2010. The Impact of Accents on Automatic Recognition of South African English Speech: A Preliminary Investigation. In Proceedings of the 2010 Annual Research Conference of the South African Institute of Computer Scientists and Information Technologists (SAICSIT '10). ACM, New York, NY, USA, 187--192.
[13]
NPR and Edison Research. 2018. The Smart Audio Report, Spring 2018. Technical Report. National Public Media LLC. https:// nationalpublicmedia.com/smart-audio-report/
[14]
Peter Pirolli. 2007. Information Foraging Theory: Adaptive Interaction with Information. Oxford University Press, Oxford, UK.
[15]
Martin Porcheron, Joel E. Fischer, Stuart Reeves, and Sarah Sharples. 2018. Voice Interfaces in Everyday Life. In Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems (CHI '18). ACM, New York, NY, USA, 640:1--640:12.
[16]
Aung Pyae and Tapani N. Joelsson. 2018. Investigating the Usability and User Experiences of Voice User Interface: A Case of Google Home Smart Speaker. In Proceedings of the 20th International Conference on Human-Computer Interaction with Mobile Devices and Services Adjunct (MobileHCI '18). ACM, New York, NY, USA, 127--131.
[17]
Question Box. 2018. Question Box: Overview. (2018). Retrieved 21st September 2018 from http://www.questionbox.org/overview/
[18]
Quora. 2018. About Quora -- Quora. (2018). Retrieved 21st September 2018 from https://www.quora.com/about
[19]
Agha Ali Raza, Rajat Kulshreshtha, Spandana Gella, Sean Blagsvedt, Maya Chandrasekaran, Bhiksha Raj, and Roni Rosenfeld. 2016. Viral Spread via Entertainment and Voice-Messaging Among Telephone Users in India. In Proceedings of the Eighth International Conference on Information and Communication Technologies and Development (ICTD '16). ACM, New York, NY, USA, 1:1--1:10.
[20]
Agha Ali Raza, Farhan Ul Haq, Zain Tariq, Mansoor Pervaiz, Samia Razaq, Umar Saif, and Roni Rosenfeld. 2013. Job Opportunities Through Entertainment: Virally Spread Speech-Based Services for Low-Literate Users. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI '13). ACM, New York, NY, USA, 2803--2812.
[21]
Simon Robinson, Jennifer Pearson, Shashank Ahire, Rini Ahirwar, Bhakti Bhikne, Nimish Maravi, and Matt Jones. 2018. Revisiting "Hole in the Wall" Computing: Private Smart Speakers and Public Slum Settings. In Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems (CHI '18). ACM, New York, NY, USA, Article 498, 11 pages.
[22]
Emanuel A. Schegloff, Gail Jefferson, and Harvey Sacks. 1977. The Preference for Self-Correction in the Organization of Repair in Conversation. Language 53, 2 (1977), 361--382.
[23]
Lucy Suchman. 2002. Located Accountabilities in Technology Production. Scandinavian Journal of Information Systems 14, 2 (Sept. 2002), 91--105.
[24]
Lucy Suchman. 2002. Practice-Based Design of Information Systems: Notes from the Hyperdeveloped World. The Information Society 18, 2 (March 2002), 139--144.
[25]
Harold Thimbleby. 2013. Reasons to Question Seven Segment Displays. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI '13). ACM, New York, NY, USA, 1431--1440.
[26]
Aditya Vashistha, Edward Cutrell, Gaetano Borriello, and William Thies. 2015. Sangeet Swara: A Community-Moderated Voice Forum in Rural India. In Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems (CHI '15). ACM, New York, NY, USA, 417--426.
[27]
Marion Walton and Vera Vukovic. 2003. Cultures, Literacy, and the Web: Dimensions of Information "Scent". Interactions 10, 2 (March 2003), 64--71.
[28]
YouGov. 2018. Smart Speaker Ownership Doubles in Six Months. (April 2018). Retrieved 7th August 2018 from https://yougov.co.uk/news/ 2018/04/19/smart-speaker-ownership-doubles-six-months/

Cited By

View all
  • (2024)Investigating the Integration and the Long-Term Use of Smart Speakers in Older Adults’ Daily Practices: Qualitative StudyJMIR mHealth and uHealth10.2196/4747212(e47472)Online publication date: 12-Feb-2024
  • (2024)WorkFit: Designing Proactive Voice Assistance for the Health and Well-Being of Knowledge WorkersProceedings of the 6th ACM Conference on Conversational User Interfaces10.1145/3640794.3665561(1-14)Online publication date: 8-Jul-2024
  • (2024)Challenges and Opportunities Designing Voice User Interfaces for Emergent UsersHuman-Computer Interaction10.1007/978-3-031-60449-2_1(3-16)Online publication date: 29-Jun-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
CHI '19: Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems
May 2019
9077 pages
ISBN:9781450359702
DOI:10.1145/3290605
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 02 May 2019

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. emergent users
  2. public space interaction
  3. speech appliances

Qualifiers

  • Research-article

Funding Sources

Conference

CHI '19
Sponsor:

Acceptance Rates

CHI '19 Paper Acceptance Rate 703 of 2,958 submissions, 24%;
Overall Acceptance Rate 6,199 of 26,314 submissions, 24%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)44
  • Downloads (Last 6 weeks)7
Reflects downloads up to 21 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Investigating the Integration and the Long-Term Use of Smart Speakers in Older Adults’ Daily Practices: Qualitative StudyJMIR mHealth and uHealth10.2196/4747212(e47472)Online publication date: 12-Feb-2024
  • (2024)WorkFit: Designing Proactive Voice Assistance for the Health and Well-Being of Knowledge WorkersProceedings of the 6th ACM Conference on Conversational User Interfaces10.1145/3640794.3665561(1-14)Online publication date: 8-Jul-2024
  • (2024)Challenges and Opportunities Designing Voice User Interfaces for Emergent UsersHuman-Computer Interaction10.1007/978-3-031-60449-2_1(3-16)Online publication date: 29-Jun-2024
  • (2023)Benefits of Community Voice: A Framework for Understanding Inclusion of Community Voice in HCI4DProceedings of the ACM on Human-Computer Interaction10.1145/36101747:CSCW2(1-26)Online publication date: 4-Oct-2023
  • (2023)Situating Automatic Speech Recognition Development within Communities of Under-heard Language SpeakersProceedings of the 2023 CHI Conference on Human Factors in Computing Systems10.1145/3544548.3581385(1-17)Online publication date: 19-Apr-2023
  • (2023)"This machine is for the aides": Tailoring Voice Assistant Design to Home Health Care WorkProceedings of the 2023 CHI Conference on Human Factors in Computing Systems10.1145/3544548.3581118(1-19)Online publication date: 19-Apr-2023
  • (2022)Designing a Smart Speaker for Emergent Users: Human Plus AI ResponseProceedings of the 13th Indian Conference on Human-Computer Interaction10.1145/3570211.3570217(67-72)Online publication date: 9-Nov-2022
  • (2022)The City as a Personal AssistantProceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies10.1145/35345736:2(1-31)Online publication date: 7-Jul-2022
  • (2022)Outside Where? A Survey of Climates and Built Environments in Studies of HCI outdoorsProceedings of the 2022 CHI Conference on Human Factors in Computing Systems10.1145/3491102.3507656(1-15)Online publication date: 29-Apr-2022
  • (2022)Can’t Touch This: Rethinking Public Technology in a COVID-19 EraProceedings of the 2022 CHI Conference on Human Factors in Computing Systems10.1145/3491102.3501980(1-14)Online publication date: 29-Apr-2022
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media