StreetWise: Smart Speakers vs Human Help in Public Slum Settings

Authors: Jennifer Pearson, Simon Robinson, Thomas Reitmaier, Shashank Ahire, Anirudha Joshi, Bhakti Bhikne

CHI '19: Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems

Paper No.: 96, Pages 1 - 13

https://doi.org/10.1145/3290605.3300326

Published: 02 May 2019

Abstract

This paper explores the use of conversational speech question and answer systems in the challenging context of public spaces in slums. A major part of this work is a comparison of the source and speed of the given responses; that is, either machine-powered and instant or human-powered and delayed. We examine these dimensions via a two-stage, multi-sited deployment. We report on a pilot deployment that helped refine the system, and a second deployment involving the installation of nine of each type of system within a large Mumbai slum for a 40-day period, resulting in over 12,000 queries. We present the findings from a detailed analysis and comparison of the two question-answer corpora; discuss how these insights might help improve machine-powered smart speakers; and highlight the potential benefits of multi-sited public speech installations within slum environments.

Supplementary Material

MP4 File (pn6275.mp4)

Supplemental video

Index Terms

StreetWise: Smart Speakers vs Human Help in Public Slum Settings
1. Human-centered computing
  1. Human computer interaction (HCI)
2. Information systems
  1. Information retrieval
    1. Specialized information retrieval
      1. Multimedia and multimodal retrieval
        Speech / audio search

Published In

CHI '19: Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems

May 2019

9077 pages

ISBN:9781450359702

DOI:10.1145/3290605

General Chairs: Stephen Brewster (University of Glasgow, Scotland, UK), Geraldine Fitzpatrick (TU Wien, Austria)

Program Chairs: Anna Cox (University College London, UK), Vassilis Kostakos (University of Melbourne, Australia)

Copyright © 2019 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org.

Sponsors

SIGCHI: ACM Special Interest Group on Computer-Human Interaction

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

Research-article

Funding Sources

Engineering and Physical Sciences Research Council

Conference

CHI '19

Sponsor:

SIGCHI

CHI '19: CHI Conference on Human Factors in Computing Systems

May 4 - 9, 2019

Glasgow, Scotland, UK

Acceptance Rates

CHI '19 Paper Acceptance Rate 703 of 2,958 submissions, 24%;

Overall Acceptance Rate 6,199 of 26,314 submissions, 24%

Bibliometrics

Total Citations: 26
Total Downloads: 612
Downloads (Last 12 months): 44
Downloads (Last 6 weeks): 7

Reflects downloads up to 21 Oct 2024

Cited By

Chang F, Sheng L, Gu Z. (2024). Investigating the Integration and the Long-Term Use of Smart Speakers in Older Adults’ Daily Practices: Qualitative Study. JMIR mHealth and uHealth 12 (e47472). Online publication date: 12-Feb-2024.
https://doi.org/10.2196/47472
Ahire S, Simon B, Rohs M. (2024). WorkFit: Designing Proactive Voice Assistance for the Health and Well-Being of Knowledge Workers. Proceedings of the 6th ACM Conference on Conversational User Interfaces (1-14). Online publication date: 8-Jul-2024.
https://dl.acm.org/doi/10.1145/3640794.3665561
Doke P, Kopparapu S. (2024). Challenges and Opportunities Designing Voice User Interfaces for Emergent Users. Human-Computer Interaction (3-16). Online publication date: 29-Jun-2024.
https://dl.acm.org/doi/10.1007/978-3-031-60449-2_1
Saha M, Lindsay S, Varghese D, Bartindale T, Olivier P. (2023). Benefits of Community Voice: A Framework for Understanding Inclusion of Community Voice in HCI4D. Proceedings of the ACM on Human-Computer Interaction 7:CSCW2 (1-26). Online publication date: 4-Oct-2023.
https://dl.acm.org/doi/10.1145/3610174
Reitmaier T, Wallington E, Klejch O, Markl N, Lam-Yee-Mui L, Pearson J, Jones M, Bell P, Robinson S. (2023). Situating Automatic Speech Recognition Development within Communities of Under-heard Language Speakers. Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems (1-17). Online publication date: 19-Apr-2023.
https://dl.acm.org/doi/10.1145/3544548.3581385
Bartle V, Albright L, Dell N. (2023). "This machine is for the aides": Tailoring Voice Assistant Design to Home Health Care Work. Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems (1-19). Online publication date: 19-Apr-2023.
https://dl.acm.org/doi/10.1145/3544548.3581118
Ahire S. (2022). Designing a Smart Speaker for Emergent Users: Human Plus AI Response. Proceedings of the 13th Indian Conference on Human-Computer Interaction (67-72). Online publication date: 9-Nov-2022.
https://dl.acm.org/doi/10.1145/3570211.3570217
Acer U, Broeck M, Min C, Dasari M, Kawsar F. (2022). The City as a Personal Assistant. Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies 6:2 (1-31). Online publication date: 7-Jul-2022.
https://dl.acm.org/doi/10.1145/3534573
Jones M, Von Feldt M, Andrus N. (2022). Outside Where? A Survey of Climates and Built Environments in Studies of HCI outdoors. Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems (1-15). Online publication date: 29-Apr-2022.
https://dl.acm.org/doi/10.1145/3491102.3507656
Pearson J, Bailey G, Robinson S, Jones M, Owen T, Zhang C, Reitmaier T, Steer C, Carter A, Sahoo D, Raju D. (2022). Can’t Touch This: Rethinking Public Technology in a COVID-19 Era. Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems (1-14). Online publication date: 29-Apr-2022.
https://dl.acm.org/doi/10.1145/3491102.3501980
