Teaching about Data and Databases: Why, What, How?

Published: 29 July 2022

Abstract

The panel on data(base) education at VLDB 2021 [13] drew attention to important challenges in choosing how database classes are constructed for students in a world where data is being used in novel and impactful settings. This paper presents one view of a process for making these pedagogical decisions. We do not aim to present a best-possible design of the subject; rather, we want to illuminate the space of possibilities and to encourage reasoned choices, as opposed to simply teaching the subject as it was previously offered or spending time on the latest innovations without considering the "opportunity cost" of doing so. We hope to guide the perplexed instructor or departmental curriculum committee.

References

[1]
A. Adhikari, J. DeNero, and M. I. Jordan. Interleaving computational and inferential thinking: Data science for undergraduates at Berkeley. Harvard Data Science Review, 4 2021. https://hdsr.mitpress.mit.edu/pub/e69066t4.
[2]
A. Ailamaki and J. M. Hellerstein. Exposing undergraduate students to database system internals. SIGMOD Rec., 32(3):18--20, 2003.
[3]
R. Alkhabaz, S. Poulsen, M. Chen, and A. Alawini. Insights from student solutions to mongodb homework problems. In ITiCSE 2021: 26th ACM Conference on Innovation and Technology in Computer Science Education, pages 276--282, 2021.
[4]
A. Bhangdiya, B. Chandra, B. Kar, B. Radhakrishnan, K. V. M. Reddy, S. Shah, and S. Sudarshan. The xda-ta system for automated grading of SQL query assignments. In 31st IEEE International Conference on Data Engineering, ICDE 2015, pages 1468--1471, 2015.
[5]
J. Celko. SQL for Smarties: Advanced SQL Programming, (5. ed.). Morgan Kaufmann, 2014.
[6]
M. Chen, S. Poulsen, R. Alkhabaz, and A. Alawini. A quantitative analysis of student solutions to graph database problems. In ITiCSE 2021: 26th ACM Conference on Innovation and Technology in Computer Science Education, pages 283--289, 2021.
[7]
A. Danyluk and P. Leidig. Computer science curricula 2013. https://www.acm.org/binaries/content/assets/education/cs2013_web_final.pdf.
[8]
M. de Raadt, S. Dekeyser, and T. Y. Lee. Do students SQLify? improving learning outcomes with peer review and enhanced computer assisted assessment of querying skills. In 6th Baltic Sea Conference on Computing Education Research, Koli Calling'06, pages 101--108, 2006.
[9]
A. D. Fekete. Teaching transaction management with SQL examples. In Proceedings of the 10th Annual SIGCSE Conference on Innovation and Technology in Computer Science Education, ITiCSE 2005, pages 163--167, 2005.
[10]
A. D. Fekete, J. Kay, and U. Röhm. A data-centric computing curriculum for a data science major. In SIGCSE '21: The 52nd ACM Technical Symposium on Computer Science Education, pages 865--871, 2021.
[11]
S. Foss, T. Urazova, and R. Lawrence. Automatic generation and marking of UML database design diagrams. In SIGCSE 2022: The 53rd ACM Technical Symposium on Computer Science Education, Providence, RI, USA, March 3--5, 2022, Volume 1, pages 626--632, 2022.
[12]
H. Garcia-Molina, J. D. Ullman, and J. Widom. Database systems - the complete book (2. ed.). Pearson Education, 2009.
[13]
Z. Ives, J. Gehrke, J. Giceva, A. Kumar, and R. Pottinger. VLDB panel summary: "the future of data(base) education: Is the cow book dead?". SIGMOD Rec., 50(3):23--26, 2021.
[14]
J. Kawash, T. N. Jarada, and M. Moshirpour. Group exams as learning tools: Evidence from an undergraduate database course. In Proceedings of the 51st ACM Technical Symposium on Computer Science Education, SIGCSE 2020, pages 626--632, 2020.
[15]
R. Kearns, S. Shead, and A. D. Fekete. A teaching system for SQL. In Proceedings of the ACM SIGCSE 2nd Australasian Conference on Computer Science Education, ACSE 1997, pages 224--231, 1997.
[16]
S. Kim. Seamless integration of NoSQL class into the database curriculum. In Proceedings of the 2020 ACM Conference on Innovation and Technology in Computer Science Education, ITiCSE 2020, pages 314--320, 2020.
[17]
A. Kleerekoper and A. Schofield. SQL tester: an online SQL assessment tool and its impact. In Proceedings of the 23rd Annual ACM Conference on Innovation and Technology in Computer Science Education, ITiCSE 2018, pages 87--92, 2018.
[18]
C. Kleiner, C. Tebbe, and F. Heine. Automated grading and tutoring of SQL statements to improve student learning. In 13th Koli Calling International Conference on Computing Education Research, Koli Calling '13, pages 161--168, 2013.
[19]
E. M. Knorr. Worked examples, cognitive load, and exam assessments in a senior database course. In Proceedings of the 51st ACM Technical Symposium on Computer Science Education, SIGCSE 2020, pages 612--618, 2020.
[20]
P. M. Lewis, A. J. Bernstein, and M. Kifer. Database Systems: An Application Oriented Approach, Complete Version, (2. ed.). Addison-Wesley, 2006.
[21]
A. Migler and A. Dekhtyar. Mapping the SQL learning process in introductory database courses. In Proceedings of the 51st ACM Technical Symposium on Computer Science Education, SIGCSE 2020, pages 619--625, 2020.
[22]
A. Mitrovic. Learning SQL with a computerized tutor. In Proceedings of the 29th SIGCSE Technical Symposium on Computer Science Education, SIGCSE 1998, pages 307--311, 1998.
[23]
P. E. O'Neil and E. J. O'Neil. Database: Principles, Programming, and Performance, (2. ed.). Morgan Kaufmann, 2000.
[24]
M. P. Papazoglou and W. Valder. Relational database management - a systems programming approach. Prentice Hall, 1989.
[25]
R. Ramakrishnan and J. Gehrke. Database management systems (3. ed.). McGraw-Hill, 2003.
[26]
U. Röhm, L. Brent, T. Dawborn, and B. Jeffries. SQL for data scientists: Designing SQL tutorials for scalable online teaching. Proceedings of the VLDB Endowment (PVLDB), 13(12):2989--2992, 2020.
[27]
S. W. Sadiq, M. E. Orlowska, W. Sadiq, and J. Y. Lin. Sqlator: an online SQL learning workbench. In Proceedings of the 9th Annual SIGCSE Conference on Innovation and Technology in Computer Science Education, ITiCSE 2004, pages 223--227, 2004.
[28]
M. Sahami and S. Roach. Computing competencies for undergraduate data science curricula. https://dstf.acm.org/DSTF_Final_Report.pdf, 2021.
[29]
E. Sciore. Simpledb: a simple java-based multiuser syst for teaching database internals. In Proceedings of the 38th SIGCSE Technical Symposium on Computer Science Education, SIGCSE 2007, Covington, Kentucky, USA, March 7--11, 2007, pages 561--565, 2007.
[30]
A. Silberschatz, H. F. Korth, and S. Sudarshan. Database System Concepts, Seventh Edition. McGraw-Hill Book Company, 2020.
[31]
B. Sotomayor and A. Shaw. chidb: Building a simple relational database system from scratch. In Proceedings of the 47th ACM Technical Symposium on Computing Science Education, SIGCSE 2016, Memphis, TN, USA, March 02 - 05, 2016, pages 407--412, 2016.
[32]
G. Swart. Minsql: a simple componentized database for the classroom. In Proceedings of the 2nd International Symposium on Principles and Practice of Programming in Java, PPPJ 2003, Kilkenny City, Ireland, June 16--18, 2003, pages 129--132, 2003.
[33]
J. D. Ullman. Improving the efficiency of database-system teaching. In Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data, pages 1--3, 2003.
[34]
J. D. Ullman and J. Widom. A first course in database systems (2. ed.). Prentice Hall, 2002.

Published In

ACM SIGMOD Record  Volume 51, Issue 2
June 2022
72 pages
ISSN:0163-5808
DOI:10.1145/3552490
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

Publisher

Association for Computing Machinery

New York, NY, United States
