Using a Chatbot to Assess: Application of a Conversational Tool in an Assessment Course

Authors

  • Karen Moran Jackson Soka University of America Author
  • Yuki Miyoshi Soka University of America Author
  • Emma Sherbine Soka University of America Author

Keywords:

assessment, chatbot, generative AI

Abstract

The authors describe the development of an AI chatbot, based on a large language model (LLM), as a formative assessment tool in a master’s-level educational assessment class. The pedagogical goal was to provide an experiential assessment and feedback on student content knowledge. The students and instructor found the conversational assessment straightforward, but were concerned about feedback quality, grading reliability, and development time. For use in classroom assessments, LLM-based chatbots and related applications will need guidance from educators on appropriate conversational responses. Educators will also need to develop verification methods regarding the reliability of chatbot-assigned grades.

Downloads

Download data is not yet available.

References

Adiguzel, T., Kaya, M. H., & Cansu, F. K. (2023). Revolutionizing education with AI: Exploring the transformative potential of ChatGPT. Contemporary Educational Technology, 15(3), ep429. https://doi.org/10.30935/cedtech/13152

AI for Education. (2023). GenAI chatbot prompt library for educators. AI for Education. https://www.aiforeducation.io/prompt-library

American Educational Research Association, American Psychological Association, National Council on Measurement in Education (AERA, APA, & NCME). (2014). Standards for educational and psychological testing. American Educational Research Association. https://www.testingstandards.net/open-access-files.html

Clarke, M. & Luna-Bazaldua, D. (2021). Primer on large-scale assessments of educational achievement. World Bank Publications. https://hdl.handle.net/10986/35494

Davar, N. F., Dewan, M. A. A., & Zhang, X. (2025). AI chatbots in education: Challenges and opportunities. Information, 16(3), Article 3. https://doi.org/10.3390/info16030235

Dolman, J. (2024, March 5). Do we really need AI wrappers? The AI English Teacher. https://theaienglishteacher.wordpress.com/2024/03/05/do-we-really-need-ai-wrappers/

Eke, D. O. (2023). ChatGPT and the rise of generative AI: Threat to academic integrity? Journal of Responsible Technology, 13, 100060. https://doi.org/10.1016/j.jrt.2023.100060

Fan, J., Sun, T., Liu, J., Zhao, T., Zhang, B., Chen, Z., Glorioso, M., & Hack, E. (2023). How well can an AI chatbot infer personality? Examining psychometric properties of machine-inferred personality scores. Journal of Applied Psychology, 108(8), 1277–1299. https://doi.org/10.1037/apl0001082

Gruenhagen, J. H., Sinclair, P. M., Carroll, J.-A., Baker, P. R. A., Wilson, A., & Demant, D. (2024). The rapid rise of generative AI and its implications for academic integrity: Students’ perceptions and use of chatbots for assistance with assessments. Computers and Education: Artificial Intelligence, 7, 100273. https://doi.org/10.1016/j.caeai.2024.100273

Herft, A. (2023). A Teacher’s Prompt Guide to ChatGPT aligned with ’What Works Best’. https://usergeneratededucation.files.wordpress.com/2023/01/a-teachers-prompt-guide-to-chatgpt-aligned-with-what-works-best.pdf

Huang, K. (2023, January 18). Alarmed by A.I. chatbots, universities start revamping how they teach. The New York Times, International Edition. https://www.proquest.com/docview/2766885034/citation/30EC3A82EB184DB1PQ/1

Hmoud, M., Swaity, H., Anjass, E., & Aguaded-Ramírez, E. M. (2024). Rubric development and validation for assessing tasks’ solving via AI chatbots. Electronic Journal of E-Learning, 22(6), 1–17. https://doi.org/10.34190/ejel.22.6.3292

Ifenthaler, D., Gibson, D., Prasse, D., Shimada, A., & Yamada, M. (2020). Putting learning back into learning analytics: Actions for policy makers, researchers, and practitioners. Educational Technology Research and Development. https://doi.org/10.1007/s11423-020-09909-8

Kahoot. (2024). About us. https://kahoot.com/company/

Khademi, A. (2023). Can ChatGPT and bard generate aligned assessment items? A reliability analysis against human performance. ArXiv Preprint. ArXiv:2304.05372.

Koretz, D. (2009). Measuring up: What educational testing really tells us. Harvard University Press.

Labadze, L., Grigolia, M., & Machaidze, L. (2023). Role of AI chatbots in education: Systematic literature review. International Journal of Educational Technology in Higher Education, 20(1), 56. https://doi.org/10.1186/s41239-023-00426-1

Latif, E., Mai, G., Nyaaba, M., Wu, X., Liu, N., Lu, G., Li, S., Liu, T., & Zhai, X. (2023). AGI: Artificial General Intelligence for Education (arXiv:2304.12479). arXiv. https://doi.org/10.48550/arXiv.2304.12479

Okonkwo, C. W., & Ade-Ibijola, A. (2021). Chatbots applications in education: A systematic review. Computers and Education: Artificial Intelligence, 2, 100033. https://doi.org/10.1016/j.caeai.2021.100033

Perkins et al. (2024). The AI assessment scale.

Peters, M. A. (2018). Deep learning, education and the final stage of automation. Educational Philosophy and Theory, 50(6–7), 549–553. https://doi.org/10.1080/00131857.2017.1348928

Playlab.ai. (2024). Terms of Use. https://www.playlab.ai/policies/terms-of-use

Plickers (2021). What is Plickers? https://help.plickers.com/hc/en-us/articles/360009395854-What-is-Plickers

Roose, K. (2023, January 12). Don’t Ban ChatGPT in Schools. Teach With It. The New York Times. https://www.nytimes.com/2023/01/12/technology/chatgpt-schools-teachers.html

Rudolph, J., Tan, S., & Tan, S. (2023). ChatGPT: Bullshit spewer or the end of traditional assessments in higher education? Journal of Applied Learning and Teaching, 6(1), Article 1. https://doi.org/10.37074/jalt.2023.6.1.9

Schick, A., Feine, J., Morana, S., Maedche, A., & Reininghaus, U. (2022). Validity of Chatbot Use for Mental Health Assessment: Experimental Study. JMIR mHealth and uHealth, 10(10), e28082. https://doi.org/10.2196/28082

Stahl, B. C., Schroeder, D., & Rodrigues, R. (2023). Ethics of Artificial Intelligence: Case Studies and Options for Addressing Ethical Challenges. Springer International Publishing. https://doi.org/10.1007/978-3-031-17040-9

Wang, T., Lund, B. D., Marengo, A., Pagano, A., Mannuru, N. R., Teel, Z. A., & Pange, J. (2023). Exploring the potential impact of artificial intelligence (AI) on international students in higher education: Generative AI, chatbots, analytics, and international student success. Applied Sciences, 13(11), 6716. https://doi.org/10.3390/app13116716

Williams, R. T. (2024). The ethical implications of using generative chatbots in higher education. Frontiers in Education, 8. https://doi.org/10.3389/feduc.2023.1331607

Yang, S., & Evans, C. (2020). Opportunities and challenges in using ai chatbots in higher education. Proceedings of the 2019 3rd International Conference on Education and E-Learning, 79–83. https://doi.org/10.1145/3371647.3371659

Published

2026-04-06

How to Cite

Using a Chatbot to Assess: Application of a Conversational Tool in an Assessment Course. (2026). Journal on Excellence in College Teaching, 37(Special Issue). https://celt.miamioh.edu/index.php/JECT/article/view/1328