Online Evaluation of Conversational Agents Using Machine-Learned Metrics
Abstract
This article examines the emerging paradigm of machine-learned metrics for the online evaluation of conversational agents. As voice assistants and chatbots play an ever-larger role in human-computer interaction across many domains, classical evaluation techniques are increasingly limited in coverage, precision, and responsiveness. Manual feedback mechanisms are largely retrospective, introducing significant delays between defect identification and corrective action. Machine-learned evaluation methods, in contrast, use computational models trained on historical interactions to automatically estimate user satisfaction from dialogue content, conversation metadata, and behavioral cues. Hierarchical architectures process these signals at multiple temporal resolutions, enabling both fine-grained and aggregate assessments of dialogue quality. Applied in real time, such metrics allow conversational systems to adapt during an interaction, with remediation strategies ranging from response adjustment to escalation to human operators. Empirical studies show that these approaches correlate more strongly with user satisfaction, detect quality deficiencies earlier, generalize to new application areas, and continue to improve through online learning. These benefits, however, come with substantial challenges, including ensuring explainability, adapting to cultural contexts, evaluating multimodal interactions, modeling long-term engagement, preserving user privacy, and establishing standardized evaluation frameworks.
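
To illustrate the kind of machine-learned metric the abstract describes, the sketch below trains a simple satisfaction classifier on hand-crafted turn features drawn from dialogue content, conversation metadata, and behavioral cues, and uses its score to trigger escalation in real time. It is a minimal sketch using scikit-learn; the feature set, threshold, class labels, and training examples are hypothetical assumptions for illustration, not the article's actual model or data.

# Minimal sketch of a machine-learned satisfaction metric (hypothetical
# features and data; not the article's actual model or training setup).
from dataclasses import dataclass
import numpy as np
from sklearn.linear_model import LogisticRegression

@dataclass
class Turn:
    user_text: str             # dialogue content
    response_latency_s: float  # conversation metadata
    was_rephrased: bool        # behavioral cue: user repeated or rephrased a request
    barge_in: bool             # behavioral cue: user interrupted the agent

NEGATIVE_CUES = ("no", "wrong", "not what", "stop", "useless")

def featurize(turn: Turn) -> list[float]:
    """Map one turn to a small numeric feature vector."""
    text = turn.user_text.lower()
    return [
        float(len(text.split())),                          # utterance length
        float(sum(cue in text for cue in NEGATIVE_CUES)),  # negative wording
        turn.response_latency_s,                           # latency metadata
        float(turn.was_rephrased),                         # rephrase cue
        float(turn.barge_in),                              # barge-in cue
    ]

# Tiny synthetic "historical interactions": 1 = satisfied, 0 = dissatisfied.
history = [
    (Turn("thanks, that worked", 0.8, False, False), 1),
    (Turn("perfect, play the next song", 1.1, False, False), 1),
    (Turn("no, that's wrong, I said tomorrow", 2.5, True, True), 0),
    (Turn("stop, this is useless", 3.0, True, True), 0),
    (Turn("great, book it", 0.9, False, False), 1),
    (Turn("that's not what I asked for", 2.2, True, False), 0),
]
X = np.array([featurize(t) for t, _ in history])
y = np.array([label for _, label in history])

model = LogisticRegression().fit(X, y)

def online_check(turn: Turn, escalation_threshold: float = 0.4) -> str:
    """Score a live turn; escalate if predicted satisfaction is low."""
    p_satisfied = model.predict_proba(np.array([featurize(turn)]))[0, 1]
    if p_satisfied < escalation_threshold:
        return f"escalate_to_human (p_satisfied={p_satisfied:.2f})"
    return f"continue (p_satisfied={p_satisfied:.2f})"

print(online_check(Turn("no, that's not what I meant", 2.8, True, True)))
print(online_check(Turn("thanks, that's exactly right", 0.7, False, False)))

In practice, the hierarchical evaluation mentioned above would aggregate such turn-level scores into session-level estimates, but this single-resolution sketch is enough to show how a learned metric can drive real-time remediation such as escalation to a human operator.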