2 Artículos

Variational Reward Estimator Bottleneck: Towards Robust Reward Estimator for Multidomain Task-Oriented Dialogue

Acceso

en línea

Jeiyoon Park, Chanhee Lee, Chanjun Park, Kuekyeng Kim and Heuiseok Lim

Despite its significant effectiveness in adversarial training approaches to multidomain task-oriented dialogue systems, adversarial inverse reinforcement learning of the dialogue policy frequently fails to balance the performance of the reward estimator ... ver más

Revista: Applied Sciences Formato: Electrónico

Tabla de contenido: Vol: 11 Num: 0 Par: 14 Año: 2021

« Anterior Página: 1 de 1 Siguiente »