doi:10.1017/S0269888921000047

Banerjee , B., Vittanala , S. & Taylor , M. E. 2019. Team learning from human demonstration with coordination confidence. The Knowledge Engineering Review 34, e12.

de la Cruz , G. V., Du , Y. & Taylor , M. E. 2019. Pre-training with non-expert human demonstration for deep reinforcement learning. The Knowledge Engineering Review 34, e10.

Jain , A., Khetarpal , K. & Precup , D. 2021. Safe option-critic: learning safety in the option-critic architecture. The Knowledge Engineering Review 36, e4.

Li , M., Brys , T. & Kudenko , D. 2019a. Introspective q-learning and learning from demonstration. The Knowledge Engineering Review 34, e8.

Li , M., Wei , Y. & Kudenko , D. 2019b. Two-level q-learning: learning from conflict demonstrations. The Knowledge Engineering Review 34, e14.

Player , C. & Griffiths , N. 2020. Improving trust and reputation assessment with dynamic behaviour. The Knowledge Engineering Review 35, e29.

Ramos , G. d. O., Da Silva , B. C., Râdulescu , R., Bazzan , A. L. C. & Nowé , A. 2020. Toll-based reinforcement learning for efficient equilibria in route choice. The Knowledge Engineering Review 35, e8.

Roesler , O. & Nowé , A. 2019. Action learning and grounding in simulated human–robot interactions. The Knowledge Engineering Review 34, e13.

Sen , S., Crawford , C., Dees , A., Nanda Kumar , R. & Hale , J. 2020. Effects of parity, sympathy and reciprocity in increasing social welfare. The Knowledge Engineering Review 35, e31.

Valcarcel Macua , S., Davies , I., Tukiainen , A. & Munoz de Cote , E. in press. Diff-dac: Fully distributed actor-critic for average multitask deep reinforcement learning. The Knowledge Engineering Review 36.