|
Banerjee , B., Vittanala , S. & Taylor , M. E. 2019. Team learning from human demonstration with coordination confidence. The Knowledge Engineering Review 34, e12. |
|
de la Cruz , G. V., Du , Y. & Taylor , M. E. 2019. Pre-training with non-expert human demonstration for deep reinforcement learning. The Knowledge Engineering Review 34, e10. |
|
Jain , A., Khetarpal , K. & Precup , D. 2021. Safe option-critic: learning safety in the option-critic architecture. The Knowledge Engineering Review 36, e4. |
|
Li , M., Brys , T. & Kudenko , D. 2019a. Introspective q-learning and learning from demonstration. The Knowledge Engineering Review 34, e8. |
|
Li , M., Wei , Y. & Kudenko , D. 2019b. Two-level q-learning: learning from conflict demonstrations. The Knowledge Engineering Review 34, e14. |
|
Player , C. & Griffiths , N. 2020. Improving trust and reputation assessment with dynamic behaviour. The Knowledge Engineering Review 35, e29. |
|
Ramos , G. d. O., Da Silva , B. C., Râdulescu , R., Bazzan , A. L. C. & Nowé , A. 2020. Toll-based reinforcement learning for efficient equilibria in route choice. The Knowledge Engineering Review 35, e8. |
|
Roesler , O. & Nowé , A. 2019. Action learning and grounding in simulated human–robot interactions. The Knowledge Engineering Review 34, e13. |
|
Sen , S., Crawford , C., Dees , A., Nanda Kumar , R. & Hale , J. 2020. Effects of parity, sympathy and reciprocity in increasing social welfare. The Knowledge Engineering Review 35, e31. |
|
Valcarcel Macua , S., Davies , I., Tukiainen , A. & Munoz de Cote , E. in press. Diff-dac: Fully distributed actor-critic for average multitask deep reinforcement learning. The Knowledge Engineering Review 36. |