UNI-MB - logo
UMNIK - logo
 

Search results

Basic search    Expert search   

Currently you are NOT authorised to access e-resources UM. For full access, REGISTER.

1 2 3 4 5
hits: 44
1.
  • Revisiting the Arcade Learn... Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents
    Machado, Marlos C.; Bellemare, Marc G.; Talvitie, Erik ... The Journal of artificial intelligence research, 03/2018, Volume: 61
    Journal Article
    Peer reviewed
    Open access

    The Arcade Learning Environment (ALE) is an evaluation platform that poses the challenge of building AI agents with general competency across dozens of Atari 2600 games. It supports a variety of ...
Full text

PDF
2.
  • Overcoming catastrophic for... Overcoming catastrophic forgetting in neural networks
    Kirkpatrick, James; Pascanu, Razvan; Rabinowitz, Neil ... Proceedings of the National Academy of Sciences - PNAS, 03/2017, Volume: 114, Issue: 13
    Journal Article
    Peer reviewed
    Open access

    The ability to learn tasks in a sequential fashion is crucial to the development of artificial intelligence. Until now neural networks have not been capable of this and it has been widely thought ...
Full text

PDF
3.
  • The Arcade Learning Environ... The Arcade Learning Environment: An Evaluation Platform for General Agents
    Bellemare, M. G.; Naddaf, Y.; Veness, J. ... The Journal of artificial intelligence research, 01/2013, Volume: 47
    Journal Article
    Peer reviewed
    Open access

    In this article we introduce the Arcade Learning Environment (ALE): both a challenge problem and a platform and methodology for evaluating the development of general, domain-independent AI ...
Full text

PDF
4.
  • Human-level control through... Human-level control through deep reinforcement learning
    Mnih, Volodymyr; Kavukcuoglu, Koray; Silver, David ... Nature (London), 2015-Feb-26, 2015-02-26, 20150226, Volume: 518, Issue: 7540
    Journal Article
    Peer reviewed

    The theory of reinforcement learning provides a normative account, deeply rooted in psychological and neuroscientific perspectives on animal behaviour, of how agents may optimize their control of an ...
Full text
5.
  • A Monte-Carlo AIXI Approxim... A Monte-Carlo AIXI Approximation
    Veness, J.; Ng, K.S.; Hutter, M. ... The Journal of artificial intelligence research, 01/2011, Volume: 40
    Journal Article
    Peer reviewed
    Open access

    This paper introduces a principled approach for the design of a scalable general reinforcement learning agent. Our approach is based on a direct approximation of AIXI, a Bayesian optimality notion ...
Full text

PDF
6.
Full text
7.
Full text
8.
Full text

PDF
9.
  • REPLY TO HUSZÁR REPLY TO HUSZÁR
    Kirkpatrick, James; Pascanu, Razvan; Rabinowitz, Neil ... Proceedings of the National Academy of Sciences, 03/2018, Volume: 115, Issue: 11
    Journal Article
    Peer reviewed
    Open access

    In their recent work on elastic weight consolidation (EWC), Kirkpatrick et al show that forgetting in neural networks can be alleviated by using a quadratic penalty whose derivation was inspired by ...
Full text

PDF
10.
Full text

PDF
1 2 3 4 5
hits: 44

Load filters