NUK - logo

Search results

Basic search    Expert search   

Currently you are NOT authorised to access e-resources NUK. For full access, REGISTER.

1 2 3
hits: 22
1.
  • Neural scene representation... Neural scene representation and rendering
    Eslami, S M Ali; Jimenez Rezende, Danilo; Besse, Frederic ... Science, 06/2018, Volume: 360, Issue: 6394
    Journal Article
    Peer reviewed
    Open access

    Scene representation-the process of converting visual sensory data into concise descriptions-is a requirement for intelligent behavior. Recent work has shown that neural networks excel at this task ...
Full text
2.
  • Grandmaster level in StarCr... Grandmaster level in StarCraft II using multi-agent reinforcement learning
    Vinyals, Oriol; Babuschkin, Igor; Czarnecki, Wojciech M ... Nature (London), 11/2019, Volume: 575, Issue: 7782
    Journal Article
    Peer reviewed

    Many real-world applications require artificial agents to compete and coordinate with other agents in complex environments. As a stepping stone to this goal, the domain of StarCraft has emerged as an ...
Full text
3.
  • Hybrid computing using a ne... Hybrid computing using a neural network with dynamic external memory
    Graves, Alex; Wayne, Greg; Reynolds, Malcolm ... Nature, 10/2016, Volume: 538, Issue: 7626
    Journal Article
    Peer reviewed
    Open access

    Artificial neural networks are remarkably adept at sensory processing, sequence learning and reinforcement learning, but are limited in their ability to represent variables and data structures and to ...
Full text

PDF
4.
  • Planning and Policy Improvement
    Danihelka, Ivo 01/2023
    Dissertation

    MuZero is currently the most successful general reinforcement learning algorithm, achieving the state of the art on Go, chess, shogi, and Atari. We want to help MuZero to be successful in even more ...
Full text
5.
  • Muesli: Combining Improvements in Policy Optimization
    Hessel, Matteo; Danihelka, Ivo; Viola, Fabio ... arXiv (Cornell University), 03/2022
    Paper, Journal Article
    Open access

    We propose a novel policy update that combines regularized policy optimization with model learning as an auxiliary loss. The update (henceforth Muesli) matches MuZero's state-of-the-art performance ...
Full text
6.
  • Optimistic Simulated Exploration as an Incentive for Real Exploration
    Danihelka, Ivo arXiv (Cornell University), 05/2009
    Paper, Journal Article
    Open access

    Many reinforcement learning exploration techniques are overly optimistic and try to explore every state. Such exploration is impossible in environments with the unlimited number of states. I propose ...
Full text
7.
  • Comparison of Maximum Likelihood and GAN-based training of Real NVPs
    Danihelka, Ivo; Lakshminarayanan, Balaji; Uria, Benigno ... arXiv (Cornell University), 05/2017
    Paper, Journal Article
    Open access

    We train a generator by maximum likelihood and we also train the same generator architecture by Wasserstein GAN. We then compare the generated samples, exact log-probability densities and approximate ...
Full text
8.
  • Neural Turing Machines
    Graves, Alex; Wayne, Greg; Danihelka, Ivo arXiv (Cornell University), 12/2014
    Paper, Journal Article
    Open access

    We extend the capabilities of neural networks by coupling them to external memory resources, which they can interact with by attentional processes. The combined system is analogous to a Turing ...
Full text
9.
  • Memory-Efficient Backpropagation Through Time
    Gruslys, Audrūnas; Munos, Remi; Danihelka, Ivo ... arXiv (Cornell University), 06/2016
    Paper, Journal Article
    Open access

    We propose a novel approach to reduce memory consumption of the backpropagation through time (BPTT) algorithm when training recurrent neural networks (RNNs). Our approach uses dynamic programming to ...
Full text
10.
  • Grid Long Short-Term Memory
    Nal Kalchbrenner; Danihelka, Ivo; Graves, Alex arXiv (Cornell University), 01/2016
    Paper, Journal Article
    Open access

    This paper introduces Grid Long Short-Term Memory, a network of LSTM cells arranged in a multidimensional grid that can be applied to vectors, sequences or higher dimensional data such as images. The ...
Full text
1 2 3
hits: 22

Load filters