NUK - logo

Rezultati iskanja

Osnovno iskanje    Ukazno iskanje   

Trenutno NISTE avtorizirani za dostop do e-virov NUK. Za polni dostop se PRIJAVITE.

1 2 3 4 5
zadetkov: 350
1.
  • Sigmoid-weighted linear uni... Sigmoid-weighted linear units for neural network function approximation in reinforcement learning
    Elfwing, Stefan; Uchibe, Eiji; Doya, Kenji Neural networks, 11/2018, Letnik: 107
    Journal Article
    Recenzirano
    Odprti dostop

    In recent years, neural networks have enjoyed a renaissance as function approximators in reinforcement learning. Two decades after Tesauro’s TD-Gammon achieved near top-level human performance in ...
Celotno besedilo

PDF
2.
  • Canonical cortical circuits... Canonical cortical circuits and the duality of Bayesian inference and optimal control
    Doya, Kenji Current opinion in behavioral sciences, 10/2021, Letnik: 41
    Journal Article
    Recenzirano
    Odprti dostop

    •The duality of sensory inference and optimal control has been known since 1960s.•The duality stems from the common computations for the posterior distribution in dynamic Bayesian inference and the ...
Celotno besedilo

PDF
3.
  • Forward and inverse reinfor... Forward and inverse reinforcement learning sharing network weights and hyperparameters
    Uchibe, Eiji; Doya, Kenji Neural networks, 12/2021, Letnik: 144
    Journal Article
    Recenzirano
    Odprti dostop

    This paper proposes model-free imitation learning named Entropy-Regularized Imitation Learning (ERIL) that minimizes the reverse Kullback–Leibler (KL) divergence. ERIL combines forward and inverse ...
Celotno besedilo

PDF
4.
  • Distinct neural representat... Distinct neural representation in the dorsolateral, dorsomedial, and ventral parts of the striatum during fixed- and free-choice tasks
    Ito, Makoto; Doya, Kenji The Journal of neuroscience, 02/2015, Letnik: 35, Številka: 8
    Journal Article
    Recenzirano
    Odprti dostop

    The striatum is a major input site of the basal ganglia, which play an essential role in decision making. Previous studies have suggested that subareas of the striatum have distinct roles: the ...
Celotno besedilo

PDF
5.
  • Enhancing reinforcement lea... Enhancing reinforcement learning models by including direct and indirect pathways improves performance on striatal dependent tasks
    Blackwell, Kim T; Doya, Kenji PLoS computational biology, 08/2023, Letnik: 19, Številka: 8
    Journal Article
    Recenzirano
    Odprti dostop

    A major advance in understanding learning behavior stems from experiments showing that reward learning requires dopamine inputs to striatal neurons and arises from synaptic plasticity of ...
Celotno besedilo
6.
  • Validation of Decision-Maki... Validation of Decision-Making Models and Analysis of Decision Variables in the Rat Basal Ganglia
    Ito, Makoto; Doya, Kenji The Journal of neuroscience, 08/2009, Letnik: 29, Številka: 31
    Journal Article
    Recenzirano
    Odprti dostop

    Reinforcement learning theory plays a key role in understanding the behavioral and neural mechanisms of choice behavior in animals and humans. Especially, intermediate variables of learning models ...
Celotno besedilo

PDF
7.
  • Reward probability and timi... Reward probability and timing uncertainty alter the effect of dorsal raphe serotonin neurons on patience
    Miyazaki, Katsuhiko; Miyazaki, Kayoko W; Yamanaka, Akihiro ... Nature communications, 06/2018, Letnik: 9, Številka: 1
    Journal Article
    Recenzirano
    Odprti dostop

    Recent experiments have shown that optogenetic activation of serotonin neurons in the dorsal raphe nucleus (DRN) in mice enhances patience in waiting for future rewards. Here, we show that serotonin ...
Celotno besedilo

PDF
8.
  • Synergizing habits and goal... Synergizing habits and goals with variational Bayes
    Han, Dongqi; Doya, Kenji; Li, Dongsheng ... Nature communications, 05/2024, Letnik: 15, Številka: 1
    Journal Article
    Recenzirano
    Odprti dostop

    Behaving efficiently and flexibly is crucial for biological and artificial embodied agents. Behavior is generally classified into two types: habitual (fast but inflexible), and goal-directed ...
Celotno besedilo
9.
  • Neural substrate of dynamic Bayesian inference in the cerebral cortex
    Funamizu, Akihiro; Kuhn, Bernd; Doya, Kenji Nature neuroscience, 12/2016, Letnik: 19, Številka: 12
    Journal Article
    Recenzirano

    Dynamic Bayesian inference allows a system to infer the environmental state under conditions of limited sensory observation. Using a goal-reaching task, we found that posterior parietal cortex (PPC) ...
Celotno besedilo
10.
  • Reinforcement Learning in C... Reinforcement Learning in Continuous Time and Space
    Doya, Kenji Neural computation, 01/2000, Letnik: 12, Številka: 1
    Journal Article
    Recenzirano

    This article presents a reinforcement learning framework for continuous-time dynamical systems without a priori discretization of time, state, and action. Basedonthe Hamilton-Jacobi-Bellman (HJB) ...
Celotno besedilo
1 2 3 4 5
zadetkov: 350

Nalaganje filtrov