NUK - logo

Search results

Basic search    Advanced search   
Search
request
Library

Currently you are NOT authorised to access e-resources NUK. For full access, REGISTER.

3 4 5 6 7
hits: 151,874
41.
  • A general reinforcement lea... A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play
    Silver, David; Hubert, Thomas; Schrittwieser, Julian ... Science (American Association for the Advancement of Science), 12/2018, Volume: 362, Issue: 6419
    Journal Article
    Peer reviewed
    Open access

    The game of chess is the longest-studied domain in the history of artificial intelligence. The strongest programs are based on a combination of sophisticated search techniques, domain-specific ...
Full text

PDF
42.
  • GAN-Powered Deep Distributi... GAN-Powered Deep Distributional Reinforcement Learning for Resource Management in Network Slicing
    Hua, Yuxiu; Li, Rongpeng; Zhao, Zhifeng ... IEEE journal on selected areas in communications, 02/2020, Volume: 38, Issue: 2
    Journal Article
    Peer reviewed
    Open access

    Network slicing is a key technology in 5G communications system. Its purpose is to dynamically and efficiently allocate resources for diversified services with distinct requirements over a common ...
Full text

PDF
43.
  • Thinning Schedules of Reinf... Thinning Schedules of Reinforcement Following Functional Communication Training for Children with Intellectual and Developmental Disabilities: A Meta-analytic Review
    Muharib, Reem; Alrasheed, Fahad; Ninci, Jennifer ... Journal of autism and developmental disorders, 12/2019, Volume: 49, Issue: 12
    Journal Article
    Peer reviewed

    Functional communication training (FCT) is an evidence-based practice used to mitigate challenging behavior by increasing functional communication skills. To increase the practicality and feasibility ...
Full text
44.
  • The Partial-Reinforcement E... The Partial-Reinforcement Extinction Effect Does Not Result From Reduced Sensitivity to Nonreinforcement
    Harris, Justin A; Seet, Manuel S; Kwok, Dorothy W. S Journal of experimental psychology. Animal learning and cognition, 04/2019, Volume: 45, Issue: 2
    Journal Article
    Peer reviewed
    Open access

    Five experiments used a magazine approach paradigm with rats to investigate whether learning about nonreinforcement is impaired in the presence of a conditioned stimulus (CS) that had been partially ...
Full text

PDF
45.
  • Linear Quadratic Tracking C... Linear Quadratic Tracking Control of Partially-Unknown Continuous-Time Systems Using Reinforcement Learning
    Modares, Hamidreza; Lewis, Frank L. IEEE transactions on automatic control, 2014-Nov., 2014-11-00, 20141101, Volume: 59, Issue: 11
    Journal Article
    Peer reviewed

    In this technical note, an online learning algorithm is developed to solve the linear quadratic tracking (LQT) problem for partially-unknown continuous-time systems. It is shown that the value ...
Full text
46.
  • Whole section anchor–grouti... Whole section anchor–grouting reinforcement technology and its application in underground roadways with loose and fractured surrounding rock
    Fangtian, Wang; Cun, Zhang; Shuaifeng, Wei ... Tunnelling and underground space technology, January 2016, 2016-01-00, 20160101, Volume: 51
    Journal Article
    Peer reviewed

    •Roadways deformation with loose and fractured surrounding rock is large and difficult to control.•Plastic zone distributes in oval shape with different forces in horizontal and vertical ...
Full text
47.
  • Corrosion of steel bars emb... Corrosion of steel bars embedded in fibre reinforced concrete under chloride attack: State of the art
    Berrocal, Carlos G.; Lundgren, Karin; Löfgren, Ingemar Cement and concrete research, 02/2016, Volume: 80
    Journal Article
    Peer reviewed
    Open access

    This literature review summarises the influence of fibres on the main parameters governing corrosion of conventional reinforcement. The ability of fibres to suppress crack growth has proven to ...
Full text

PDF
48.
  • Deconstructing the human al... Deconstructing the human algorithms for exploration
    Gershman, Samuel J. Cognition, 04/2018, Volume: 173
    Journal Article
    Peer reviewed
    Open access

    •Exploration algorithms can be distinguished in terms of the bias and slope of choice functions.•Two experiments show evidence for both directed and random exploration.•A hybrid algorithm provides ...
Full text

PDF
49.
  • Reinforcement biases subseq... Reinforcement biases subsequent perceptual decisions when confidence is low, a widespread behavioral phenomenon
    Lak, Armin; Hueske, Emily; Hirokawa, Junya ... eLife, 04/2020, Volume: 9
    Journal Article
    Peer reviewed
    Open access

    Learning from successes and failures often improves the quality of subsequent decisions. Past outcomes, however, should not influence purely perceptual decisions after task acquisition is complete ...
Full text

PDF
50.
  • Learning About Trial Sequen... Learning About Trial Sequences Disrupts the Partial Reinforcement Extinction Effect in Classical Conditioning
    Jiao, Tianjian; Harris, Justin A. Journal of experimental psychology. Animal learning and cognition, 01/2024, Volume: 50, Issue: 1
    Journal Article
    Peer reviewed

    The partial reinforcement extinction effect (PREE) refers to the phenomenon that conditioned responding extinguishes more slowly if subjects had been inconsistently ("partially") reinforced than if ...
Full text
3 4 5 6 7
hits: 151,874

Load filters