NUK - logo

Rezultati iskanja

Osnovno iskanje    Ukazno iskanje   

Trenutno NISTE avtorizirani za dostop do e-virov NUK. Za polni dostop se PRIJAVITE.

1 2 3 4 5
zadetkov: 7.365
1.
  • Mixed experience sampling f... Mixed experience sampling for off-policy reinforcement learning
    Yu, Jiayu; Li, Jingyao; Lü, Shuai ... Expert systems with applications, 10/2024, Letnik: 251
    Journal Article
    Recenzirano

    In deep reinforcement learning, experience replay is usually used to improve data efficiency and alleviate experience forgetting. However, online reinforcement learning is often influenced by the ...
Celotno besedilo
2.
  • Recruitment-imitation mecha... Recruitment-imitation mechanism for evolutionary reinforcement learning
    Lü, Shuai; Han, Shuai; Zhou, Wenbo ... Information sciences, April 2021, 2021-04-00, Letnik: 553
    Journal Article
    Recenzirano
    Odprti dostop

    Reinforcement learning, evolutionary algorithms and imitation learning are three principal methods to deal with continuous control tasks. Reinforcement learning is sample efficient, yet sensitive to ...
Celotno besedilo

PDF
3.
  • Unsupervised domain adaptat... Unsupervised domain adaptation via softmax-based prototype construction and adaptation
    Li, Jingyao; Lü, Shuai; Li, Zhanshan Information sciences, September 2022, 2022-09-00, Letnik: 609
    Journal Article
    Recenzirano

    To mitigate the distribution difference between the source and target domains, there have been many unsupervised domain adaptation methods to achieve class-level alignment by aligning the prototypes ...
Celotno besedilo
4.
  • Actor-critic with familiari... Actor-critic with familiarity-based trajectory experience replay
    Gong, Xiaoyu; Yu, Jiayu; Lü, Shuai ... Information sciences, January 2022, 2022-01-00, Letnik: 582
    Journal Article
    Recenzirano

    This paper aims to solve sample inefficiency in Asynchronous Advantage Actor-Critic (A3C). First, we design a new off-policy actor-critic algorithm, which combines actor-critic with experience replay ...
Celotno besedilo
5.
  • Feature concatenation for a... Feature concatenation for adversarial domain adaptation
    Li, Jingyao; Li, Zhanshan; Lü, Shuai Expert systems with applications, 05/2021, Letnik: 169
    Journal Article
    Recenzirano

    Domain adaptation aims to mitigate the domain gap between the source and target domains so that knowledge can be transferred between domains. There are two key factors that determine the adaptation ...
Celotno besedilo
6.
  • Consistency regularization-... Consistency regularization-based mutual alignment for source-free domain adaptation
    Lü, Shuai; Li, Zongze; Zhang, Xinyu ... Expert systems with applications, 05/2024, Letnik: 241
    Journal Article
    Recenzirano

    Unsupervised domain adaptation (UDA) is used to extend the model working on well-annotated source data to unlabeled target data. However, in practice, due to privacy and storage issues, we can only ...
Celotno besedilo
7.
  • A New Battery/Ultracapacito... A New Battery/Ultracapacitor Energy Storage System Design and Its Motor Drive Integration for Hybrid Electric Vehicles
    Shuai Lu; Corzine, K.A.; Ferdowsi, M. IEEE transactions on vehicular technology, 07/2007, Letnik: 56, Številka: 4
    Journal Article
    Recenzirano

    This paper proposes a new energy storage system (ESS) design, including both batteries and ultracapacitors (UCs) in hybrid electric vehicle (HEV) and electric vehicle applications. The conventional ...
Celotno besedilo
8.
Celotno besedilo
9.
  • Construction of emissive ru... Construction of emissive ruthenium(II) metallacycle over 1000 nm wavelength for in vivo biomedical applications
    Xu, Yuling; Li, Chonglu; Lu, Shuai ... Nature communications, 04/2022, Letnik: 13, Številka: 1
    Journal Article
    Recenzirano
    Odprti dostop

    Although Ru(II)-based agents are expected to be promising candidates for substituting Pt-drug, their in vivo biomedical applications are still limited by the short excitation/emission wavelengths and ...
Celotno besedilo
10.
  • Regularly updated determini... Regularly updated deterministic policy gradient algorithm
    Han, Shuai; Zhou, Wenbo; Lü, Shuai ... Knowledge-based systems, 02/2021, Letnik: 214
    Journal Article
    Recenzirano
    Odprti dostop

    Deep Deterministic Policy Gradient (DDPG) algorithm is one of the most well-known reinforcement learning methods. However, this method is inefficient and unstable in practical applications. On the ...
Celotno besedilo

PDF
1 2 3 4 5
zadetkov: 7.365

Nalaganje filtrov