This paper presents MiniZero, a zero-knowledge learning framework that supports four state-of-the-art algorithms: AlphaZero, MuZero, Gumbel AlphaZero, and Gumbel MuZero. While these algorithms have demonstrated super-human performance in many games, it remains unclear which among them is most suitable or efficient for specific tasks. Through MiniZero, we systematically evaluate the performance of each algorithm on two board games, 9x9 Go and 8x8 Othello, as well as 57 Atari games. For the two board games, using more simulations generally results in higher performance; however, the choice between AlphaZero and MuZero may differ based on game properties. For Atari games, both MuZero and Gumbel MuZero are worth considering: since each game has unique characteristics, different algorithms and simulation counts yield varying results. In addition, we introduce an approach, called progressive simulation, which progressively increases the simulation budget during training to allocate computation more efficiently. Our empirical results demonstrate that progressive simulation achieves significantly superior performance on the two board games. By making our framework and trained models publicly available, this paper contributes a benchmark for future research on zero-knowledge learning algorithms, assisting researchers in algorithm selection and comparison against these zero-knowledge learning baselines. Our code and data are available at https://rlg.iis.sinica.edu.tw/papers/minizero .
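As a rough illustration, a progressive simulation schedule might look like the following minimal sketch; the linear schedule, function name, and budget range are illustrative assumptions, not the paper's exact rule:

```python
def simulation_budget(iteration: int, total_iterations: int,
                      min_sims: int = 16, max_sims: int = 400) -> int:
    """Linearly interpolate the per-move MCTS simulation count over
    training (illustrative schedule, not the paper's exact rule)."""
    frac = iteration / max(1, total_iterations - 1)
    return int(min_sims + frac * (max_sims - min_sims))

# Early iterations search cheaply; later iterations search deeply.
for it in (0, 150, 299):
    print(it, simulation_budget(it, 300))
```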
This article aims to trace the evolution of intelligent agents capable of playing board games. It gives a brief historical review of the agents developed for various games and describes AlphaZero, the agent created by DeepMind, which is currently the most advanced agent in this area and is capable of defeating human champions at the game of Go, considered the most complex board game in existence, even more so than chess. It also discusses the natural next step for game-playing agents, given the success of AlphaZero and the emergence of the General Game Playing paradigm, which seeks to create agents capable of playing any board game without any human intervention.
Natural disasters such as storms usually cause significant damage to distribution grids. This paper investigates the optimal routing of utility vehicles to restore outages in the distribution grid as fast as possible after a storm. First, the post-storm repair crew dispatch task with multiple utility vehicles is formulated as a sequential stochastic optimization problem. In the formulated optimization model, the belief state of the power grid is updated according to phone calls from customers and information collected by utility vehicles. Second, an AlphaZero-based utility vehicle routing (AlphaZero-UVR) approach is developed to achieve real-time dispatching of the repair crews. The proposed AlphaZero-UVR approach combines stochastic Monte-Carlo tree search (MCTS) with deep neural networks to make lookahead search decisions, learning to navigate repair crews without human guidance. Simulation results show that the proposed approach can efficiently navigate crews to repair all outages.
• Formulating a post-storm repair crew dispatch model with multiple vehicles (a minimal belief-update sketch follows this list).
• Proposing an AlphaZero-based post-storm utility vehicle routing strategy.
• Modifying the original AlphaZero algorithm by combining a DNN with stochastic MCTS.
• The proposed DRL-based method outperforms traditional MCTS methods.
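To make the belief-state update concrete, here is a minimal sketch of a Bayesian per-node outage-belief update from customer phone calls; the likelihood values and the independence assumption across nodes are illustrative, not the paper's exact model:

```python
import numpy as np

def update_belief(prior: np.ndarray, call_received: np.ndarray,
                  p_call_given_outage: float = 0.7,
                  p_call_given_ok: float = 0.01) -> np.ndarray:
    """Bayesian update of per-node outage probabilities from customer
    calls (likelihoods are illustrative assumptions)."""
    like_out = np.where(call_received, p_call_given_outage, 1 - p_call_given_outage)
    like_ok = np.where(call_received, p_call_given_ok, 1 - p_call_given_ok)
    return like_out * prior / (like_out * prior + like_ok * (1 - prior))

belief = np.full(5, 0.3)            # prior outage probability per node
calls = np.array([1, 0, 1, 0, 0])   # which customers have called in
print(update_belief(belief, calls)) # posterior outage probabilities
```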
To address the issue that the quantum approximate optimization algorithm frequently encounters local minima, and the cost of parameter optimization within complex non-convex energy landscapes, we consider a warm-start method. This approach leverages a characteristic of transition states in the enhanced optimizer, namely descending along unique negative-curvature directions, to find smaller local minima. Our results indicate that, with the assistance of an enhanced pre-training structure based on the AlphaZero AI model, the initialization generalization ability of the new optimizer is significantly enhanced across various test sets. We train on 2-SAT training sets with clause densities between α ≈ 2.6 and α ≈ 2.89, and transfer to more complex test sets. The average residual energy density in transfer learning consistently remains below 0.01, even achieving a transfer success probability of 98% on hard instances with α ≈ 3.7. The search efficiency, pre-trained by ensemble learning, was significantly enhanced, while requiring only simple interpolation of a few transition points to transfer to the global optimal solutions at higher sample clause densities.
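For reference, the residual energy density quoted above is conventionally defined as the excess of the obtained energy over the ground-state energy, normalized by the problem size; the paper's exact normalization is an assumption here:

```latex
% Standard definition (the paper's normalization may differ):
\epsilon_{\mathrm{res}} = \frac{\langle E \rangle - E_{\min}}{N}
```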
Endgame studies have long served as a tool for testing human creativity and intelligence. We find that they can serve as a tool for testing machine ability as well. Two of the leading chess engines, Stockfish and Leela Chess Zero (LCZero), employ significantly different methods during play. We use Plaskett's Puzzle, a famous endgame study from the late 1970s, to compare the two engines. Our experiments show that Stockfish outperforms LCZero on the puzzle. We examine the algorithmic differences between the engines and use our observations as a basis for carefully interpreting the test results. Drawing inspiration from how humans solve chess problems, we ask whether machines can possess a form of imagination. On the theoretical side, we describe how Bellman's equation may be applied to optimize the probability of winning. To conclude, we discuss the implications of our work on artificial intelligence (AI) and artificial general intelligence (AGI), suggesting possible avenues for future research.
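The Bellman formulation the authors mention can be sketched as a standard win-probability recursion; this is a generic form, not necessarily the paper's exact notation:

```latex
% A(s) is the set of legal moves in position s, and s' the position
% reached after playing a. V(s) is the probability of eventually winning:
V(s) = \max_{a \in A(s)} \mathbb{E}\left[\, V(s') \mid s, a \,\right],
\qquad V(s) = 1 \text{ if } s \text{ is won},
\quad V(s) = 0 \text{ if } s \text{ is drawn or lost}.
```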
The purpose of this paper is to propose and develop a new conceptual framework for approximate Dynamic Programming (DP) and Reinforcement Learning (RL). This framework centers around two algorithms, which are designed largely independently of each other and operate in synergy through the powerful mechanism of Newton’s method. We call these the off-line training and the on-line play algorithms; the names are borrowed from some of the major successes of RL involving games. Primary examples are the recent (2017) AlphaZero program (which plays chess), and the similarly structured and earlier (1990s) TD-Gammon program (which plays backgammon). In these game contexts, the off-line training algorithm is the method used to teach the program how to evaluate positions and to generate good moves at any given position, while the on-line play algorithm is the method used to play in real time against human or computer opponents.
Both AlphaZero and TD-Gammon were trained off-line extensively using neural networks and an approximate version of the fundamental DP algorithm of policy iteration. Yet the AlphaZero player that was obtained off-line is not used directly during on-line play (it is too inaccurate due to approximation errors that are inherent in off-line neural network training). Instead a separate on-line player is used to select moves, based on multistep lookahead minimization and a terminal position evaluator that was trained using experience with the off-line player. The on-line player performs a form of policy improvement, which is not degraded by neural network approximations. As a result, it greatly improves the performance of the off-line player.
Similarly, TD-Gammon performs on-line a policy improvement step using one-step or two-step lookahead minimization, which is not degraded by neural network approximations. To this end it uses an off-line neural network-trained terminal position evaluator, and importantly it also extends its on-line lookahead by rollout (simulation with the one-step lookahead player that is based on the position evaluator).
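A minimal sketch of this lookahead-plus-rollout mechanism appears below; the function names are illustrative, and the alternation of opponents and dice rolls is omitted for brevity:

```python
def rollout_value(state, evaluator, successors, depth: int = 10):
    """Simulate a few steps with the greedy one-step-lookahead policy
    induced by the trained evaluator, then score the end state."""
    for _ in range(depth):
        moves = successors(state)
        if not moves:
            break
        # Greedy one-step lookahead: follow the evaluator's favorite successor.
        state = max(moves, key=evaluator)
    return evaluator(state)

def select_move(state, evaluator, successors):
    """One-step lookahead extended by rollout, in the spirit of TD-Gammon."""
    return max(successors(state),
               key=lambda s: rollout_value(s, evaluator, successors))
```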
An important lesson from AlphaZero and TD-Gammon is that the performance of an off-line trained policy can be greatly improved by on-line approximation in value space, with long lookahead (involving minimization or rollout with the off-line policy, or both), and terminal cost approximation that is obtained off-line. This performance enhancement is often dramatic and is due to a simple fact, which is grounded in algorithmic mathematics and is the focal point of this work:
(a) Approximation in value space with one-step lookahead minimization amounts to a step of Newton’s method for solving Bellman’s equation.
(b) The starting point for the Newton step is based on the results of off-line training, and may be enhanced by longer lookahead minimization and on-line rollout. Indeed the major determinant of the quality of the on-line policy is the Newton step that is performed on-line, while off-line training plays a secondary role by comparison.
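In the notation of the author's books, point (a) can be stated compactly; the following is a compressed sketch of the construction:

```latex
% Bellman's equation as a fixed-point problem, with T the Bellman operator:
J^* = T J^*, \qquad
(TJ)(x) = \min_{u \in U(x)} \mathbb{E}\big[\, g(x,u,w) + \alpha J\big(f(x,u,w)\big) \big].
% One-step lookahead from an off-line approximation \tilde J selects the
% policy \tilde\mu attaining the minimum:
T_{\tilde\mu} \tilde J = T \tilde J.
% The cost J_{\tilde\mu} of the lookahead policy is the fixed point of the
% linear operator T_{\tilde\mu}, i.e., the iterate produced by one Newton
% step for solving J = TJ, linearized at \tilde J:
J_{\tilde\mu} = T_{\tilde\mu} J_{\tilde\mu}.
```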
Significantly, the synergy between off-line training and on-line play also underlies Model Predictive Control (MPC), a major control system design methodology that has been extensively developed since the 1980s. This synergy can be understood in terms of abstract models of infinite horizon DP and simple geometrical constructions, and helps to explain the all-important stability issues within the MPC context. In this work we aim to provide insights (often based on visualization), which explain the beneficial effects of on-line decision making on top of off-line training. In the process, we will bring out the strong connections between the artificial intelligence view of RL, and the control theory views of MPC and adaptive control. While we will deemphasize mathematical proofs, there is considerable related analysis, which supports our conclusions and can be found in the author’s recent RL books (Bertsekas, 2019; Bertsekas, 2020), and the abstract DP monograph (Bertsekas, 2022).
One of our principal aims is to show, through the algorithmic ideas of Newton’s method and the unifying principles of abstract DP, that the AlphaZero/TD-Gammon methodology of approximation in value space and rollout applies very broadly to deterministic and stochastic optimal control problems, involving both discrete and continuous search spaces, as well as finite and infinite horizon.
On the Value of Chess Squares. Gupta, Aditya; Maharaj, Shiva; Polson, Nicholas. Entropy, Vol. 25, No. 10, 09/2023. Journal article, peer-reviewed, open access.
We propose a neural network-based approach to calculate the value of a chess square–piece combination. Our model takes a triplet (color, piece, square) as the input and calculates a value that measures the advantage/disadvantage of having this piece on this square. Our methods build on recent advances in chess AI, and can accurately assess the worth of positions in a game of chess. The conventional approach assigns fixed values to pieces (K = ∞, Q = 9, R = 5, B = 3, N = 3, P = 1). We enhance this analysis by introducing marginal valuations. We use deep Q-learning to estimate the parameters of our model. We demonstrate our method by examining the positioning of knights and bishops, and also provide valuable insights into the valuation of pawns. Finally, we conclude by suggesting potential avenues for future research.
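As a rough illustration of such a triplet-to-value mapping, here is a minimal sketch; the embedding sizes, layer widths, and class name are assumptions, not the paper's architecture:

```python
import torch
import torch.nn as nn

class SquareValueNet(nn.Module):
    """Illustrative network mapping a (color, piece, square) triplet
    to a scalar value (sizes are assumptions, not the paper's)."""
    def __init__(self):
        super().__init__()
        self.color = nn.Embedding(2, 4)     # white / black
        self.piece = nn.Embedding(6, 8)     # K, Q, R, B, N, P
        self.square = nn.Embedding(64, 16)  # a1 .. h8
        self.head = nn.Sequential(nn.Linear(28, 64), nn.ReLU(), nn.Linear(64, 1))

    def forward(self, color, piece, square):
        x = torch.cat([self.color(color), self.piece(piece), self.square(square)], dim=-1)
        return self.head(x).squeeze(-1)

net = SquareValueNet()
# Example query: a white knight (index 4) on d4 (square index 27).
value = net(torch.tensor([0]), torch.tensor([4]), torch.tensor([27]))
print(value.item())
```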
The Cross-Taiwan Strait Railway (CTSR) is a significant construction project, and automatic driving of high-speed trains (HSTs) on the CTSR is a crucial technology for its operation. Before the CTSR opens, it is forward-looking to conduct theoretical research. Inspired by AlphaZero, this paper combines expert experience and Newtonian mechanics to propose a method for automatically generating massive numbers of virtual driving curves for HSTs on the CTSR. To compress the solution space from infinite to finite, we set the driving acceleration and speed-limit boundary of HSTs from the perspective of expert experience. The model then combines big-data technology to generate massive numbers of virtual driving curves automatically and uses statistical analysis to select high-performance ones. We found that: 1) numerous virtual driving curves can be automatically generated according to different restricted speeds and line conditions, showing solid adaptability; 2) the driving curves cover a specific running-time range and reach all running times in that range, showing good ergodicity; 3) the running time of each driving curve is tracked in real time to within 0.1 s, showing high accuracy; and 4) many driving curves are available for each running time, allowing selection of the best-performing ones, showing good selectivity. In conclusion, this method can automatically generate virtual driving curves for HSTs on the CTSR, providing high-quality virtual data for future research on the automatic driving of HSTs on the CTSR.
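A minimal sketch of how a single constrained driving curve could be generated under such acceleration and speed-limit boundaries follows; the constants and the accelerate/cruise/brake structure are illustrative assumptions, while the paper generates massive numbers of such curves and filters them statistically:

```python
def driving_curve(speed_limit_kmh: float, accel: float, decel: float,
                  distance_m: float, dt: float = 0.1):
    """Generate one accelerate/cruise/brake speed profile under an
    acceleration bound and a speed limit, sampled every 0.1 s."""
    v_max = speed_limit_kmh / 3.6  # convert km/h to m/s
    t, x, v, profile = 0.0, 0.0, 0.0, []
    while x < distance_m:
        braking_dist = v ** 2 / (2 * decel)   # distance needed to stop from v
        if distance_m - x <= braking_dist:
            v = max(0.0, v - decel * dt)      # brake so the train arrives at rest
        else:
            v = min(v_max, v + accel * dt)    # accelerate up to the speed limit
        x += v * dt
        t += dt
        profile.append((round(t, 1), round(v, 3)))
    return profile

# Example: a 5 km section with a 300 km/h limit and 0.5 m/s^2 bounds.
curve = driving_curve(300.0, 0.5, 0.5, 5000.0)
print(f"{len(curve)} samples, arrival at t = {curve[-1][0]} s")
```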
Computer games have long been regarded as an important field of artificial intelligence (AI). The AlphaZero structure has been successful in the game of Go, beating top professional human players and becoming the baseline method in computer games. However, the AlphaZero training process requires tremendous computing resources, imposing additional difficulties on AlphaZero-based AI. In this paper, we propose NoGoZero+ to improve the AlphaZero process and apply it to NoGo, a game similar to Go. NoGoZero+ employs several innovative features to improve training speed and performance, and most of its improvement strategies can be transferred to other, nonspecific areas. This paper compares it with the original AlphaZero process; results show that NoGoZero+ increases training speed to about six times that of the original AlphaZero process. Moreover, in our experiments, our agent beat the original AlphaZero agent with a score of 81:19 after being trained on only 20,000 self-play games (a small quantity compared with the 120,000 self-play games consumed by the original AlphaZero). The NoGo program based on NoGoZero+ was the runner-up in the 2020 China Computer Game Championship (CCGC) with limited resources, defeating many AlphaZero-based programs. Our code, pretrained models, and self-play datasets are publicly available. The ultimate goal of this paper is to provide exploratory insights and mature auxiliary tools to enable AI researchers and computer-game communities to study, test, and improve these promising state-of-the-art methods at a much lower cost in computing resources.