Stochastic optimal control usually requires an explicit dynamical model with probability distributions, which are difficult to obtain in practice. In this work, we consider the linear quadratic ...regulator (LQR) problem of unknown linear systems and adopt a Wasserstein penalty to address the distribution uncertainty of additive stochastic disturbances. By constructing an equivalent deterministic game of the penalized LQR problem, we propose a Q-learning method with convergence guarantees to learn an optimal minimax controller.
Full text
Available for:
GEOZS, IJS, IMTLJ, KILJ, KISLJ, NLZOH, NUK, OILJ, PNG, SAZU, SBCE, SBJE, UILJ, UL, UM, UPCLJ, UPUK, ZAGLJ, ZRSKP
The isoline tracking of this work is concerned with the control design for a sensing vehicle to track a desired isoline of an unknown scalar field. To this end, we propose a simple proportional ...integral (PI)-like controller for a Dubins vehicle in the GPS-denied environment. Inspired by the sliding mode control method, our key idea lies in the design of an error term in the standard PI controller. For a circular field, we show that the P-like controller can globally regulate the vehicle to the desired isoline where the steady-state error can be arbitrarily reduced by increasing the P gain and eliminated by the PI-like controller. Moreover, the P-like controller is shown to achieve the local regulation for smooth fields. Finally, the effectiveness and advantages of our approach are validated via both simulations and experiments with a differential steering vehicle.
Full text
Available for:
GEOZS, IJS, IMTLJ, KILJ, KISLJ, NLZOH, NUK, OILJ, PNG, SAZU, SBCE, SBJE, UILJ, UL, UM, UPCLJ, UPUK, ZAGLJ, ZRSKP
The time-varying network topology can significantly affect the stability of multi-agent systems. This paper examines the stability of leader–follower multi-agent systems with general linear dynamics ...and switching network topologies, which have applications in the platooning of connected vehicles. The switching interaction topology is modeled as a class of directed graphs in order to describe the information exchange between multi-agent systems, where the eigenvalues of every associated matrix are required to be positive real. The Hurwitz criterion and the Riccati inequality are used to design a distributed control law and estimate the convergence speed of the closed-loop system. A sufficient condition is provided for the stability of multi-agent systems under switching topologies. A common Lyapunov function is formulated to prove closed-loop stability for the directed network with switching topologies. The result is applied to a typical cyber–physical system—that is, a connected vehicle platoon—which illustrates the effectiveness of the proposed method.
Full text
Available for:
GEOZS, IJS, IMTLJ, KILJ, KISLJ, NLZOH, NUK, OILJ, PNG, SAZU, SBCE, SBJE, UILJ, UL, UM, UPCLJ, UPUK, ZAGLJ, ZRSKP
This paper proposes distributed discrete-time algorithms to cooperatively solve an additive cost optimization problem in multiagent networks. The striking feature lies in the use of only the sign of ...relative state information between neighbors, which substantially differentiates our algorithms from others in the existing literature. We first interpret the proposed algorithms in terms of the penalty method in optimization theory and then perform nonasymptotic analysis to study convergence for static network graphs. Compared with the celebrated distributed subgradient algorithms, which, however, use the exact relative state information, the convergence speed is essentially not affected by the loss of information. We also study how introducing noise into the relative state information and randomly activated graphs affect the performance of our algorithms. Finally, we validate the theoretical results on a class of distributed quantile regression problems.
In this paper we investigate a general multi-level quantized filter of linear stochastic systems. For a given multi-level quantization and under the Gaussian assumption on the predicted density, a ...quantized innovations filter that achieves the minimum mean square error is derived. The filter is given in terms of quantization thresholds and a simple modified Riccati difference equation. By optimizing the filtering error covariance with respect to quantization thresholds, the associated optimal thresholds and the corresponding filter are obtained. Furthermore, the convergence of the filter to the standard Kalman filter is established. We also discuss the design of a robust minimax quantized filter when the innovation covariance is not exactly known. Simulation and experimental results illustrate the effectiveness and advantages of the proposed quantized filter.
Full text
Available for:
DOBA, IZUM, KILJ, NUK, OILJ, PILJ, PNG, SAZU, UILJ, UKNU, UL, UM, UPUK
This paper investigates a consensus design problem for continuous-time first-order multiagent systems with uniform constant communication delay.Provided that the agent dynamic is unstable and the ...diagraph is undirected,sufficient conditions are derived to guarantee consensus.The key technique is the adoption of historical input information in the protocol.Especially,when agent's own historical input information is used in the protocol design,the consensus condition is constructed in terms of agent dynamic,communication delay,and the eigenratio of the network topology.Simulation result is presented to validate the effectiveness of the theoretical result.
Full text
Available for:
EMUNI, FIS, FZAB, GEOZS, GIS, IJS, IMTLJ, KILJ, KISLJ, MFDPS, NLZOH, NUK, OBVAL, OILJ, PNG, SAZU, SBCE, SBJE, SBMB, SBNM, UKNU, UL, UM, UPUK, VKSCE, ZAGLJ
Information compression is essential to reduce communication cost in distributed optimization over peer-to-peer networks. This paper proposes a co mmunication-efficient l inearly convergent d ...istributed (COLD) algorithm to solve strongly convex optimization problems. By compressing innovation vectors, which are the differences between decision vectors and their estimates, COLD achieves linear convergence for a class of <inline-formula><tex-math notation="LaTeX">\delta</tex-math></inline-formula>-contracted compressors, and we explicitly quantify how the compression affects the convergence rate. Interestingly, our results strictly improve existing results for the quantized consensus problem. Numerical experiments demonstrate the advantages of COLD under different compressors.
This paper proposes a coordinate-free controller to drive a mobile robot to encircle a target at unknown position only using range-based measurements. Different from the existing works, a ...backstepping based controller is proposed to encircle the target with zero steady-state error for any desired smooth pattern. Moreover, we show its asymptotic exponential stability under a fixed set of parameters for the controller. The effectiveness and advantages of the proposed controller are validated via simulations.
Full text
Available for:
GEOZS, IJS, IMTLJ, KILJ, KISLJ, NLZOH, NUK, OILJ, PNG, SAZU, SBCE, SBJE, UILJ, UL, UM, UPCLJ, UPUK, ZAGLJ, ZRSKP
This paper considers a distributed convex optimization problem with inequality constraints over time-varying unbalanced digraphs, where the cost function is a sum of local objective functions, and ...each node of the graph only knows its local objective and inequality constraints. Although there is a vast body of literature on distributed optimization, most of them require the graph to be balanced, which is quite restrictive and not necessary. To solve it, this work proposes a novel idea of using the epigraph form of the constrained optimization, which can be easily used to study time-varying unbalanced digraphs. Under local communications, a simple iterative algorithm is then designed for each node. We prove that if the graph is uniformly jointly strongly connected, each node asymptotically converges to some common optimal solution.