Works (2)

Updated: February 22nd, 2025 05:01

2025 journal article

A Simple Finite-Time Analysis of TD Learning With Linear Function Approximation

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 70(2), 1388–1394.

By: A. Mitra n

author keywords: Temporal difference learning; Approximation algorithms; Standards; Function approximation; Convergence; Perturbation methods; Noise; Vectors; Heuristic algorithms; Delays; Finite-time analysis; reinforcement learning; stochastic approximation; temporal difference learning
topics (OpenAlex): Neural Networks and Applications
Source: Web Of Science
Added: February 17, 2025

2023 journal article

Federated TD Learning Over Finite-Rate Erasure Channels: Linear Speedup Under Markovian Sampling

IEEE CONTROL SYSTEMS LETTERS, 7, 2461–2466.

By: N. Dal Fabbro*, A. Mitra n & G. Pappas*

author keywords: Servers; Markov processes; Quantization (signal); Function approximation; Approximation algorithms; Reinforcement learning; Supervised learning; Machine learning; large-scale systems; communication networks
topics (OpenAlex): Distributed Control Multi-Agent Systems; Stability and Control of Uncertain Systems; Age of Information Optimization
TL;DR: This work proposes and analyze QFedTD - a quantized federated temporal difference learning algorithm with linear function approximation that highlights the effect of quantization and erasures on the convergence rate and establishes a linear speedup w.r.t. the number of agents under Markovian sampling. (via Semantic Scholar)
Source: Web Of Science
Added: August 7, 2023

Citation Index includes data from a number of different sources. If you have questions about the sources of data in the Citation Index or need a set of data which is free to re-distribute, please contact us.

Certain data included herein are derived from the Web of Science© and InCites© (2025) of Clarivate Analytics. All rights reserved. You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.