Updated on 2025.06.28
Usage instructions: here
AGENT
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-26 | Homogenization of Multi-agent Learning Dynamics in Finite-state Markov Games | Yann Kerzreho et.al. | 2506.21079 | null |
2025-06-26 | Evidence-based diagnostic reasoning with multi-agent copilot for human pathology | Chengkuan Chen et.al. | 2506.20964 | null |
2025-06-26 | LLM-guided Chemical Process Optimization with a Multi-Agent Approach | Tong Zeng et.al. | 2506.20921 | null |
2025-06-25 | MAGPIE: A dataset for Multi-AGent contextual PrIvacy Evaluation | Gurusha Juneja et.al. | 2506.20737 | null |
2025-06-25 | The Decrypto Benchmark for Multi-Agent Reasoning and Theory of Mind | Andrei Lupu et.al. | 2506.20664 | null |
2025-06-25 | Fine-Tuning and Prompt Engineering of LLMs, for the Creation of Multi-Agent AI for Addressing Sustainable Protein Production Challenges | Alexander D. Kalian et.al. | 2506.20598 | null |
2025-06-25 | SV-LLM: An Agentic Approach for SoC Security Verification using Large Language Models | Dipayan Saha et.al. | 2506.20415 | null |
2025-06-25 | A Visualization Framework for Exploring Multi-Agent-Based Simulations Case Study of an Electric Vehicle Home Charging Ecosystem | Kristoffer Christensen et.al. | 2506.20400 | null |
2025-06-25 | Language Modeling by Language Models | Junyan Cheng et.al. | 2506.20249 | null |
2025-06-25 | PSALM-V: Automating Symbolic Planning in Interactive Visual Environments with Large Language Models | Wang Bill Zhu et.al. | 2506.20097 | null |
2025-06-25 | From Conversation to Orchestration: HCI Challenges and Opportunities in Interactive Multi-Agentic Systems | Sarah Schömbs et.al. | 2506.20091 | null |
2025-06-24 | Learning Bilateral Team Formation in Cooperative Multi-Agent Reinforcement Learning | Koorosh Moslemi et.al. | 2506.20039 | null |
2025-06-24 | Automated Generation of Diverse Courses of Actions for Multi-Agent Operations using Binary Optimization and Graph Learning | Prithvi Poddar et.al. | 2506.20031 | null |
2025-06-24 | QHackBench: Benchmarking Large Language Models for Quantum Code Generation Using PennyLane Hackathon Challenges | Abdul Basit et.al. | 2506.20008 | null |
2025-06-24 | JoyAgents-R1: Joint Evolution Dynamics for Versatile Multi-LLM Agents with Reinforcement Learning | Ai Han et.al. | 2506.19846 | null |
2025-06-24 | MAM: Modular Multi-Agent Framework for Multi-Modal Medical Diagnosis via Role-Specialized Collaboration | Yucheng Zhou et.al. | 2506.19835 | null |
2025-06-24 | A Survey of Multi-sensor Fusion Perception for Embodied AI: Background, Methods, Challenges and Prospects | Shulan Ruan et.al. | 2506.19769 | null |
2025-06-24 | How trust networks shape students’ opinions about the proficiency of artificially intelligent assistants | Yutong Bu et.al. | 2506.19655 | null |
2025-06-24 | Adaptive Domain Modeling with Language Models: A Multi-Agent Approach to Task Planning | Harisankar Babu et.al. | 2506.19592 | null |
2025-06-24 | MATE: LLM-Powered Multi-Agent Translation Environment for Accessibility Applications | Aleksandr Algazinov et.al. | 2506.19502 | null |
2025-06-24 | LLM-based Multi-Agent System for Intelligent Refactoring of Haskell Code | Shahbaz Siddeeq et.al. | 2506.19481 | null |
2025-06-24 | Center of Gravity-Guided Focusing Influence Mechanism for Multi-Agent Reinforcement Learning | Yisak Park et.al. | 2506.19417 | null |
2025-06-24 | Augmenting Multi-Agent Communication with State Delta Trajectory | Yichen Tang et.al. | 2506.19209 | null |
2025-06-23 | Distilling Tool Knowledge into Language Models via Back-Translated Traces | Xingyue Huang et.al. | 2506.19171 | null |
2025-06-23 | Audit & Repair: An Agentic Framework for Consistent Story Visualization in Text-to-Image Diffusion Models | Kiymet Akdemir et.al. | 2506.18900 | null |
2025-06-23 | GRAND-SLAM: Local Optimization for Globally Consistent Large-Scale Multi-Agent Gaussian SLAM | Annika Thomas et.al. | 2506.18885 | null |
2025-06-23 | Multi-Agent Online Control with Adversarial Disturbances | Anas Barakat et.al. | 2506.18814 | null |
2025-06-23 | TRIZ Agents: A Multi-Agent LLM Approach for TRIZ-Based Innovation | Kamil Szczepanik et.al. | 2506.18783 | null |
2025-06-23 | MARL-MambaContour: Unleashing Multi-Agent Deep Reinforcement Learning for Active Contour Optimization in Medical Image Segmentation | Ruicheng Zhang et.al. | 2506.18679 | null |
2025-06-23 | MCN-SLAM: Multi-Agent Collaborative Neural SLAM with Hybrid Implicit Neural Scene Representation | Tianchen Deng et.al. | 2506.18678 | null |
2025-06-23 | Dual-level Behavioral Consistency for Inter-group and Intra-group Coordination in Multi-Agent Systems | Shuocun Yang et.al. | 2506.18651 | null |
2025-06-23 | Multi-Agent Reinforcement Learning for Inverse Design in Photonic Integrated Circuits | Yannik Mahlau et.al. | 2506.18627 | null |
2025-06-23 | Reply to “Emergent LLM behaviors are observationally equivalent to data leakage” | Ariel Flint Ashery et.al. | 2506.18600 | null |
2025-06-23 | Transformer World Model for Sample Efficient Multi-Agent Reinforcement Learning | Azad Deihim et.al. | 2506.18537 | null |
2025-06-20 | Scalable and Reliable Multi-agent Reinforcement Learning for Traffic Assignment | Leizhen Wang et.al. | 2506.17029 | null |
2025-06-20 | RAGentA: Multi-Agent Retrieval-Augmented Generation for Attributed Question Answering | Ines Besrour et.al. | 2506.16988 | null |
2025-06-20 | Integrating Traditional Technical Analysis with AI: A Multi-Agent LLM-Based Approach to Stock Market Forecasting | Michał Wawer et.al. | 2506.16813 | null |
2025-06-20 | Distributed Affine Formation Control of Linear Multi-agent Systems with Adaptive Event-triggering | Chenjun Liu et.al. | 2506.16797 | null |
2025-06-20 | A Scalable Post-Processing Pipeline for Large-Scale Free-Space Multi-Agent Path Planning with PiBT | Arjo Chakravarty et.al. | 2506.16748 | null |
2025-06-20 | Generalizable Agent Modeling for Agent Collaboration-Competition Adaptation with Multi-Retrieval and Dynamic Generation | Chenxu Wang et.al. | 2506.16718 | null |
2025-06-20 | Exploring Traffic Simulation and Cybersecurity Strategies Using Large Language Models | Lu Gao et.al. | 2506.16699 | null |
2025-06-19 | StoryWriter: A Multi-Agent Framework for Long Story Generation | Haotian Xia et.al. | 2506.16445 | null |
2025-06-19 | When Does Divide and Conquer Work for Long Context LLM? A Noise Decomposition Framework | Zhen Xu et.al. | 2506.16411 | null |
2025-06-19 | AGC-Drive: A Large-Scale Dataset for Real-World Aerial-Ground Collaboration in Driving Scenarios | Yunhao Hou et.al. | 2506.16371 | null |
2025-06-18 | SwarmAgentic: Towards Fully Automated Agentic System Generation via Swarm Intelligence | Yao Zhang et.al. | 2506.15672 | null |
2025-06-18 | PhishDebate: An LLM-Based Multi-Agent Framework for Phishing Website Detection | Wenhao Li et.al. | 2506.15656 | null |
2025-06-18 | The Effect of State Representation on LLM Agent Behavior in Dynamic Routing Games | Lyle Goodyear et.al. | 2506.15624 | null |
2025-06-18 | Multi-Agent, Multi-Scale Systems with the Koopman Operator | Craig Bakker et.al. | 2506.15589 | null |
2025-06-18 | Learning to flock in open space by avoiding collisions and staying together | Martino Brambati et.al. | 2506.15587 | null |
2025-06-18 | AgentGroupChat-V2: Divide-and-Conquer Is What LLM-Based Multi-Agent System Need | Zhouhong Gu et.al. | 2506.15451 | link |
2025-06-18 | Multi-Agent Reinforcement Learning for Autonomous Multi-Satellite Earth Observation: A Realistic Case Study | Mohamad A. Hady et.al. | 2506.15207 | null |
2025-06-18 | Local Differential Privacy for Distributed Stochastic Aggregative Optimization with Guaranteed Optimality | Ziqin Chen et.al. | 2506.15106 | null |
2025-06-17 | MEAL: A Benchmark for Continual Multi-Agent Reinforcement Learning | Tristan Tomilin et.al. | 2506.14990 | null |
2025-06-17 | Fair Algorithms with Probing for Multi-Agent Multi-Armed Bandits | Tianyi Xu et.al. | 2506.14988 | null |
2025-06-17 | Swarm-STL: A Framework for Motion Planning in Large-Scale, Multi-Swarm Systems | Shiyu Cheng et.al. | 2506.14749 | null |
2025-06-17 | Linear Planar 3-SAT and Its Applications in Planning | Victorien Desbois et.al. | 2506.14713 | null |
2025-06-17 | Factor-Graph-Based Passive Acoustic Navigation for Decentralized Cooperative Localization Using Bearing Elevation Depth Difference | Kalliyan Velasco et.al. | 2506.14690 | null |
2025-06-17 | A Novel Indicator for Quantifying and Minimizing Information Utility Loss of Robot Teams | Xiyu Zhao et.al. | 2506.14237 | null |
2025-06-17 | Xolver: Multi-Agent Reasoning with Holistic Experience Learning Just Like an Olympiad Team | Md Tanzib Hosain et.al. | 2506.14234 | null |
2025-06-17 | MAS-LitEval : Multi-Agent System for Literary Translation Quality Assessment | Junghwan Kim et.al. | 2506.14199 | null |
2025-06-17 | Hierarchical Multi-Agent Reinforcement Learning-based Coordinated Spatial Reuse for Next Generation WLANs | Jiaming Yu et.al. | 2506.14187 | null |
2025-06-17 | Light Aircraft Game : Basic Implementation and training results analysis | Hanzhong Cao et.al. | 2506.14164 | link |
2025-06-17 | StorySage: Conversational Autobiography Writing Powered by a Multi-Agent Framework | Shayan Talaei et.al. | 2506.14159 | null |
2025-06-17 | RadFabric: Agentic AI System with Reasoning Capability for Radiology | Wenting Chen et.al. | 2506.14142 | null |
2025-06-16 | MARCO: Hardware-Aware Neural Architecture Search for Edge Devices with Multi-Agent Reinforcement Learning and Conformal Prediction Filtering | Arya Fayyazi et.al. | 2506.13755 | null |
2025-06-16 | Agent Capability Negotiation and Binding Protocol (ACNBP) | Ken Huang et.al. | 2506.13590 | link |
2025-06-16 | Towards a Formal Specification for Self-organized Shape Formation in Swarm Robotics | YR Darr et.al. | 2506.13453 | null |
2025-06-16 | Dynamic Reinsurance Treaty Bidding via Multi-Agent Reinforcement Learning | Stella C. Dong et.al. | 2506.13113 | null |
2025-06-17 | Towards the Autonomous Optimization of Urban Logistics: Training Generative AI with Scientific Tools via Agentic Digital Twins and Model Context Protocol | Haowen Xu et.al. | 2506.13068 | link |
2025-06-16 | MAGIC: Multi-Agent Argumentation and Grammar Integrated Critiquer | Joaquin Jordan et.al. | 2506.13037 | null |
2025-06-15 | Distributed Composite Optimization with Sub-Weibull Noises | Zhan Yu et.al. | 2506.12901 | null |
2025-06-15 | Homeostatic Coupling for Prosocial Behavior | Naoto Yoshida et.al. | 2506.12894 | null |
2025-06-15 | WereWolf-Plus: An Update of Werewolf Game setting Based on DSGBench | Xinyuan Xia et.al. | 2506.12841 | null |
2025-06-15 | Multimodal Large Language Models-Enabled UAV Swarm: Towards Efficient and Intelligent Autonomous Aerial Systems | Yuqi Ping et.al. | 2506.12710 | null |
2025-06-13 | Revealing Political Bias in LLMs through Structured Multi-Agent Debate | Aishwarya Bandaru et.al. | 2506.11825 | link |
2025-06-13 | PE-MA: Parameter-Efficient Co-Evolution of Multi-Agent Systems | Yingfan Deng et.al. | 2506.11803 | null |
2025-06-13 | SEC-bench: Automated Benchmarking of LLM Agents on Real-World Software Security Tasks | Hwiwon Lee et.al. | 2506.11791 | link |
2025-06-13 | LLMs for Sentence Simplification: A Hybrid Multi-Agent prompting Approach | Pratibha Zunjare et.al. | 2506.11681 | null |
2025-06-13 | Robot Context Protocol (RCP): A Runtime-Agnostic Interface for Agent-Aware Robot Control | Lambert Lee et.al. | 2506.11650 | null |
2025-06-13 | AutoGen Driven Multi Agent Framework for Iterative Crime Data Analysis and Prediction | Syeda Kisaa Fatima et.al. | 2506.11475 | null |
2025-06-13 | Resolve Highway Conflict in Multi-Autonomous Vehicle Controls with Local State Attention | Xuan Duy Ta et.al. | 2506.11445 | null |
2025-06-12 | A Hybrid Adaptive Nash Equilibrium Solver for Distributed Multi-Agent Systems with Game-Theoretic Jump Triggering | Qiuyu Miao et.al. | 2506.11304 | null |
2025-06-12 | Shapley Machine: A Game-Theoretic Framework for N-Agent Ad Hoc Teamwork | Jianhong Wang et.al. | 2506.11285 | link |
2025-06-12 | Sensor Model Identification via Simultaneous Model Selection and State Variable Determination | Christian Brommer et.al. | 2506.11263 | null |
2025-06-12 | SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarks | Lianghong Guo et.al. | 2506.10954 | link |
2025-06-12 | CIIR@LiveRAG 2025: Optimizing Multi-Agent Retrieval Augmented Generation through Self-Training | Alireza Salemi et.al. | 2506.10844 | link |
2025-06-12 | Joint Beamforming with Extremely Large Scale RIS: A Sequential Multi-Agent A2C Approach | Zhi Chai et.al. | 2506.10815 | null |
2025-06-12 | SDialog: A Python Toolkit for Synthetic Dialogue Generation and Analysis | Sergio Burdisso et.al. | 2506.10622 | link |
2025-06-12 | AniMaker: Automated Multi-Agent Animated Storytelling with MCTS-Driven Clip Generation | Haoyuan Shi et.al. | 2506.10540 | null |
2025-06-12 | BugGen: A Self-Correcting Multi-Agent LLM Pipeline for Realistic RTL Bug Synthesis | Surya Jasper et.al. | 2506.10501 | null |
2025-06-12 | Specification and Evaluation of Multi-Agent LLM Systems – Prototype and Cybersecurity Applications | Felix Härer et.al. | 2506.10467 | link |
2025-06-12 | A Benchmark for Generalizing Across Diverse Team Strategies in Competitive Pokémon | Cameron Angliss et.al. | 2506.10326 | link |
2025-06-12 | WGSR-Bench: Wargame-based Game-theoretic Strategic Reasoning Benchmark for Large Language Models | Qiyue Yin et.al. | 2506.10264 | null |
2025-06-11 | AURA: A Multi-Agent Intelligence Framework for Knowledge-Enhanced Cyber Threat Attribution | Nanda Rani et.al. | 2506.10175 | null |
2025-06-11 | The Sample Complexity of Online Strategic Decision Making with Information Asymmetry and Knowledge Transportability | Jiachen Hu et.al. | 2506.09940 | null |
2025-06-11 | Intelligent Design 4.0: Paradigm Evolution Toward the Agentic AI Era | Shuo Jiang et.al. | 2506.09755 | null |
2025-06-11 | Patterns of Patterns III | Joseph Corneli et.al. | 2506.09696 | null |
2025-06-11 | Application-Driven Value Alignment in Agentic AI Systems: Survey and Perspectives | Wei Zeng et.al. | 2506.09656 | null |
2025-06-11 | Effective Red-Teaming of Policy-Adherent Agents | Itay Nakash et.al. | 2506.09600 | null |
2025-06-11 | ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning | Yu Sun et.al. | 2506.09513 | link |
2025-06-11 | Optimizing Cooperative Multi-Object Tracking using Graph Signal Processing | Maria Damanaki et.al. | 2506.09469 | null |
2025-06-11 | When Is Diversity Rewarded in Cooperative Multi-Agent Learning? | Michael Amir et.al. | 2506.09434 | null |
2025-06-11 | Intelligent System of Emergent Knowledge: A Coordination Fabric for Billions of Minds | Moshi Wei et.al. | 2506.09335 | null |
2025-06-11 | Multi-Agent Language Models: Advancing Cooperation, Coordination, and Adaptation | Arjun Vaithilingam Sudhakar et.al. | 2506.09331 | null |
2025-06-10 | VIKI-R: Coordinating Embodied Multi-Agent Cooperation via Reinforcement Learning | Li Kang et.al. | 2506.09049 | null |
2025-06-10 | Agentic Neural Networks: Self-Evolving Multi-Agent Systems via Textual Backpropagation | Xiaowen Ma et.al. | 2506.09046 | null |
2025-06-10 | MasHost Builds It All: Autonomous Multi-Agent System Directed by Reinforcement Learning | Kuo Yang et.al. | 2506.08507 | null |
2025-06-10 | CAF-I: A Collaborative Multi-Agent Framework for Enhanced Irony Detection with Large Language Models | Ziqi. Liu et.al. | 2506.08430 | null |
2025-06-11 | TACTIC: Translation Agents with Cognitive-Theoretic Interactive Collaboration | Weiya Li et.al. | 2506.08403 | link |
2025-06-10 | Reinforce LLM Reasoning through Multi-Agent Reflection | Yurun Yuan et.al. | 2506.08379 | null |
2025-06-10 | Reinforcement Fine-Tuning for Reasoning towards Multi-Step Multi-Source Search in Large Language Models | Wentao Shi et.al. | 2506.08352 | link |
2025-06-11 | HiBerNAC: Hierarchical Brain-emulated Robotic Neural Agent Collective for Disentangling Complex Manipulation | Hongjun Wu et.al. | 2506.08296 | null |
2025-06-09 | From Debate to Equilibrium: Belief-Driven Multi-Agent LLM Reasoning via Bayesian Nash Equilibrium | Xie Yi et.al. | 2506.08292 | link |
2025-06-09 | Ego-centric Learning of Communicative World Models for Autonomous Driving | Hang Wang et.al. | 2506.08149 | null |
2025-06-09 | Supporting Construction Worker Well-Being with a Multi-Agent Conversational AI System | Fan Yang et.al. | 2506.07997 | null |
2025-06-09 | A distributed motion planning approach to cooperative underwater acoustic source tracking and pursuit | Andrea Tiranti et.al. | 2506.07877 | null |
2025-06-09 | Decentralizing Multi-Agent Reinforcement Learning with Temporal Causal Information | Jan Corazza et.al. | 2506.07829 | null |
2025-06-09 | Deep Equivariant Multi-Agent Control Barrier Functions | Nikolaos Bousias et.al. | 2506.07755 | null |
2025-06-09 | Delay Optimization in Remote ID-Based UAV Communication via BLE and Wi-Fi Switching | Yian Zhu et.al. | 2506.07715 | null |
2025-06-09 | QUITE: A Query Rewrite System Beyond Rules with LLM Agents | Yuyang Song et.al. | 2506.07675 | null |
2025-06-09 | Blending Participatory Design and Artificial Awareness for Trustworthy Autonomous Vehicles | Ana Tanevska et.al. | 2506.07633 | null |
2025-06-09 | MalGEN: A Generative Agent Framework for Modeling Malicious Software in Cybersecurity | Bikash Saha et.al. | 2506.07586 | null |
2025-06-10 | SAFEFLOW: A Principled Protocol for Trustworthy and Transactional Autonomous Agent Systems | Peiran Li et.al. | 2506.07564 | null |
2025-06-09 | Curriculum Learning With Counterfactual Group Relative Policy Advantage For Multi-Agent Reinforcement Learning | Weiqiang Jin et.al. | 2506.07548 | link |
2025-06-06 | A Theoretical Study of (Hyper) Self-Attention through the Lens of Interactions: Representation, Training, Generalization | Muhammed Ustaomeroglu et.al. | 2506.06179 | null |
2025-06-06 | Does It Run and Is That Enough? Revisiting Text-to-Chart Generation with a Multi-Agent Approach | James Ford et.al. | 2506.06175 | null |
2025-06-06 | On-board Mission Replanning for Adaptive Cooperative Multi-Robot Systems | Elim Kwan et.al. | 2506.06094 | null |
2025-06-06 | Conversational Interfaces for Parametric Conceptual Architectural Design: Integrating Mixed Reality with LLM-driven Interaction | Ruochen Ji et.al. | 2506.06066 | null |
2025-06-06 | Modeling human reputation-seeking behavior in a spatio-temporally complex public good provision game | Edward Hughes et.al. | 2506.06032 | null |
2025-06-06 | When to Trust Context: Self-Reflective Debates for Context Reliability | Zeqi Zhou et.al. | 2506.06020 | null |
2025-06-06 | Policy Optimization for Continuous-time Linear-Quadratic Graphon Mean Field Games | Philipp Plank et.al. | 2506.05894 | null |
2025-06-06 | MAPLE: Multi-Agent Adaptive Planning with Long-Term Memory for Table Reasoning | Ye Bai et.al. | 2506.05813 | null |
2025-06-05 | SocialDF: Benchmark Dataset and Detection Model for Mitigating Harmful Deepfake Content on Social Media Platforms | Arnesh Batra et.al. | 2506.05538 | link |
2025-06-05 | Sequence Modeling for N-Agent Ad Hoc Teamwork | Caroline Wang et.al. | 2506.05527 | null |
2025-06-05 | Teaming in the AI Era: AI-Augmented Frameworks for Forming, Simulating, and Optimizing Human Teams | Mohammed Almutairi et.al. | 2506.05265 | null |
2025-06-05 | Towards Language-Augmented Multi-Agent Deep Reinforcement Learning | Maxime Toquebiau et.al. | 2506.05236 | null |
2025-06-05 | A Framework for Ethical Judgment of Smart City Applications | Weichen Shi et.al. | 2506.05172 | null |
2025-06-05 | LLM-Guided Scenario-based GUI Testing | Shengcheng Yu et.al. | 2506.05079 | null |
2025-06-05 | ComfyUI-Copilot: An Intelligent Assistant for Automated Workflow Development | Zhenran Xu et.al. | 2506.05010 | link |
2025-06-05 | Towards a Multi-Agent Simulation of Cyber-attackers and Cyber-defenders Battles | Julien Soulé et.al. | 2506.04849 | null |
2025-06-05 | Agents of Change: Self-Evolving LLM Agents for Strategic Planning | Nikolas Belle et.al. | 2506.04651 | null |
2025-06-05 | Advancing Tool-Augmented Large Language Models via Meta-Verification and Reflection Learning | Zhiyuan Ma et.al. | 2506.04625 | null |
2025-06-05 | Demonstrations of Integrity Attacks in Multi-Agent Systems | Can Zheng et.al. | 2506.04572 | null |
2025-06-05 | OpenAg: Democratizing Agricultural Intelligence | Srikanth Thudumu et.al. | 2506.04571 | null |
2025-06-04 | Thinking Beyond Visibility: A Near-Optimal Policy Framework for Locally Interdependent Multi-Agent MDPs | Alex DeWeese et.al. | 2506.04215 | null |
2025-06-04 | MACS: Multi-Agent Reinforcement Learning for Optimization of Crystal Structures | Elena Zamaraeva et.al. | 2506.04195 | null |
2025-06-04 | TRiSM for Agentic AI: A Review of Trust, Risk, and Security Management in LLM-based Agentic Multi-Agent Systems | Shaina Raza et.al. | 2506.04133 | null |
2025-06-04 | CLAIM: An Intent-Driven Multi-Agent Framework for Analyzing Manipulation in Courtroom Dialogues | Disha Sheshanarayana et.al. | 2506.04131 | null |
2025-06-04 | Graph Counselor: Adaptive Graph Exploration via Multi-Agent Synergy to Enhance LLM Reasoning | Junqi Gao et.al. | 2506.03939 | link |
2025-06-04 | PulseReddit: A Novel Reddit Dataset for Benchmarking MAS in High-Frequency Cryptocurrency Trading | Qiuhan Han et.al. | 2506.03861 | null |
2025-06-04 | From Theory to Practice: Real-World Use Cases on Trustworthy LLM-Driven Process Modeling, Prediction and Automation | Peter Pfeiffer et.al. | 2506.03801 | null |
2025-06-04 | A Retrieval-Augmented Multi-Agent Framework for Psychiatry Diagnosis | Mengxi Xiao et.al. | 2506.03750 | link |
2025-06-04 | Joint Beamforming and Resource Allocation for Delay Optimization in RIS-Assisted OFDM Systems: A DRL Approach | Yu Ma et.al. | 2506.03586 | null |
2025-06-04 | From Virtual Agents to Robot Teams: A Multi-Robot Framework Evaluation in High-Stakes Healthcare Context | Yuanchen Bai et.al. | 2506.03546 | null |
2025-06-03 | MAEBE: Multi-Agent Emergent Behavior Framework | Sinem Erisken et.al. | 2506.03053 | null |
2025-06-03 | Coding Agents with Multimodal Browsing are Generalist Problem Solvers | Aditya Bharat Soni et.al. | 2506.03011 | null |
2025-06-03 | A Multi-Agent Framework for Mitigating Dialect Biases in Privacy Policy Question-Answering Systems | Đorđe Klisura et.al. | 2506.02998 | null |
2025-06-03 | Mapping Student-AI Interaction Dynamics in Multi-Agent Learning Environments: Supporting Personalised Learning and Reducing Performance Gaps | Zhanxin Hao et.al. | 2506.02993 | null |
2025-06-03 | Mitigating Manipulation and Enhancing Persuasion: A Reflective Multi-Agent Approach for Legal Argument Generation | Li Zhang et.al. | 2506.02992 | null |
2025-06-03 | Adaptive Graph Pruning for Multi-Agent Communication | Boyi Li et.al. | 2506.02951 | null |
2025-06-04 | A Multi-agent LLM-based JUnit Test Generation with Strong Oracles | Qinghua Xu et.al. | 2506.02943 | null |
2025-06-03 | ATAG: AI-Agent Application Threat Assessment with Attack Graphs | Parth Atulbhai Gandhi et.al. | 2506.02859 | null |
2025-06-03 | Ensemble-MIX: Enhancing Sample Efficiency in Multi-Agent RL Using Ensemble Methods | Tom Danino et.al. | 2506.02841 | null |
2025-06-03 | Why do AI agents communicate in human language? | Pengcheng Zhou et.al. | 2506.02739 | null |
2025-05-30 | Multiple LLM Agents Debate for Equitable Cultural Alignment | Dayeon Ki et.al. | 2505.24671 | link |
2025-05-30 | Distributed Intelligence in the Computing Continuum with Active Inference | Victor Casamayor Pujol et.al. | 2505.24618 | null |
2025-05-30 | NexusSum: Hierarchical LLM Agents for Long-Form Narrative Summarization | Hyuntak Kim et.al. | 2505.24575 | null |
2025-05-30 | CREFT: Sequential Multi-Agent LLM for Character Relation Extraction | Ye Eun Chun et.al. | 2505.24553 | null |
2025-05-30 | RMoA: Optimizing Mixture-of-Agents through Diversity Maximization and Residual Compensation | Zhentao Xie et.al. | 2505.24442 | null |
2025-05-30 | R3DM: Enabling Role Discovery and Diversity Through Dynamics Models in Multi-agent Reinforcement Learning | Harsh Goel et.al. | 2505.24265 | link |
2025-05-30 | An Adversary-Resistant Multi-Agent LLM System via Credibility Scoring | Sana Ebrahimi et.al. | 2505.24239 | null |
2025-05-30 | SentinelAgent: Graph-based Anomaly Detection in Multi-Agent Systems | Xu He et.al. | 2505.24201 | null |
2025-05-30 | Biological Pathway Guided Gene Selection Through Collaborative Reinforcement Learning | Ehtesamul Azim et.al. | 2505.24155 | link |
2025-05-30 | Distributed Neural Policy Gradient Algorithm for Global Convergence of Networked Multi-Agent Reinforcement Learning | Pengcheng Dai et.al. | 2505.24113 | null |
2025-05-29 | From Connectivity to Autonomy: The Dawn of Self-Evolving Communication Systems | Zeinab Nezami et.al. | 2505.23710 | null |
2025-05-29 | Data-to-Dashboard: Multi-Agent LLM Framework for Insightful Visualization in Enterprise Analytics | Ran Zhang et.al. | 2505.23695 | link |
2025-05-29 | ROTATE: Regret-driven Open-ended Training for Ad Hoc Teamwork | Caroline Wang et.al. | 2505.23686 | link |
2025-05-29 | MAPLE: A Mobile Assistant with Persistent Finite State Machines for Recovery Reasoning | Linqiang Guo et.al. | 2505.23596 | null |
2025-05-29 | Agent Interpolation for Knowledge | Marta Bílková et.al. | 2505.23401 | null |
2025-05-29 | GAM-Agent: Game-Theoretic and Uncertainty-Aware Collaboration for Complex Visual Reasoning | Jusheng Zhang et.al. | 2505.23399 | null |
2025-05-29 | Understanding the Information Propagation Effects of Communication Topologies in LLM-based Multi-Agent Systems | Xu Shen et.al. | 2505.23352 | link |
2025-05-29 | Wireless Agentic AI with Retrieval-Augmented Multimodal Semantic Perception | Guangyuan Liu et.al. | 2505.23275 | null |
2025-05-29 | Cross-Task Experiential Learning on LLM-based Multi-Agent Collaboration | Yilong Li et.al. | 2505.23187 | null |
2025-05-29 | MenTeR: A fully-automated Multi-agenT workflow for end-to-end RF/Analog Circuits Netlist Design | Pin-Han Chen et.al. | 2505.22990 | null |
2025-05-28 | HDDLGym: A Tool for Studying Multi-Agent Hierarchical Problems Defined in HDDL with OpenAI Gym | Ngoc La et.al. | 2505.22597 | link |
2025-05-29 | Topological Structure Learning Should Be A Research Priority for LLM-Based Multi-Agent Systems | Jiaxi Yang et.al. | 2505.22467 | null |
2025-05-28 | AgentDNS: A Root Domain Naming System for LLM Agents | Enfang Cui et.al. | 2505.22368 | null |
2025-05-28 | From Large AI Models to Agentic AI: A Tutorial on Future Intelligent Communications | Feibo Jiang et.al. | 2505.22311 | null |
2025-05-28 | Voice CMS: updating the knowledge base of a digital assistant through conversation | Grzegorz Wolny et.al. | 2505.22303 | null |
2025-05-28 | Efficient Leave-one-out Approximation in LLM Multi-agent Debate Based on Introspection | Yue Cui et.al. | 2505.22192 | null |
2025-05-28 | MONSTR: Model-Oriented Neutron Strain Tomographic Reconstruction | Mohammad Samin Nur Chowdhury et.al. | 2505.22187 | null |
2025-05-28 | Oryx: a Performant and Scalable Algorithm for Many-Agent Coordination in Offline MARL | Claude Formanek et.al. | 2505.22151 | null |
2025-05-28 | AudioGenie: A Training-Free Multi-Agent Framework for Diverse Multimodality-to-Multiaudio Generation | Yan Rong et.al. | 2505.22053 | null |
2025-05-28 | Reward-Independent Messaging for Decentralized Multi-Agent Reinforcement Learning | Naoto Yoshida et.al. | 2505.21985 | null |
2025-05-27 | Silence is Not Consensus: Disrupting Agreement Bias in Multi-Agent LLMs via Catfish Agent for Clinical Decision Making | Yihan Wang et.al. | 2505.21503 | null |
2025-05-27 | Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers | Wei Pang et.al. | 2505.21497 | link |
2025-05-27 | Robust Hypothesis Generation: LLM-Automated Language Bias for Inductive Logic Programming | Yang Yang et.al. | 2505.21486 | null |
2025-05-27 | Scaling External Knowledge Input Beyond Context Windows of LLMs via Multi-Agent Collaboration | Zijun Liu et.al. | 2505.21471 | link |
2025-05-27 | Evaluating LLM Adaptation to Sociodemographic Factors: User Profile vs. Dialogue History | Qishuai Zhong et.al. | 2505.21362 | link |
2025-05-27 | Large Language Models Miss the Multi-Agent Mark | Emanuele La Malfa et.al. | 2505.21298 | null |
2025-05-27 | Breaking the Performance Ceiling in Complex Reinforcement Learning requires Inference Strategies | Felix Chalumeau et.al. | 2505.21236 | null |
2025-05-27 | Creativity in LLM-based Multi-Agent Systems: A Survey | Yi-Cheng Lin et.al. | 2505.21116 | null |
2025-05-27 | Simulating Ethics: Using LLM Debate Panels to Model Deliberation on Medical Dilemmas | Hazem Zohny et.al. | 2505.21112 | null |
2025-05-27 | RefAV: Towards Planning-Centric Scenario Mining | Cainan Davidson et.al. | 2505.20981 | link |
2025-05-26 | MA-RAG: Multi-Agent Retrieval-Augmented Generation via Collaborative Chain-of-Thought Reasoning | Thang Nguyen et.al. | 2505.20096 | null |
2025-05-26 | Multi-Agent Reinforcement Learning in Cybersecurity: From Fundamentals to Applications | Christoph R. Landolt et.al. | 2505.19837 | null |
2025-05-26 | SecVulEval: Benchmarking LLMs for Real-World C/C++ Vulnerability Detection | Md Basim Uddin Ahmed et.al. | 2505.19828 | link |
2025-05-26 | Select, Read, and Write: A Multi-Agent Framework of Full-Text-based Related Work Generation | Xiaochuan Liu et.al. | 2505.19647 | link |
2025-05-26 | Adaptive Episode Length Adjustment for Multi-agent Reinforcement Learning | Byunghyun Yoo et.al. | 2505.19637 | null |
2025-05-26 | DoctorAgent-RL: A Multi-Agent Collaborative Reinforcement Learning System for Multi-Turn Clinical Dialogue | Yichun Feng et.al. | 2505.19630 | link |
2025-05-26 | Multi-Agent Collaboration via Evolving Orchestration | Yufan Dang et.al. | 2505.19591 | null |
2025-05-26 | LLM-Agent-Controller: A Universal Multi-Agent Large Language Model System as a Control Engineer | Rasoul Zahedifar et.al. | 2505.19567 | null |
2025-05-26 | AMQA: An Adversarial Dataset for Benchmarking Bias of LLMs in Medicine and Healthcare | Ying Xiao et.al. | 2505.19562 | link |
2025-05-26 | DoctorRAG: Medical RAG Fusing Knowledge with Patient Analogy through Textual Gradients | Yuxing Lu et.al. | 2505.19538 | null |
2025-05-23 | ManuSearch: Democratizing Deep Search in Large Language Models with a Transparent and Open Multi-Agent Framework | Lisheng Huang et.al. | 2505.18105 | link |
2025-05-23 | Survival Games: Human-LLM Strategic Showdowns under Severe Resource Scarcity | Zhihong Chen et.al. | 2505.17937 | link |
2025-05-23 | Integrating Counterfactual Simulations with Language Models for Explaining Multi-Agent Behaviour | Bálint Gyevnár et.al. | 2505.17801 | null |
2025-05-23 | URB – Urban Routing Benchmark for RL-equipped Connected Autonomous Vehicles | Ahmet Onur Akman et.al. | 2505.17734 | null |
2025-05-23 | Learning Equilibria from Data: Provably Efficient Multi-Agent Imitation Learning | Till Freihaut et.al. | 2505.17610 | null |
2025-05-23 | Probe by Gaming: A Game-based Benchmark for Assessing Conceptual Knowledge in LLMs | Shuhang Xu et.al. | 2505.17512 | null |
2025-05-23 | Multi-agent Systems for Misinformation Lifecycle : Detection, Correction And Source Identification | Aditya Gautam et.al. | 2505.17511 | null |
2025-05-23 | PD $^3$ : A Project Duplication Detection Framework via Adapted Multi-Agent Debate | Dezheng Bao et.al. | 2505.17492 | null |
2025-05-23 | LLM-BSCVM: An LLM-Based Blockchain Smart Contract Vulnerability Management Framework | Yanli Jin et.al. | 2505.17416 | link |
2025-05-22 | A Survey of Safe Reinforcement Learning and Constrained MDPs: A Technical Survey on Single-Agent and Multi-Agent Safety | Ankita Kushwaha et.al. | 2505.17342 | null |
2025-05-22 | SpatialScore: Towards Unified Evaluation for Multimodal Spatial Understanding | Haoning Wu et.al. | 2505.17012 | link |
2025-05-22 | X-MAS: Towards Building Multi-Agent Systems with Heterogeneous LLMs | Rui Ye et.al. | 2505.16997 | link |
2025-05-22 | MASLab: A Unified and Comprehensive Codebase for LLM-based Multi-Agent Systems | Rui Ye et.al. | 2505.16988 | link |
2025-05-22 | Know the Ropes: A Heuristic Strategy for LLM-based Multi-Agent System Design | Zhenkun Li et.al. | 2505.16979 | null |
2025-05-22 | SWE-Dev: Evaluating and Training Autonomous Feature-Driven Software Development | Yaxin Du et.al. | 2505.16975 | link |
2025-05-22 | NovelSeek: When Agent Becomes the Scientist – Building Closed-Loop System from Hypothesis to Verification | NovelSeek Team et.al. | 2505.16938 | link |
2025-05-22 | RealEngine: Simulating Autonomous Driving in Realistic Context | Junzhe Jiang et.al. | 2505.16902 | link |
2025-05-22 | From EduVisBench to EduVisAgent: A Benchmark and Multi-Agent Framework for Pedagogical Visualization | Haonian Ji et.al. | 2505.16832 | link |
2025-05-22 | Fuzzy Information Evolution with Three-Way Decision in Social Network Group Decision-Making | Qianlei Jia et.al. | 2505.16781 | null |
2025-05-22 | Large Language Model-Empowered Interactive Load Forecasting | Yu Zuo et.al. | 2505.16577 | null |
2025-05-21 | DEBATE, TRAIN, EVOLVE: Self Evolution of Language Model Reasoning | Gaurav Srivastava et.al. | 2505.15734 | null |
2025-05-21 | Improved power methods for computing eigenvalues of dual quaternion Hermitian matrices | Yongjun Chen et.al. | 2505.15584 | null |
2025-05-21 | Temporal Spectrum Cartography in Low-Altitude Economy Networks: A Generative AI Framework with Multi-Agent Learning | Changyuan Zhao et.al. | 2505.15571 | null |
2025-05-21 | AGENT-X: Adaptive Guideline-based Expert Network for Threshold-free AI-generated teXt detection | Jiatao Li et.al. | 2505.15261 | null |
2025-05-21 | R&D-Agent-Quant: A Multi-Agent Framework for Data-Centric Factors and Model Joint Optimization | Yuante Li et.al. | 2505.15155 | link |
2025-05-21 | Multicrossmodal Automated Agent for Integrating Diverse Materials Science Data | Adib Bazgir et.al. | 2505.15132 | null |
2025-05-21 | Agentic Feature Augmentation: Unifying Selection and Generation with Teaming, Planning, and Memories | Nanxu Gong et.al. | 2505.15076 | null |
2025-05-21 | ModelingAgent: Bridging LLMs and Mathematical Modeling for Real-World Challenges | Cheng Qian et.al. | 2505.15068 | link |
2025-05-21 | PiFlow: Principle-aware Scientific Discovery with Multi-Agent Collaboration | Yingming Pu et.al. | 2505.15047 | link |
2025-05-21 | Meta-Design Matters: A Self-Design Multi-Agent System | Zixuan Ke et.al. | 2505.14996 | null |
2025-05-20 | Agent Context Protocols Enhance Collective Inference | Devansh Bhardwaj et.al. | 2505.14569 | null |
2025-05-20 | Multi-agent Reinforcement Learning vs. Fixed-Time Control for Traffic Signal Optimization: A Simulation Study | Saahil Mahato et.al. | 2505.14544 | link |
2025-05-21 | Robustness Evaluation of Graph-based News Detection Using Network Structural Information | Xianghua Zeng et.al. | 2505.14453 | null |
2025-05-20 | Empowering LLMs in Task-Oriented Dialogues: A Domain-Independent Multi-Agent Framework and Fine-Tuning Strategy | Zihao Feng et.al. | 2505.14299 | null |
2025-05-20 | MAS-KCL: Knowledge component graph structure learning with large language model-based agentic workflow | Yuan-Hao Jiang et.al. | 2505.14126 | null |
2025-05-20 | Personalized and Resilient Distributed Learning Through Opinion Dynamics | Luca Ballotta et.al. | 2505.14081 | null |
2025-05-20 | Divide by Question, Conquer by Agent: SPLIT-RAG with Question-Driven Graph Partitioning | Ruiyi Yang et.al. | 2505.13994 | null |
2025-05-20 | CAFES: A Collaborative Multi-Agent Framework for Multi-Granular Multimodal Essay Scoring | Jiamin Su et.al. | 2505.13965 | null |
2025-05-20 | MultiDrive: A Co-Simulation Framework Bridging 2D and 3D Driving Simulation for AV Software Validation | Marc Kaufeld et.al. | 2505.13959 | link |
2025-05-20 | MLZero: A Multi-Agent System for End-to-end Machine Learning Automation | Haoyang Fang et.al. | 2505.13941 | link |
2025-05-19 | Robin: A multi-agent system for automating scientific discovery | Ali Essam Ghareeb et.al. | 2505.13400 | null |
2025-05-19 | Synthesis of Communication Policies for Multi-Agent Systems Robust to Communication Restrictions | Saleh Soudijani et.al. | 2505.13311 | null |
2025-05-19 | Hybrid Voting-Based Task Assignment in Modular Construction Scenarios | Daniel Weiner et.al. | 2505.13278 | null |
2025-05-19 | Agentic Publications: An LLM-Driven Framework for Interactive Scientific Publishing, Supplementing Traditional Papers with AI-Powered Knowledge Systems | Roberto Pugliese et.al. | 2505.13246 | null |
2025-05-19 | Information Science Principles of Machine Learning: A Causal Chain Meta-Framework Based on Formalized Information Mapping | Jianfeng Xu et.al. | 2505.13182 | null |
2025-05-19 | Adversarial Reasoning for Repair Based on Inferred Program Intent | He Ye et.al. | 2505.13008 | null |
2025-05-19 | The Traitors: Deception and Trust in Multi-Agent Language Model Simulations | Pedro M. P. Curvo et.al. | 2505.12923 | link |
2025-05-19 | From Grunts to Grammar: Emergent Language from Cooperative Foraging | Maytus Piriyajitakonkij et.al. | 2505.12872 | null |
2025-05-19 | Reasoning BO: Enhancing Bayesian Optimization with Long-Context Reasoning Power of LLMs | Zhuo Yang et.al. | 2505.12833 | null |
2025-05-19 | Dynamic Sight Range Selection in Multi-Agent Reinforcement Learning | Wei-Chen Liao et.al. | 2505.12811 | null |
2025-05-16 | Signal attenuation enables scalable decentralized multi-agent reinforcement learning over networks | Wesley A Suttle et.al. | 2505.11461 | null |
2025-05-16 | Explaining Strategic Decisions in Multi-Agent Reinforcement Learning for Aerial Combat Tactics | Ardian Selmonaj et.al. | 2505.11311 | null |
2025-05-16 | Bidirectional Distillation: A Mixed-Play Framework for Multi-Agent Generalizable Behaviors | Lang Feng et.al. | 2505.11100 | null |
2025-05-16 | Time Travel is Cheating: Going Live with DeepFund for Real-Time Fund Investment Benchmarking | Changlun Li et.al. | 2505.11065 | link |
2025-05-16 | Review-Instruct: A Review-Driven Multi-Turn Conversations Generation Method for Large Language Models | Jiangxu Wu et.al. | 2505.11010 | null |
2025-05-16 | Let the Trial Begin: A Mock-Court Approach to Vulnerability Detection using LLM-Based Agents | Ratnadira Widyasari et.al. | 2505.10961 | null |
2025-05-16 | Connecting the Dots: A Chain-of-Collaboration Prompting Framework for LLM Agents | Jiaxing Zhao et.al. | 2505.10936 | null |
2025-05-16 | Vaiage: A Multi-Agent Solution to Personalized Travel Planning | Binwen Liu et.al. | 2505.10922 | null |
2025-05-15 | Towards an LLM-powered Social Digital Twinning Platform | Önder Gürcan et.al. | 2505.10681 | null |
2025-05-15 | Agent Name Service (ANS): A Universal Directory for Secure AI Agent Discovery and Interoperability | Ken Huang et.al. | 2505.10609 | null |
2025-05-15 | Fixing Incomplete Value Function Decomposition for Multi-Agent Reinforcement Learning | Andrea Baisero et.al. | 2505.10484 | null |
2025-05-15 | Fine-tuning Diffusion Policies with Backpropagation Through Diffusion Timesteps | Ningyuan Yang et.al. | 2505.10482 | null |
2025-05-15 | AI Agents vs. Agentic AI: A Conceptual Taxonomy, Applications and Challenge | Ranjan Sapkota et.al. | 2505.10468 | null |
2025-05-15 | Multi-Agent Path Finding For Large Agents Is Intractable | Artem Agafonov et.al. | 2505.10387 | null |
2025-05-15 | Optimizing Electric Bus Charging Scheduling with Uncertainties Using Hierarchical Deep Reinforcement Learning | Jiaju Qi et.al. | 2505.10296 | null |
2025-05-15 | MASS: Multi-Agent Simulation Scaling for Portfolio Construction | Taian Guo et.al. | 2505.10278 | link |
2025-05-15 | Near Optimal Best Arm Identification for Clustered Bandits | Yash et.al. | 2505.10147 | null |
2025-05-15 | CartoAgent: a multimodal large language model-powered multi-agent cartographic framework for map style transfer and evaluation | Chenglong Wang et.al. | 2505.09936 | null |
2025-05-15 | Stability and Convergence Analysis of Multi-Agent Consensus with Communication Delays: A Lambert W Function Approach | Layan Badran et.al. | 2505.09897 | null |
2025-05-14 | Hamilton’s Rule for Enabling Altruism in Multi-Agent Systems | Brooks A. Butler et.al. | 2505.09841 | null |
2025-05-14 | WorldView-Bench: A Benchmark for Evaluating Global Cultural Perspectives in Large Language Models | Abdullah Mushtaq et.al. | 2505.09595 | null |
2025-05-14 | Streaming Multi-agent Pathfinding | Mingkai Tang et.al. | 2505.09472 | link |
2025-05-14 | Reproducibility Study of “Cooperate or Collapse: Emergence of Sustainable Cooperation in a Society of LLM Agents” | Pedro M. P. Curvo et.al. | 2505.09289 | link |
2025-05-14 | Data-driven Internal Model Control for Output Regulation | Wenjie Liu et.al. | 2505.09255 | null |
2025-05-14 | SALM: A Multi-Agent Framework for Language Model-Driven Social Network Simulation | Gaurav Koley et.al. | 2505.09081 | link |
2025-05-13 | Enhancing Aerial Combat Tactics through Hierarchical Multi-Agent Reinforcement Learning | Ardian Selmonaj et.al. | 2505.08995 | null |
2025-05-13 | TRAIL: Trace Reasoning and Agentic Issue Localization | Darshan Deshpande et.al. | 2505.08638 | null |
2025-05-13 | Credit Assignment and Efficient Exploration based on Influence Scope in Multi-agent Reinforcement Learning | Shuai Han et.al. | 2505.08630 | null |
2025-05-13 | MC-Swarm: Minimal-Communication Multi-Agent Trajectory Planning and Deadlock Resolution for Quadrotor Swarm | Yunwoo Lee et.al. | 2505.08593 | null |
2025-05-13 | The Truth Becomes Clearer Through Debate! Multi-Agent Systems with Large Language Models Unmask Fake News | Yuhan Liu et.al. | 2505.08532 | null |
2025-05-13 | Scalable UAV Multi-Hop Networking via Multi-Agent Reinforcement Learning with Large Language Models | Yanggang Xu et.al. | 2505.08448 | null |
2025-05-13 | Agent-as-a-Service based on Agent Network | Yuhan Zhu et.al. | 2505.08446 | null |
2025-05-13 | Benchmarking AI scientists in omics data-driven biological research | Erpai Luo et.al. | 2505.08341 | link |
2025-05-13 | Reciprocity as the Foundational Substrate of Society: How Reciprocal Dynamics Scale into Social Systems | Egil Diau et.al. | 2505.08319 | null |
2025-05-13 | Scaling Multi Agent Reinforcement Learning for Underwater Acoustic Tracking via Autonomous Vehicles | Matteo Gallici et.al. | 2505.08222 | null |
2025-05-12 | FalseReject: A Resource for Improving Contextual Safety and Mitigating Over-Refusals in LLMs via Structured Reasoning | Zhehao Zhang et.al. | 2505.08054 | null |
2025-05-12 | Multi-Agent Path Finding via Finite-Horizon Hierarchical Factorization | Jiarui Li et.al. | 2505.07779 | null |
2025-05-12 | KAQG: A Knowledge-Graph-Enhanced RAG for Difficulty-Controlled Question Generation | Ching Han Chen et.al. | 2505.07618 | null |
2025-05-12 | AgentFlow: Resilient Adaptive Cloud-Edge Framework for Multi-Agent Coordination | Ching Han Chen et.al. | 2505.07603 | null |
2025-05-12 | RAI: Flexible Agent Framework for Embodied AI | Kajetan Rachwał et.al. | 2505.07532 | link |
2025-05-12 | Towards Multi-Agent Reasoning Systems for Collaborative Expertise Delegation: An Exploratory Design Study | Baixuan Xu et.al. | 2505.07313 | null |
2025-05-12 | Multi-Agent DRL for Multi-Objective Twin Migration Routing with Workload Prediction in 6G-enabled IoV | Peng Yin et.al. | 2505.07290 | null |
2025-05-12 | BETTY Dataset: A Multi-modal Dataset for Full-Stack Autonomy | Micah Nye et.al. | 2505.07266 | null |
2025-05-12 | Continuous-Time Control Synthesis for Multiple Quadrotors under Signal Temporal Logic Specifications | Yating Yuan et.al. | 2505.07240 | null |
2025-05-12 | UAV-CodeAgents: Scalable UAV Mission Planning via Multi-Agent ReAct and Vision-Language Reasoning | Oleg Sautenkov et.al. | 2505.07236 | null |
2025-05-12 | Hypergraph Coordination Networks with Dynamic Grouping for Multi-Agent Reinforcement Learning | Chiqiang Liu et.al. | 2505.07207 | link |
2025-05-09 | Robust Multi-Agent Decision-Making in Finite-Population Games | Shinkyu Park et.al. | 2505.06200 | null |
2025-05-09 | Offline Multi-agent Reinforcement Learning via Score Decomposition | Dan Qiao et.al. | 2505.05968 | null |
2025-05-09 | Learning Power Control Protocol for In-Factory 6G Subnetworks | Uyoata E. Uyoata et.al. | 2505.05967 | null |
2025-05-09 | Why is distortion inevitable in opinion propagation on social media? Noise induced layer-wised synchronization in Noise-Frustrated Hegselmann-Krause model | Kaiming Luo et.al. | 2505.05769 | null |
2025-05-09 | Multi-Agent Systems for Robotic Autonomy with LLMs | Junhong Chen et.al. | 2505.05762 | null |
2025-05-09 | EcoAgent: An Efficient Edge-Cloud Collaborative Multi-Agent Framework for Mobile Automation | Biao Yi et.al. | 2505.05440 | null |
2025-05-08 | Enhancing Cooperative Multi-Agent Reinforcement Learning with State Modelling and Adversarial Exploration | Andreas Kontogiannis et.al. | 2505.05262 | link |
2025-05-08 | Societal and technological progress as sewing an ever-growing, ever-changing, patchy, and polychrome quilt | Joel Z. Leibo et.al. | 2505.05197 | null |
2025-05-08 | Multi-agent Embodied AI: Advances and Future Directions | Zhaohan Feng et.al. | 2505.05108 | null |
2025-05-12 | Beyond the Tragedy of the Commons: Building A Reputation System for Generative Multi-agent Systems | Siyue Ren et.al. | 2505.05029 | null |
2025-05-08 | Foam-Agent: Towards Automated Intelligent CFD Workflows | Ling Yue et.al. | 2505.04997 | link |
2025-05-08 | Semi-Explicit Solution of Some Discrete-Time Mean-Field-Type Games with Higher-Order Costs | Julian Barreiro-Gomez et.al. | 2505.04988 | null |
2025-05-08 | A Multi-Agent AI Framework for Immersive Audiobook Production through Spatial Audio and Neural Narration | Shaja Arul Selvamani et.al. | 2505.04885 | null |
2025-05-08 | From First Draft to Final Insight: A Multi-Agent Approach for Feedback Generation | Jie Cao et.al. | 2505.04869 | null |
2025-05-07 | Large Language Models are Autonomous Cyber Defenders | Sebastián R. Castro et.al. | 2505.04843 | link |
2025-05-07 | Benchmarking LLMs’ Swarm intelligence | Kai Ruan et.al. | 2505.04364 | link |
2025-05-07 | Adaptive and Robust DBSCAN with Multi-agent Reinforcement Learning | Hao Peng et.al. | 2505.04339 | link |
2025-05-07 | Mastering Multi-Drone Volleyball through Hierarchical Co-Self-Play Reinforcement Learning | Ruize Zhang et.al. | 2505.04317 | null |
2025-05-07 | PPO-ACT: Proximal Policy Optimization with Adversarial Curriculum Transfer for Spatial Public Goods Games | Zhaoqilin Yang et.al. | 2505.04302 | null |
2025-05-07 | Facilitating Trustworthy Human-Agent Collaboration in LLM-based Multi-Agent System oriented Software Engineering | Krishna Ronanki et.al. | 2505.04251 | null |
2025-05-07 | Multi-Agent Reinforcement Learning-based Cooperative Autonomous Driving in Smart Intersections | Taoyuan Yu et.al. | 2505.04231 | null |
2025-05-07 | AutoPatch: Multi-Agent Framework for Patching Real-World CVE Vulnerabilities | Minjae Seo et.al. | 2505.04195 | link |
2025-05-08 | The Power of Stories: Narrative Priming Shapes How LLM Agents Collaborate and Compete | Gerrit Großmann et.al. | 2505.03961 | link |
2025-05-06 | Deep Q-Network (DQN) multi-agent reinforcement learning (MARL) for Stock Trading | John Christopher Tidwell et.al. | 2505.03949 | null |
2025-05-06 | MARCO: A Multi-Agent System for Optimizing HPC Code Generation Using Large Language Models | Asif Rahman et.al. | 2505.03906 | null |
2025-05-06 | Multi-Agent System for Comprehensive Soccer Understanding | Jiayuan Rao et.al. | 2505.03735 | null |
2025-05-06 | RoboOS: A Hierarchical Embodied Framework for Cross-Embodiment and Multi-Agent Collaboration | Huajie Tan et.al. | 2505.03673 | link |
2025-05-06 | Rainbow Delay Compensation: A Multi-Agent Reinforcement Learning Framework for Mitigating Delayed Observation | Songchen Fu et.al. | 2505.03586 | null |
2025-05-06 | Multi-Agent Reinforcement Learning Scheduling to Support Low Latency in Teleoperated Driving | Giacomo Avanzi et.al. | 2505.03558 | null |
2025-05-06 | A Hashgraph-Inspired Consensus Mechanism for Reliable Multi-Model Reasoning | Kolawole E. Ogunsina et.al. | 2505.03553 | null |
2025-05-06 | Small-Scale-Fading-Aware Resource Allocation in Wireless Federated Learning | Jiacheng Wang et.al. | 2505.03533 | null |
2025-05-06 | Multi-Agent Deep Reinforcement Learning for Zonal Ancillary Market Coupling | Francesco Morri et.al. | 2505.03288 | null |
2025-05-06 | Model Predictive Fuzzy Control: A Hierarchical Multi-Agent Control Architecture for Outdoor Search-and-Rescue Robots | Craig Maxwell et.al. | 2505.03257 | null |
2025-05-06 | RADE: Learning Risk-Adjustable Driving Environment via Multi-Agent Conditional Diffusion | Jiawei Wang et.al. | 2505.03178 | null |
2025-05-07 | An LLM-based Self-Evolving Security Framework for 6G Space-Air-Ground Integrated Networks | Qi Qin et.al. | 2505.03161 | null |
2025-05-05 | A Survey of Slow Thinking-based Reasoning LLMs using Reinforced Learning and Inference-time Scaling Law | Qianjun Pan et.al. | 2505.02665 | null |
2025-05-06 | MCCD: Multi-Agent Collaboration-based Compositional Diffusion for Complex Text-to-Image Generation | Mingcheng Li et.al. | 2505.02648 | null |
2025-05-05 | Beyond the model: Key differentiators in large language models and multi-agent services | Muskaan Goyal et.al. | 2505.02489 | null |
2025-05-05 | El Agente: An Autonomous Agent for Quantum Chemistry | Yunheng Zou et.al. | 2505.02484 | null |
2025-05-04 | Resolving Conflicting Constraints in Multi-Agent Reinforcement Learning with Layered Safety | Jason J. Choi et.al. | 2505.02293 | null |
2025-05-04 | Interpretable Emergent Language Using Inter-Agent Transformers | Mannan Bhardwaj et.al. | 2505.02215 | link |
2025-05-04 | Enhancing LLM Code Generation: A Systematic Evaluation of Multi-Agent Collaboration and Runtime Debugging for Improved Accuracy, Reliability, and Latency | Nazmus Ashrafi et.al. | 2505.02133 | link |
2025-05-04 | DriveAgent: Multi-Agent Structured Reasoning with LLM and Multimodal Sensor Fusion for Autonomous Driving | Xinmeng Hou et.al. | 2505.02123 | link |
2025-05-04 | Open Challenges in Multi-Agent Security: Towards Secure Systems of Interacting AI Agents | Christian Schroeder de Witt et.al. | 2505.02077 | null |
2025-05-03 | Securing 5G and Beyond-Enabled UAV Networks: Resilience Through Multiagent Learning and Transformers Detection | Joseanne Viana et.al. | 2505.01885 | null |
2025-05-02 | Pattern formation using an intrinsic optimal control approach | Tianhao Li et.al. | 2505.01302 | null |
2025-05-02 | Exploring Equity of Climate Policies using Multi-Agent Multi-Objective Reinforcement Learning | Palok Biswas et.al. | 2505.01115 | null |
2025-05-02 | Multi-agents based User Values Mining for Recommendation | Lijian Chen et.al. | 2505.00981 | null |
2025-05-02 | Virtual Force-Based Routing of Modular Agents on a Graph | Adam Casselman et.al. | 2505.00928 | null |
2025-05-01 | ParkDiffusion: Heterogeneous Multi-Agent Multi-Modal Trajectory Prediction for Automated Parking using Diffusion Models | Jiarong Wei et.al. | 2505.00586 | null |
2025-05-01 | Emergence of Roles in Robotic Teams with Model Sharing and Limited Communication | Ian O’Flynn et.al. | 2505.00540 | null |
2025-05-01 | Safety-Critical Traffic Simulation with Guided Latent Diffusion Model | Mingxing Peng et.al. | 2505.00515 | null |
2025-05-01 | UserCentrix: An Agentic Memory-augmented AI Framework for Smart Spaces | Alaa Saleh et.al. | 2505.00472 | null |
2025-05-01 | AI2-Active Safety: AI-enabled Interaction-aware Active Safety Analysis with Vehicle Dynamics | Keshu Wu et.al. | 2505.00322 | null |
2025-05-01 | Large Language Models as AI Agents for Digital Atoms and Molecules: Catalyzing a New Era in Computational Biophysics | Yijie Xia et.al. | 2505.00270 | null |
2025-04-30 | PSN Game: Game-theoretic Planning via a Player Selection Network | Tianyu Qiu et.al. | 2505.00213 | null |
2025-04-30 | Which Agent Causes Task Failures and When? On Automated Failure Attribution of LLM Multi-Agent Systems | Shaokun Zhang et.al. | 2505.00212 | link |
2025-04-30 | Investigating Adaptive Tuning of Assistive Exoskeletons Using Offline Reinforcement Learning: Challenges and Insights | Yasin Findik et.al. | 2505.00201 | null |
2025-04-30 | Uncertainty, bias and the institution bootstrapping problem | Stavros Anagnou et.al. | 2504.21579 | null |
2025-04-30 | Stability of Open Multi-agent Systems over Dynamic Signed Graphs | Pelin Sekercioglu et.al. | 2504.21443 | null |
2025-04-30 | Robust Multi-agent Communication Based on Decentralization-Oriented Adversarial Training | Xuyan Ma et.al. | 2504.21278 | null |
2025-04-29 | Learning Large-Scale Competitive Team Behaviors with Mean-Field Interactions | Bhavini Jeloka et.al. | 2504.21164 | null |
2025-04-29 | NavEX: A Multi-Agent Coverage in Non-Convex and Uneven Environments via Exemplar-Clustering | Donipolo Ghimire et.al. | 2504.21113 | link |
2025-04-29 | How to Coordinate UAVs and UGVs for Efficient Mission Planning? Optimizing Energy-Constrained Cooperative Routing with a DRL Framework | Md Safwan Mondal et.al. | 2504.21111 | null |
2025-04-29 | AegisLLM: Scaling Agentic Systems for Self-Reflective Defense in LLM Security | Zikui Cai et.al. | 2504.20965 | link |
2025-04-29 | Opinion-Driven Decision-Making for Multi-Robot Navigation through Narrow Corridors | Norah K. Alghamdi et.al. | 2504.20947 | null |
2025-04-29 | Exploiting inter-agent coupling information for efficient reinforcement learning of cooperative LQR | Shahbaz P Qadri Syed et.al. | 2504.20927 | null |
2025-04-29 | Modeling AI-Human Collaboration as a Multi-Agent Adaptation | Prothit Sen et.al. | 2504.20903 | link |
2025-04-29 | CBM-RAG: Demonstrating Enhanced Interpretability in Radiology Report Generation with Multi-Agent RAG and Concept Bottleneck Models | Hasan Md Tusfiqur Alam et.al. | 2504.20898 | link |
2025-04-29 | Independent Learning in Performative Markov Potential Games | Rilind Sahitaj et.al. | 2504.20593 | link |
2025-04-29 | Safe Bottom-Up Flexibility Provision from Distributed Energy Resources | Costas Mylonas et.al. | 2504.20529 | null |
2025-04-28 | Securing GenAI Multi-Agent Systems Against Tool Squatting: A Zero Trust Registry-Based Approach | Vineeth Sai Narajala et.al. | 2504.19951 | null |
2025-04-28 | Can AI Agents Design and Implement Drug Discovery Pipelines? | Khachik Smbatyan et.al. | 2504.19912 | null |
2025-04-28 | LLM-Powered GUI Agents in Phone Automation: Surveying Progress and Prospects | Guangyi Liu et.al. | 2504.19838 | link |
2025-04-28 | PhenoAssistant: A Conversational Multi-Agent AI System for Automated Plant Phenotyping | Feng Chen et.al. | 2504.19818 | link |
2025-04-28 | From LLM Reasoning to Autonomous AI Agents: A Comprehensive Review | Mohamed Amine Ferrag et.al. | 2504.19678 | null |
2025-04-28 | A Time-dependent Risk-aware distributed Multi-Agent Path Finder based on A* | S Nordström et.al. | 2504.19593 | null |
2025-04-28 | m-KAILIN: Knowledge-Driven Agentic Scientific Corpus Distillation Framework for Biomedical Large Language Models Training | Meng Xiao et.al. | 2504.19565 | null |
2025-04-28 | Evolution of Cooperation in LLM-Agent Societies: A Preliminary Study Using Different Punishment Strategies | Kavindu Warnakulasuriya et.al. | 2504.19487 | null |
2025-04-28 | Symmetric Policy Design for Multi-Agent Dispatch Coordination in Supply Chains | Sagar Sudhakara et.al. | 2504.19397 | null |
2025-04-27 | OpenFOAMGPT 2.0: end-to-end, trustworthy automation for computational fluid dynamics | Jingsen Feng et.al. | 2504.19338 | null |
2025-04-25 | Auto-SLURP: A Benchmark Dataset for Evaluating Multi-Agent Frameworks in Smart Personal Assistant | Lei Shen et.al. | 2504.18373 | link |
2025-04-25 | MAGI: Multi-Agent Guided Interview for Psychiatric Assessment | Guanqun Bi et.al. | 2504.18260 | null |
2025-04-25 | Automating Function-Level TARA for Automotive Full-Lifecycle Security | Yuqiao Yang et.al. | 2504.18083 | null |
2025-04-25 | Sky-Drive: A Distributed Multi-Agent Simulation Platform for Socially-Aware and Human-AI Collaborative Future Transportation | Zilin Huang et.al. | 2504.18010 | null |
2025-04-24 | LLM Agent Swarm for Hypothesis-Driven Drug Discovery | Kevin Song et.al. | 2504.17967 | null |
2025-04-24 | Collaborating Action by Action: A Multi-agent LLM Framework for Embodied Reasoning | Isadora White et.al. | 2504.17950 | null |
2025-04-24 | Applied Sheaf Theory For Multi-agent Artificial Intelligence (Reinforcement Learning) Systems: A Prospectus | Eric Schmid et.al. | 2504.17700 | null |
2025-04-24 | Mitigating xApp conflicts for efficient network slicing in 6G O-RAN: a graph convolutional-based attention network approach | Sihem Bakri et.al. | 2504.17590 | null |
2025-04-24 | A Multi-Agent, Laxity-Based Aggregation Strategy for Cost-Effective Electric Vehicle Charging and Local Transformer Overload Prevention | Kristoffer Christensen et.al. | 2504.17575 | null |
2025-04-24 | Cooperative Task Offloading through Asynchronous Deep Reinforcement Learning in Mobile Edge Computing for Future Networks | Yuelin Liu et.al. | 2504.17526 | null |
2025-04-24 | AGCo-MATA: Air-Ground Collaborative Multi-Agent Task Allocation in Mobile Crowdsensing | Tianhao Shao et.al. | 2504.17409 | null |
2025-04-24 | Comprehend, Divide, and Conquer: Feature Subspace Exploration via Multi-Agent Hierarchical Reinforcement Learning | Weiliang Zhang et.al. | 2504.17356 | null |
2025-04-24 | Collaborative Multi-Agent Reinforcement Learning for Automated Feature Transformation with Graph-Driven Path Optimization | Xiaohan Huang et.al. | 2504.17355 | null |
2025-04-24 | A RAG-Based Multi-Agent LLM System for Natural Hazard Resilience and Adaptation | Yangxinyu Xie et.al. | 2504.17200 | null |
2025-04-24 | Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning | Minju Seo et.al. | 2504.17192 | link |
2025-04-23 | Peer-Aware Cost Estimation in Nonlinear General-Sum Dynamic Games for Mutual Learning and Intent Inference | Seyed Yousef Soltanian et.al. | 2504.17129 | null |
2025-04-23 | OptimAI: Optimization from Natural Language Using LLM-Powered AI Agents | Raghav Thind et.al. | 2504.16918 | null |
2025-04-23 | Building A Secure Agentic AI Application Leveraging A2A Protocol | Idan Habler et.al. | 2504.16902 | null |
2025-04-23 | IRIS: Interactive Research Ideation System for Accelerating Scientific Discovery | Aniketh Garikaparthi et.al. | 2504.16728 | link |
2025-04-23 | Amplified Vulnerabilities: Structured Jailbreak Attacks on LLM-based Multi-Agent Debate | Senmao Qi et.al. | 2504.16489 | null |
2025-04-23 | Less is More: Enhancing Structured Multi-Agent Reasoning via Quality-Guided Distillation | Jiahao Yuan et.al. | 2504.16408 | link |
2025-04-22 | Towards Test Generation from Task Description for Mobile Testing with Multi-modal Reasoning | Hieu Huynh et.al. | 2504.15917 | link |
2025-04-22 | DianJin-R1: Evaluating and Enhancing Financial Reasoning in Large Language Models | Jie Zhu et.al. | 2504.15716 | link |
2025-04-22 | A Multi-Agent Framework for Automated Qinqiang Opera Script Generation Using Large Language Models | Gengxian Cao et.al. | 2504.15552 | null |
2025-04-22 | RiskNet: Interaction-Aware Risk Forecasting for Autonomous Driving in Long-Tail Scenarios | Qichao Liu et.al. | 2504.15541 | null |
2025-04-21 | Agent for User: Testing Multi-User Interactive Features in TikTok | Sidong Feng et.al. | 2504.15474 | null |
2025-04-21 | Solving Multi-Agent Safe Optimal Control with Distributed Epigraph Form MARL | Songyuan Zhang et.al. | 2504.15425 | null |
2025-04-21 | MRTA-Sim: A Modular Simulator for Multi-Robot Allocation, Planning, and Control in Open-World Environments | Victoria Marie Tuck et.al. | 2504.15418 | null |
2025-04-21 | FlowReasoner: Reinforcing Query-Level Meta-Agents | Hongcheng Gao et.al. | 2504.15257 | link |
2025-04-21 | Behavioral Universe Network (BUN): A Behavioral Information-Based Framework for Complex Systems | Wei Zhou et.al. | 2504.15146 | null |
2025-04-21 | Neural ATTF: A Scalable Solution to Lifelong Multi-Agent Path Planning | Kushal Shah et.al. | 2504.15130 | null |
2025-04-21 | DistilQwen2.5: Industrial Practices of Training Distilled Open Lightweight Language Models | Chengyu Wang et.al. | 2504.15027 | null |
2025-04-21 | Mechanism Design for Auctions with Externalities on Budgets | Yusen Zheng et.al. | 2504.14948 | null |
2025-04-21 | EducationQ: Evaluating LLMs’ Teaching Capabilities Through Multi-Agent Dialogue Framework | Yao Shi et.al. | 2504.14928 | null |
2025-04-21 | Event triggered optimal formation control for nonlinear multi-agent systems under Denial-of-Service attacks | Jianqiang Zhang et.al. | 2504.14874 | null |
2025-04-21 | SQL-Factory: A Multi-Agent Framework for High-Quality and Large-Scale SQL Generation | Jiahui Li et.al. | 2504.14837 | link |
2025-04-21 | An Enhanced Dual-Currency VCG Auction Mechanism for Resource Allocation in IoV: A Value of Information Perspective | Wei Wang et.al. | 2504.14824 | null |
2025-04-21 | Completing A Systematic Review in Hours instead of Months with Interactive AI Agents | Rui Qiu et.al. | 2504.14822 | link |
2025-04-18 | LearnAct: Few-Shot Mobile GUI Agent with a Unified Demonstration Benchmark | Guangyi Liu et.al. | 2504.13805 | null |
2025-04-18 | EyecareGPT: Boosting Comprehensive Ophthalmology Understanding with Tailored Dataset, Benchmark and Model | Sijing Li et.al. | 2504.13650 | link |
2025-04-18 | MAAM: A Lightweight Multi-Agent Aggregation Module for Efficient Image Classification Based on the MindSpore Framework | Zhenkai Qin et.al. | 2504.13574 | null |
2025-04-18 | Task Assignment and Exploration Optimization for Low Altitude UAV Rescue via Generative AI Enhanced Multi-agent Reinforcement Learning | Xin Tang et.al. | 2504.13554 | null |
2025-04-18 | MusFlow: Multimodal Music Generation via Conditional Flow Matching | Jiahao Song et.al. | 2504.13535 | null |
2025-04-18 | Large Language Models for Validating Network Protocol Parsers | Mingwei Zheng et.al. | 2504.13515 | link |
2025-04-18 | Decentralized Handover Parameter Optimization with MARL for Load Balancing in 5G Networks | Yang Shen et.al. | 2504.13424 | null |
2025-04-21 | LangCoop: Collaborative Driving with Language | Xiangbo Gao et.al. | 2504.13406 | link |
2025-04-18 | Towards a Multi-Agent Vision-Language System for Zero-Shot Novel Hazardous Object Detection for Autonomous Driving Safety | Shashank Shriram et.al. | 2504.13399 | link |
2025-04-21 | Recursive Deep Inverse Reinforcement Learning | Paul Ghanem et.al. | 2504.13241 | null |
2025-04-17 | Retrieval-Augmented Generation with Conflicting Evidence | Han Wang et.al. | 2504.13079 | link |
2025-04-17 | InstructRAG: Leveraging Retrieval-Augmented Generation on Instruction Graphs for LLM-Based Task Planning | Zheng Wang et.al. | 2504.13032 | null |
2025-04-17 | QLLM: Do We Really Need a Mixing Network for Credit Assignment in Multi-Agent Reinforcement Learning? | Zhouyang Jiang et.al. | 2504.12961 | null |
2025-04-17 | Are AI agents the new machine translation frontier? Challenges and opportunities of single- and multi-agent systems for multilingual digital communication | Vicent Briva-Iglesias et.al. | 2504.12891 | null |
2025-04-17 | DashChat: Interactive Authoring of Industrial Dashboard Design Prototypes through Conversation with LLM-Powered Agents | S. Shen et.al. | 2504.12865 | null |
2025-04-17 | Multi-Agent Reinforcement Learning Simulation for Environmental Policy Synthesis | James Rudd-Jones et.al. | 2504.12777 | null |
2025-04-18 | The Athenian Academy: A Seven-Layer Architecture Model for Multi-Agent Systems | Lidong Zhai et.al. | 2504.12735 | null |
2025-04-17 | Cross-environment Cooperation Enables Zero-shot Multi-agent Coordination | Kunal Jha et.al. | 2504.12714 | null |
2025-04-17 | Collaborative Perception Datasets for Autonomous Driving: A Review | Naibang Wang et.al. | 2504.12696 | link |
2025-04-17 | On Equivalence Between Decentralized Policy-Profile Mixtures and Behavioral Coordination Policies in Multi-Agent Systems | Nouman Khan et.al. | 2504.12635 | null |
2025-04-16 | Optimal flock formation induced by agent heterogeneity | Arthur N. Montanari et.al. | 2504.12297 | link |
2025-04-16 | Factor-MCLS: Multi-agent learning system with reward factor matrix and multi-critic framework for dynamic portfolio optimization | Ruoyu Sun et.al. | 2504.11874 | null |
2025-04-15 | Multi-Agent Reinforcement Learning for Decentralized Reservoir Management via Murmuration Intelligence | Heming Fu et.al. | 2504.11569 | null |
2025-04-15 | Multi-Agent Reinforcement Learning for Greenhouse Gas Offset Credit Markets | Liam Welsh et.al. | 2504.11258 | null |
2025-04-15 | A Linear Push-Pull Average Consensus Algorithm for Delay-Prone Networks | Evagoras Makridis et.al. | 2504.10960 | null |
2025-04-15 | LOKA Protocol: A Decentralized Framework for Trustworthy and Ethical AI Agent Ecosystems | Rajesh Ranjan et.al. | 2504.10915 | null |
2025-04-14 | Achieving Optimal Tissue Repair Through MARL with Reward Shaping and Curriculum Learning | Muhammad Al-Zafar Khan et.al. | 2504.10677 | null |
2025-04-14 | Can Competition Enhance the Proficiency of Agents Powered by Large Language Models in the Realm of News-driven Time Series Forecasting? | Yuxuan Zhang et.al. | 2504.10210 | null |
2025-04-14 | MSCoT: Structured Chain-of-Thought Generation for Multiple Programming Languages | Naizhu Jin et.al. | 2504.10178 | link |
2025-04-14 | DataMosaic: Explainable and Verifiable Multi-Modal Data Analytics through Extract-Reason-Verify | Zhengxuan Zhang et.al. | 2504.10036 | null |
2025-04-14 | PestMA: LLM-based Multi-Agent System for Informed Pest Management | Hongrui Shi et.al. | 2504.09855 | null |
2025-04-14 | Two Heads are Better Than One: Test-time Scaling of Multi-agent Collaborative Reasoning | Can Jin et.al. | 2504.09772 | link |
2025-04-14 | Socratic Chart: Cooperating Multiple Agents for Robust SVG Chart Understanding | Yuyang Ji et.al. | 2504.09764 | null |
2025-04-13 | Learning-based decentralized control with collision avoidance for multi-agent systems | Omayra Yago Nieto et.al. | 2504.09730 | null |
2025-04-13 | EmoAgent: Assessing and Safeguarding Human-AI Interaction for Mental Health Safety | Jiahao Qiu et.al. | 2504.09689 | link |
2025-04-13 | AgentDynEx: Nudging the Mechanics and Dynamics of Multi-Agent Simulations | Jenny Ma et.al. | 2504.09662 | null |
2025-04-13 | Fine-tuning an Large Language Model for Automating Computational Fluid Dynamics Simulations | Zhehao Dong et.al. | 2504.09602 | link |
2025-04-11 | DocAgent: A Multi-Agent System for Automated Code Documentation Generation | Dayu Yang et.al. | 2504.08725 | link |
2025-04-11 | MooseAgent: A LLM Based Multi-agent Framework for Automating Moose Simulation | Tao Zhang et.al. | 2504.08621 | link |
2025-04-11 | Belief States for Cooperative Multi-Agent Reinforcement Learning under Partial Observability | Paul J. Pritz et.al. | 2504.08417 | null |
2025-04-11 | Encoding argumentation frameworks with set attackers to propositional logic systems | Shuai Tang et.al. | 2504.08370 | null |
2025-04-11 | Graph Based Deep Reinforcement Learning Aided by Transformers for Multi-Agent Cooperation | Michael Elrod et.al. | 2504.08195 | null |
2025-04-10 | Hybrid Reinforcement Learning-based Sustainable Multi-User Computation Offloading for Mobile Edge-Quantum Computing | Minrui Xu et.al. | 2504.08134 | null |
2025-04-10 | Test Amplification for REST APIs via Single and Multi-Agent LLM Systems | Robbe Nooyens et.al. | 2504.08113 | null |
2025-04-11 | An LLM-Driven Multi-Agent Debate System for Mendelian Diseases | Xinyang Zhou et.al. | 2504.07881 | null |
2025-04-10 | Anytime Single-Step MAPF Planning with Anytime PIBT | Nayesha Gandotra et.al. | 2504.07841 | null |
2025-04-10 | MOSAIC: Modeling Social AI for Content Dissemination and Regulation in Multi-Agent Simulations | Genglin Liu et.al. | 2504.07830 | link |
2025-04-10 | Strategic learning for disturbance rejection in multi-agent systems: Nash and Minmax in graphical games | Xinyang Wang et.al. | 2504.07547 | null |
2025-04-10 | Achilles Heel of Distributed Multi-Agent Systems | Yiting Zhang et.al. | 2504.07461 | null |
2025-04-09 | Modeling Response Consistency in Multi-Agent LLM Systems: A Comparative Analysis of Shared and Separate Context Approaches | Tooraj Helmi et.al. | 2504.07303 | null |
2025-04-09 | Agentic SLMs: Hunting Down Test Smells | Rian Melo et.al. | 2504.07277 | null |
2025-04-09 | Analysis of the Unscented Transform for Cooperative Localization with Ranging-Only Information | Uthman Olawoye et.al. | 2504.07242 | null |
2025-04-09 | Compositional design for time-varying and nonlinear coordination | Jonas Hansson et.al. | 2504.07226 | null |
2025-04-09 | Multi-Agent Trustworthy Consensus under Random Dynamic Attacks | Orhan Eren Akgün et.al. | 2504.07189 | null |
2025-04-09 | AI-Driven Consensus: Modeling Multi-Agent Networks with Long-Range Interactions through path-Laplacian Matrices | Yusef Ahsini et.al. | 2504.06894 | link |
2025-04-09 | SDHN: Skewness-Driven Hypergraph Networks for Enhanced Localized Multi-Robot Coordination | Delin Zhao et.al. | 2504.06684 | link |
2025-04-09 | Dynamic Residual Safe Reinforcement Learning for Multi-Agent Safety-Critical Scenarios Decision-Making | Kaifeng Wang et.al. | 2504.06670 | null |
2025-04-09 | AgentFM: Role-Aware Failure Management for Distributed Databases with LLM-Driven Multi-Agents | Lingzhe Zhang et.al. | 2504.06614 | null |
2025-04-09 | Market, power, gift, and concession economies: Comparison using four-mode primitive network models | Takeshi Kato et.al. | 2504.06557 | null |
2025-04-08 | Linear Regulator-Based Synchronization of Positive Multi-Agent Systems | Alba Gurpegui et.al. | 2504.06169 | null |
2025-04-08 | Real-Time LaCAM | Runzhe Liang et.al. | 2504.06091 | null |
2025-04-08 | Robust and Efficient Average Consensus with Non-Coherent Over-the-Air Aggregation | Yuhang Deng et.al. | 2504.05729 | null |
2025-04-08 | Single-Agent vs. Multi-Agent LLM Strategies for Automated Student Reflection Assessment | Gen Li et.al. | 2504.05716 | null |
2025-04-08 | FactGuard: Leveraging Multi-Agent Systems to Generate Answerable and Unanswerable Questions for Enhanced Long-Context LLM Extraction | Qian-Wen Zhang et.al. | 2504.05607 | link |
2025-04-07 | Federated Hierarchical Reinforcement Learning for Adaptive Traffic Signal Control | Yongjie Fu et.al. | 2504.05553 | null |
2025-04-07 | Prism: Dynamic and Flexible Benchmarking of LLMs Code Generation with Monte Carlo Tree Search | Vahid Majdinasab et.al. | 2504.05500 | null |
2025-04-07 | BC-ADMM: An Efficient Non-convex Constrained Optimizer with Robotic Applications | Zherong Pan et.al. | 2504.05465 | null |
2025-04-07 | EduPlanner: LLM-Based Multi-Agent Systems for Customized and Intelligent Instructional Design | Xueqiao Zhang et.al. | 2504.05370 | null |
2025-04-07 | CREA: A Collaborative Multi-Agent Framework for Creative Content Generation with Diffusion Models | Kavana Venkatesh et.al. | 2504.05306 | null |
2025-04-07 | AI-Driven Tactical Communications and Networking for Defense: A Survey and Emerging Trends | Victor Monzon Baeza et.al. | 2504.05071 | null |
2025-04-08 | Attention-Augmented Inverse Reinforcement Learning with Graph Convolutions for Multi-Agent Task Allocation | Huilin Yin et.al. | 2504.05045 | null |
2025-04-07 | Hybrid Control Barrier Functions for Nonholonomic Multi-Agent Systems | Aurora Haraldsen et.al. | 2504.04937 | null |
2025-04-07 | BIASINSPECTOR: Detecting Bias in Structured Data through LLM Agents | Haoxuan Li et.al. | 2504.04855 | null |
2025-04-07 | An Efficient Approach for Cooperative Multi-Agent Learning Problems | Ángel Aso-Mollar et.al. | 2504.04850 | null |
2025-04-07 | Multi-Agent Deep Reinforcement Learning for Multiple Anesthetics Collaborative Control | Huijie Li et.al. | 2504.04765 | null |
2025-04-07 | Large-Scale Mixed-Traffic and Intersection Control using Multi-agent Reinforcement Learning | Songyang Liu et.al. | 2504.04691 | link |
2025-04-08 | HypRL: Reinforcement Learning of Control Policies for Hyperproperties | Tzu-Han Hsu et.al. | 2504.04675 | null |
2025-04-08 | Autono: A ReAct-Based Highly Robust Autonomous Agent Framework | Zihao Wu et.al. | 2504.04650 | link |
2025-04-04 | A Lower Bound on Conservative Elementary Object Systems Coverability | Francesco Di Cosmo et.al. | 2504.03591 | null |
2025-04-04 | Decentralized Collective World Model for Emergent Communication and Coordination | Kentaro Nomura et.al. | 2504.03353 | null |
2025-04-04 | Learning-Based Conformal Tube MPC for Safe Control in Interactive Multi-Agent Systems | Shuqi Wang et.al. | 2504.03293 | link |
2025-04-04 | Energy Aware and Safe Path Planning for Unmanned Aircraft Systems | Sebastian Gasche et.al. | 2504.03271 | null |
2025-04-07 | DeepResearcher: Scaling Deep Research via Reinforcement Learning in Real-world Environments | Yuxiang Zheng et.al. | 2504.03160 | link |
2025-04-03 | Distributionally Robust Predictive Runtime Verification under Spatio-Temporal Logic Specifications | Yiqi Zhao et.al. | 2504.02964 | link |
2025-04-03 | Sequential Binary Hypothesis Testing with Competing Agents under Information Asymmetry | Aneesh Raghavan et.al. | 2504.02743 | null |
2025-04-03 | A Set-Theoretic Robust Control Approach for Linear Quadratic Games with Unknown Counterparts | Francesco Bianchin et.al. | 2504.02679 | null |
2025-04-03 | Multi-Mission Tool Bench: Assessing the Robustness of LLM based Agents through Related and Dynamic Missions | PeiJie Yu et.al. | 2504.02623 | link |
2025-04-03 | Hierarchical Policy-Gradient Reinforcement Learning for Multi-Agent Shepherding Control of Non-Cohesive Targets | Stefano Covone et.al. | 2504.02479 | link |
2025-04-02 | A Survey of Scaling in Large Language Model Reasoning | Zihan Chen et.al. | 2504.02181 | null |
2025-04-02 | Achieving Unanimous Consensus in Decision Making Using Multi-Agents | Apurba Pokharel et.al. | 2504.02128 | null |
2025-04-02 | Distributed Resource Allocation for Human-Autonomy Teaming under Coupled Constraints | Yichen Yao et.al. | 2504.02088 | null |
2025-04-02 | Self-Resource Allocation in Multi-Agent LLM Systems | Alfonso Amayuelas et.al. | 2504.02051 | null |
2025-04-04 | Distributed Multi-agent Coordination over Cellular Sheaves | Tyler Hanks et.al. | 2504.02049 | null |
2025-04-02 | Budget-Feasible Contracts | Michal Feldman et.al. | 2504.01773 | null |
2025-04-02 | LLM-mediated Dynamic Plan Generation with a Multi-Agent Approach | Reo Abe et.al. | 2504.01637 | null |
2025-04-03 | GeoRAG: A Question-Answering Approach from a Geographical Perspective | Jian Wang et.al. | 2504.01458 | null |
2025-04-02 | Dynamic Incentive Strategies for Smart EV Charging Stations: An LLM-Driven User Digital Twin Approach | Yichen Sun et.al. | 2504.01423 | null |
2025-04-01 | Remember, but also, Forget: Bridging Myopic and Perfect Recall Fairness with Past-Discounting | Ashwin Kumar et.al. | 2504.01154 | null |
2025-04-01 | Provably Stable Multi-Agent Routing with Bounded-Delay Adversaries in the Decision Loop | Roee M. Francos et.al. | 2504.00863 | null |
2025-04-02 | Do We Truly Need So Many Samples? Multi-LLM Repeated Sampling Efficiently Scales Test-Time Compute | Jianhao Chen et.al. | 2504.00762 | link |
2025-04-01 | GraphMaster: Automated Graph Synthesis via LLM Agents in Data-Limited Environments | Enjun Du et.al. | 2504.00711 | null |
2025-04-01 | Simulation of Autonomous Industrial Vehicle Fleet Using Fuzzy Agents: Application to Task Allocation and Battery Charge Management | Juliette Grosset et.al. | 2504.00683 | null |
2025-04-01 | Asynchronous Multi-Agent Systems with Petri nets | Federica Adobbati et.al. | 2504.00602 | null |
2025-03-31 | Fair Dynamic Spectrum Access via Fully Decentralized Multi-Agent Reinforcement Learning | Yubo Zhang et.al. | 2503.24296 | null |
2025-03-31 | MaintainCoder: Maintainable Code Generation Under Dynamic Requirements | Zhengren Wang et.al. | 2503.24260 | link |
2025-04-02 | TeleAntiFraud-28k: An Audio-Text Slow-Thinking Dataset for Telecom Fraud Detection | Zhiming Ma et.al. | 2503.24115 | link |
2025-03-31 | Data-Driven Distributed Output Synchronization of Heterogeneous Discrete-Time Multi-Agent Systems | Giulio Fattore et.al. | 2503.24105 | null |
2025-03-31 | Consensus on Open Multi-Agent Systems Over Graphs Sampled from Graphons | Renato Vizuete et.al. | 2503.24025 | null |
2025-03-31 | Rubric Is All You Need: Enhancing LLM-based Code Evaluation With Question-Specific Rubrics | Aditya Pathak et.al. | 2503.23989 | null |
2025-03-31 | SchemaAgent: A Multi-Agents Framework for Generating Relational Database Schema | Qin Wang et.al. | 2503.23886 | link |
2025-03-31 | WHERE and WHICH: Iterative Debate for Biomedical Synthetic Data Augmentation | Zhengyi Zhao et.al. | 2503.23673 | null |
2025-03-31 | Multi-Agent Deep Reinforcement Learning for Optimized Multi-UAV Coverage and Power-Efficient UE Connectivity | Xuli Cai et.al. | 2503.23669 | null |
2025-04-01 | MolGround: A Benchmark for Molecular Grounding | Jiaxin Wu et.al. | 2503.23668 | null |
2025-03-28 | Self-Evolving Multi-Agent Simulations for Realistic Clinical Interactions | Mohammad Almansoori et.al. | 2503.22678 | null |
2025-03-28 | Unlocking LLM Repair Capabilities in Low-Resource Programming Languages Through Cross-Language Translation and Multi-Agent Refinement | Wenqiang Luo et.al. | 2503.22512 | null |
2025-03-28 | WorkTeam: Constructing Workflows from Natural Language with Multi-Agents | Hanchao Liu et.al. | 2503.22473 | null |
2025-03-31 | PharmAgents: Building a Virtual Pharma with Large Language Model Agents | Bowen Gao et.al. | 2503.22164 | null |
2025-03-28 | Cooperative Hybrid Multi-Agent Pathfinding Based on Shared Exploration Maps | Ning Liu et.al. | 2503.22162 | null |
2025-03-28 | REMAC: Self-Reflective and Self-Evolving Multi-Agent Collaboration for Long-Horizon Robot Manipulation | Puzhen Yuan et.al. | 2503.22122 | null |
2025-03-27 | Debate-Driven Multi-Agent LLMs for Phishing Email Detection | Ngoc Tuong Vy Nguyen et.al. | 2503.22038 | null |
2025-03-27 | Combining Graph Attention Networks and Distributed Optimization for Multi-Robot Mixed-Integer Convex Programming | Viet-Anh Le et.al. | 2503.21548 | null |
2025-03-27 | Learning Generalizable Skills from Offline Multi-Task Data for Multi-Agent Cooperation | Sicong Liu et.al. | 2503.21200 | link |
2025-03-27 | Safe Human Robot Navigation in Warehouse Scenario | Seth Farrell et.al. | 2503.21141 | null |
2025-03-26 | A Hopf-Lax Type Formula for Multi-Agent Path Planning with Pattern Coordination | Christian Parkinson et.al. | 2503.20974 | link |
2025-03-26 | Welfare and Cost Aggregation for Multi-Agent Control: When to Choose Which Social Cost Function, and Why? | Ilia Shilov et.al. | 2503.20772 | null |
2025-03-27 | Flip Learning: Weakly Supervised Erase to Segment Nodules in Breast Ultrasound | Yuhao Huang et.al. | 2503.20685 | null |
2025-03-26 | TAMA: A Human-AI Collaborative Thematic Analysis Framework Using Multi-Agent LLMs for Clinical Interviews | Huimin Xu et.al. | 2503.20666 | null |
2025-03-26 | Agent-Based Analysis of the Impact of Near Real-Time Data and Smart Balancing on the Frequency Stability of Power Systems | Johannes Lips et.al. | 2503.20665 | null |
2025-03-26 | A Theoretical Framework for Prompt Engineering: Approximating Smooth Functions with Transformer Prompts | Ryumei Nakada et.al. | 2503.20561 | null |
2025-03-26 | On the order of the shortest solution sequences for the pebble motion problems | Tomoki Nakamigawa et.al. | 2503.20550 | null |
2025-03-26 | Knowledge-Based Multi-Agent Framework for Automated Software Architecture Design | Yiran Zhang et.al. | 2503.20536 | null |
2025-03-26 | GAIA-2: A Controllable Multi-View Generative World Model for Autonomous Driving | Lloyd Russell et.al. | 2503.20523 | null |
2025-03-26 | Harmonia: A Multi-Agent Reinforcement Learning Approach to Data Placement and Migration in Hybrid Storage Systems | Rakesh Nadig et.al. | 2503.20507 | null |
2025-03-26 | A multi-agentic framework for real-time, autonomous freeform metasurface design | Robert Lupoiu et.al. | 2503.20479 | link |
2025-03-25 | A Multi-Agent Framework Integrating Large Language Models and Generative AI for Accelerated Metamaterial Design | Jie Tian et.al. | 2503.19889 | null |
2025-03-25 | Collaborative Satisfaction of Long-Term Spatial Constraints in Multi-Agent Systems: A Distributed Optimization Approach (extended version) | Farhad Mehdifar et.al. | 2503.19879 | null |
2025-03-25 | Optimal Path Planning and Cost Minimization for a Drone Delivery System Via Model Predictive Control | Muhammad Al-Zafar Khan et.al. | 2503.19699 | null |
2025-03-25 | Enabling Rapid Shared Human-AI Mental Model Alignment via the After-Action Review | Edward Gu et.al. | 2503.19607 | link |
2025-03-26 | Multi-agent Application System in Office Collaboration Scenarios | Songtao Sun et.al. | 2503.19584 | null |
2025-03-25 | Multi-Agent Deep Reinforcement Learning for Safe Autonomous Driving with RICS-Assisted MEC | Xueyao Zhang et.al. | 2503.19418 | null |
2025-03-25 | TraF-Align: Trajectory-aware Feature Alignment for Asynchronous Multi-agent Perception | Zhiying Song et.al. | 2503.19391 | link |
2025-03-24 | WikiAutoGen: Towards Multi-Modal Wikipedia-Style Article Generation | Zhongyu Yang et.al. | 2503.19065 | link |
2025-03-24 | AgentDropout: Dynamic Agent Elimination for Token-Efficient and High-Performance LLM-Based Multi-Agent Collaboration | Zhexuan Wang et.al. | 2503.18891 | link |
2025-03-24 | Learning Multi-Robot Coordination through Locality-Based Factorized Multi-Agent Actor-Critic Algorithm | Chak Lam Shek et.al. | 2503.18816 | null |
2025-03-24 | Unsupervised Acquisition of Discrete Grammatical Categories | David Ph. Shakouri et.al. | 2503.18702 | null |
2025-03-24 | Unified Uncertainty-Aware Diffusion for Multi-Agent Trajectory Modeling | Guillem Capellera et.al. | 2503.18589 | null |
2025-03-24 | Multi-agent coordination for data gathering with periodic requests and deliveries | Yaroslav Marchukov et.al. | 2503.18546 | null |
2025-03-24 | Optimizing Influence Campaigns: Nudging under Bounded Confidence | Yen-Shao Chen et.al. | 2503.18331 | null |
2025-03-24 | DeepFund: Will LLM be Professional at Fund Investment? A Live Arena Perspective | Changlun Li et.al. | 2503.18313 | null |
2025-03-23 | Decentralized Navigation of a Cable-Towed Load using Quadrupedal Robot Team via MARL | Wen-Tse Chen et.al. | 2503.18221 | null |
2025-03-23 | Iterative Multi-Agent Reinforcement Learning: A Novel Approach Toward Real-World Multi-Echelon Inventory Optimization | Georg Ziegner et.al. | 2503.18201 | null |
2025-03-23 | Metaphor-based Jailbreaking Attacks on Text-to-Image Models | Chenyu Zhang et.al. | 2503.17987 | null |
2025-03-21 | LLM+MAP: Bimanual Robot Task Planning using Large Language Models and Planning Domain Definition Language | Kun Chu et.al. | 2503.17309 | link |
2025-03-21 | Which2comm: An Efficient Collaborative Perception Framework for 3D Object Detection | Duanrui Yu et.al. | 2503.17175 | null |
2025-03-21 | MAPS: A Multi-Agent Framework Based on Big Seven Personality and Socratic Guidance for Multimodal Scientific Problem Solving | Jian Zhang et.al. | 2503.16905 | link |
2025-03-21 | MARS: A Multi-Agent Framework Incorporating Socratic Guidance for Automated Prompt Optimization | Jian Zhang et.al. | 2503.16874 | null |
2025-03-21 | ETVA: Evaluation of Text-to-Video Alignment via Fine-grained Question Generation and Answering | Kaisi Guan et.al. | 2503.16867 | null |
2025-03-21 | When Debate Fails: Bias Reinforcement in Large Language Models | Jihwan Oh et.al. | 2503.16814 | null |
2025-03-20 | RoboFactory: Exploring Embodied Agent Collaboration with Compositional Constraints | Yiran Qin et.al. | 2503.16408 | null |
2025-03-20 | Characterizing the Convergence of Game Dynamics via Potentialness | Martin Bichler et.al. | 2503.16285 | link |
2025-03-21 | GreenIQ: A Deep Search Platform for Comprehensive Carbon Market Analysis and Automated Report Generation | Oluwole Fagbohun et.al. | 2503.16041 | null |
2025-03-21 | How much should we care about what others know? Jump signals in optimal investment under relative performance concerns | Peter Bank et.al. | 2503.16039 | null |
2025-03-20 | Consensus Tracking Control of Multi-agent Systems with A Time-varying Reference State under Binary-valued Communication | Ting Wang et.al. | 2503.15955 | null |
2025-03-20 | Unreal-MAP: Unreal-Engine-Based General Platform for Multi-Agent Reinforcement Learning | Tianyi Hu et.al. | 2503.15947 | link |
2025-03-20 | AutoRedTeamer: Autonomous Red Teaming with Lifelong Attack Integration | Andy Zhou et.al. | 2503.15754 | null |
2025-03-19 | Predicting Multi-Agent Specialization via Task Parallelizability | Elizabeth Mieczkowski et.al. | 2503.15703 | null |
2025-03-19 | PEnGUiN: Partially Equivariant Graph NeUral Networks for Sample Efficient MARL | Joshua McClellan et.al. | 2503.15615 | null |
2025-03-19 | Lyapunov-Based Graph Neural Networks for Adaptive Control of Multi-Agent Systems | Brandon C. Fallin et.al. | 2503.15360 | null |
2025-03-19 | MAMM-Refine: A Recipe for Improving Faithfulness in Generation with Multi-Agent Collaboration | David Wan et.al. | 2503.15272 | null |
2025-03-19 | When Pigs Get Sick: Multi-Agent AI for Swine Disease Detection | Tittaya Mairittha et.al. | 2503.15204 | null |
2025-03-19 | Multi-Agent Actor-Critic with Harmonic Annealing Pruning for Dynamic Spectrum Access Systems | George Stamatelis et.al. | 2503.15172 | null |
2025-03-19 | LogiAgent: Automated Logical Testing for REST Systems with LLM-Based Multi-Agents | Ke Zhang et.al. | 2503.15079 | null |
2025-03-19 | HAD-Gen: Human-like and Diverse Driving Behavior Modeling for Controllable Scenario Generation | Cheng Wang et.al. | 2503.15049 | link |
2025-03-19 | ChatStitch: Visualizing Through Structures via Surround-View Unsupervised Deep Image Stitching with Collaborative LLM-Agents | Hao Liang et.al. | 2503.14948 | null |
2025-03-18 | Safety-Critical and Distributed Nonlinear Predictive Controllers for Teams of Quadrupedal Robots | Basit Muhammad Imran et.al. | 2503.14656 | null |
2025-03-18 | Don’t lie to your friends: Learning what you know from collaborative self-play | Jacob Eisenstein et.al. | 2503.14481 | null |
2025-03-18 | Decentralized RISE-based Control for Exponential Heterogeneous Multi-Agent Target Tracking of Second-Order Nonlinear Systems | Cristian F. Nino et.al. | 2503.14418 | null |
2025-03-18 | Unified Analysis of Decentralized Gradient Descent: a Contraction Mapping Framework | Erik G. Larsson et.al. | 2503.14353 | null |
2025-03-18 | MANTRA: Enhancing Automated Method-Level Refactoring with Contextual RAG and Multi-Agent LLM Collaboration | Yisen Xu et.al. | 2503.14340 | null |
2025-03-18 | Towards a Barrier-free GeoQA Portal: Natural Language Interaction with Geospatial Data Using Multi-Agent LLMs and Semantic Search | Yu Feng et.al. | 2503.14251 | null |
2025-03-18 | Stacked-Residual PINN for State Reconstruction of Hyperbolic Systems | Katayoun Eshkofti et.al. | 2503.14222 | link |
2025-03-18 | Decentralized Continuification Control of Multi-Agent Systems via Distributed Density Estimation | Beniamino Di Lorenzo et.al. | 2503.14119 | null |
2025-03-18 | Sparse control in microscopic and mean-field leader-follower models | Melanie Harms et.al. | 2503.14113 | null |
2025-03-18 | Collective completeness and pricing hedging duality | Alessandro Doldi et.al. | 2503.14086 | null |
2025-03-18 | MDocAgent: A Multi-Modal Multi-Agent Framework for Document Understanding | Siwei Han et.al. | 2503.13964 | link |
2025-03-17 | A Comprehensive Survey on Multi-Agent Cooperative Decision-Making: Scenarios, Approaches, Challenges and Perspectives | Weiqiang Jin et.al. | 2503.13415 | null |
2025-03-17 | Toward Generative 6G Simulation: An Experimental Multi-Agent LLM and ns-3 Integration | Farhad Rezazadeh et.al. | 2503.13402 | null |
2025-03-17 | Optimal intrinsic formation using exogenous systems | Yueyue Xu et.al. | 2503.13359 | null |
2025-03-17 | Goal2Story: A Multi-Agent Fleet based on Privately Enabled sLLMs for Impacting Mapping on Requirements Elicitation | Xinkai Zou et.al. | 2503.13279 | null |
2025-03-17 | Knowledge-Aware Iterative Retrieval for Multi-Agent Systems | Seyoung Song et.al. | 2503.13275 | null |
2025-03-17 | Robust Decision-Making Via Free Energy Minimization | Allahkaram Shafiei et.al. | 2503.13223 | null |
2025-03-17 | MAP: Evaluation and Multi-Agent Enhancement of Large Language Models for Inpatient Pathways | Zhen Chen et.al. | 2503.13205 | null |
2025-03-17 | Prioritized Planning for Continuous-time Lifelong Multi-agent Pathfinding | Alvin Combrink et.al. | 2503.13175 | null |
2025-03-17 | Collaborative AI Enhances Image Understanding in Materials Science | Ruoyan Avery Yin et.al. | 2503.13169 | null |
2025-03-17 | Actively learning equilibria in Nash games with misleading information | Barbara Franci et.al. | 2503.13167 | null |
2025-03-14 | Essentials of the kinetic theory of multi-agent systems | Nadia Loy et.al. | 2503.11554 | null |
2025-03-14 | Prompt Injection Detection and Mitigation via AI Multi-Agent NLP Frameworks | Diego Gosmar et.al. | 2503.11517 | link |
2025-03-14 | Multi-agent coordination for on-demand data gathering with periodic information upload | Yaroslav Marchukov et.al. | 2503.11504 | null |
2025-03-14 | Unicorn: A Universal and Collaborative Reinforcement Learning Approach Towards Generalizable Network-Wide Traffic Signal Control | Yifeng Zhang et.al. | 2503.11488 | null |
2025-03-14 | Research Vision: Multi-Agent Path Planning for Cops And Robbers Via Reactive Synthesis | William Fishell et.al. | 2503.11475 | null |
2025-03-14 | AIstorian lets AI be a historian: A KG-powered multi-agent system for accurate biography generation | Fengyu Li et.al. | 2503.11346 | link |
2025-03-14 | EmoAgent: Multi-Agent Collaboration of Plan, Edit, and Critic, for Affective Image Manipulation | Qi Mao et.al. | 2503.11290 | null |
2025-03-14 | Collaboration is all you need: LLM Assisted Safe Code Translation | Rabimba Karanjai et.al. | 2503.11237 | null |
2025-03-14 | Ergodic exploration of dynamic distribution | Luka Lanča et.al. | 2503.11235 | null |
2025-03-14 | Prompt Alchemy: Automatic Prompt Refinement for Enhancing Code Generation | Sixiang Ye et.al. | 2503.11085 | link |
2025-03-13 | A large multi-agent system with noise both in position and control | Giuseppe D’Onofrio et.al. | 2503.10543 | null |
2025-03-13 | HALO: Fault-Tolerant Safety Architecture For High-Speed Autonomous Racing | Aron Harder et.al. | 2503.10341 | null |
2025-03-13 | SurgRAW: Multi-Agent Workflow with Chain-of-Thought Reasoning for Surgical Intelligence | Chang Han Low et.al. | 2503.10265 | null |
2025-03-13 | Reach-Avoid-Stay-Collision-Avoidance Negotiation Framework for Multi-Agent Systems via Spatiotemporal Tubes | Mohd. Faizuddin Faruqui et.al. | 2503.10245 | null |
2025-03-13 | Global synchronization of multi-agent systems with nonlinear interactions | Anthony Couthures et.al. | 2503.10205 | null |
2025-03-13 | Multi-Agent Q-Learning Dynamics in Random Networks: Convergence due to Exploration and Sparsity | Aamal Hussain et.al. | 2503.10186 | null |
2025-03-13 | Optimal Privacy-Preserving Distributed Median Consensus | Wenrui Yu et.al. | 2503.10147 | null |
2025-03-13 | AgentDAO: Synthesis of Proposal Transactions Via Abstract DAO Semantics | Lin Ao et.al. | 2503.10099 | null |
2025-03-13 | One-bit consensus of controllable linear multi-agent systems with communication noises | Ru An et.al. | 2503.10062 | null |
2025-03-13 | Enhancing Multi-Agent Systems via Reinforcement Learning with LLM-based Planner and Graph-based Policy | Ziqi Jia et.al. | 2503.10049 | null |
2025-03-12 | The turnpike control in stochastic multi-agent dynamics: a discrete-time approach with exponential integrators | Fabio Cassini et.al. | 2503.09549 | null |
2025-03-12 | PairVDN - Pair-wise Decomposed Value Functions | Zak Buzzard et.al. | 2503.09521 | link |
2025-03-12 | RESTRAIN: Reinforcement Learning-Based Secure Framework for Trigger-Action IoT Environment | Md Morshed Alam et.al. | 2503.09513 | null |
2025-03-12 | ReMA: Learning to Meta-think for LLMs with Multi-Agent Reinforcement Learning | Ziyu Wan et.al. | 2503.09501 | link |
2025-03-12 | Multi-Agent Image Restoration | Xu Jiang et.al. | 2503.09403 | null |
2025-03-12 | Task Allocation for Multi-agent Systems via Unequal-dimensional Optimal Transport | Anqi Dong et.al. | 2503.09369 | null |
2025-03-12 | COLA: A Scalable Multi-Agent Framework For Windows UI Task Automation | Di Zhao et.al. | 2503.09263 | link |
2025-03-12 | Rethinking Bimanual Robotic Manipulation: Learning with Decoupled Interaction Framework | Jian-Jian Jiang et.al. | 2503.09186 | null |
2025-03-12 | Optimal control for multiagent systems with simultaneous aggregation | Mauro Bonafini et.al. | 2503.09168 | null |
2025-03-11 | Beam Selection in ISAC using Contextual Bandit with Multi-modal Transformer and Transfer Learning | Mohammad Farzanullah et.al. | 2503.08937 | null |
2025-03-11 | Hierarchical Multi Agent DRL for Soft Handovers Between Edge Clouds in Open RAN | F. Giarrè et.al. | 2503.08493 | null |
2025-03-11 | Collaborative Dynamic 3D Scene Graphs for Open-Vocabulary Urban Scene Understanding | Tim Steinke et.al. | 2503.08474 | null |
2025-03-11 | Learning to Detect Objects from Multi-Agent LiDAR Scans without Manual Labels | Qiming Xia et.al. | 2503.08421 | link |
2025-03-11 | InfluenceNet: AI Models for Banzhaf and Shapley Value Prediction | Benjamin Kempinski et.al. | 2503.08381 | null |
2025-03-11 | Liquidity Competition Between Brokers and an Informed Trader | Ryan Donnelly et.al. | 2503.08287 | null |
2025-03-11 | A Cascading Cooperative Multi-agent Framework for On-ramp Merging Control Integrating Large Language Models | Miao Zhang et.al. | 2503.08199 | null |
2025-03-11 | Privacy-Enhancing Paradigms within Federated Multi-Agent Systems | Zitong Shi et.al. | 2503.08175 | link |
2025-03-11 | FilmComposer: LLM-Driven Music Production for Silent Film Clips | Zhifeng Xie et.al. | 2503.08147 | null |
2025-03-10 | Fully Autonomous Programming using Iterative Multi-Agent Debugging with Large Language Models | Anastasiia Grishina et.al. | 2503.07693 | null |
2025-03-10 | Q-MARL: A quantum-inspired algorithm using neural message passing for large-scale multi-agent reinforcement learning | Kha Vo et.al. | 2503.07397 | null |
2025-03-10 | Automated Movie Generation via Multi-Agent CoT Planning | Weijia Wu et.al. | 2503.07314 | link |
2025-03-10 | VizTrust: A Visual Analytics Tool for Capturing User Trust Dynamics in Human-AI Communication | Xin Wang et.al. | 2503.07279 | link |
2025-03-10 | Automatic Curriculum Design for Zero-Shot Human-AI Coordination | Won-Sang You et.al. | 2503.07275 | null |
2025-03-10 | Communication-aware Multi-agent Systems Control Based on $k$ -hop Distributed Observers | Tommaso Zaccherini et.al. | 2503.07246 | null |
2025-03-10 | ReelWave: A Multi-Agent Framework Toward Professional Movie Sound Generation | Zixuan Wang et.al. | 2503.07217 | null |
2025-03-10 | DeFine: A Decomposed and Fine-Grained Annotated Dataset for Long-form Article Generation | Ming Wang et.al. | 2503.07170 | null |
2025-03-10 | Parametric Value Approximation for General-sum Differential Games with State Constraints | Lei Zhang et.al. | 2503.06994 | null |
2025-03-10 | ReAgent: Reversible Multi-Agent Reasoning for Knowledge-Enhanced Multi-Hop QA | Zhao Xinjie et.al. | 2503.06951 | null |
2025-03-10 | Can Proof Assistants Verify Multi-Agent Systems? | Julian Alfredo Mendez et.al. | 2503.06812 | link |
2025-03-07 | Symbolic Mixture-of-Experts: Adaptive Skill-based Routing for Heterogeneous Reasoning | Justin Chih-Yao Chen et.al. | 2503.05641 | null |
2025-03-07 | The Society of HiveMind: Multi-Agent Optimization of Foundation Model Swarms to Unlock the Potential of Collective Intelligence | Noah Mamie et.al. | 2503.05473 | null |
2025-03-07 | Game Theory in Formula 1: Multi-agent Physical and Strategical Interactions | Giona Fienia et.al. | 2503.05421 | null |
2025-03-07 | Multi Agent based Medical Assistant for Edge Devices | Sakharam Gawade et.al. | 2503.05397 | link |
2025-03-07 | VLMs Play StarCraft II: A Benchmark and Multimodal Decision Method | Weiyu Ma et.al. | 2503.05383 | link |
2025-03-07 | GEMA-Score: Granular Explainable Multi-Agent Score for Radiology Report Evaluation | Zhenxuan Zhang et.al. | 2503.05347 | link |
2025-03-07 | MM-StoryAgent: Immersive Narrated Storybook Video Generation with a Multi-Agent Paradigm across Text, Image and Audio | Xuenan Xu et.al. | 2503.05242 | link |
2025-03-07 | Multi-Robot Collaboration through Reinforcement Learning and Abstract Simulation | Adam Labiosa et.al. | 2503.05092 | null |
2025-03-06 | Multi-Agent Ergodic Exploration under Smoke-Based, Time-Varying Sensor Visibility Constraints | Elena Wittemyer et.al. | 2503.04998 | null |
2025-03-06 | Security-Aware Sensor Fusion with MATE: the Multi-Agent Trust Estimator | R. Spencer Hallyburton et.al. | 2503.04954 | null |
2025-03-06 | Multi-Agent Inverse Q-Learning from Demonstrations | Nathaniel Haynam et.al. | 2503.04679 | null |
2025-03-06 | From Idea to CAD: A Language Model-Driven Multi-Agent System for Collaborative Design | Felix Ocker et.al. | 2503.04417 | null |
2025-03-06 | AgentSafe: Safeguarding Large Language Model-based Multi-agent Systems via Hierarchical Data Management | Junyuan Mao et.al. | 2503.04392 | null |
2025-03-06 | Guidelines for Applying RL and MARL in Cybersecurity Applications | Vasilios Mavroudis et.al. | 2503.04262 | null |
2025-03-06 | Computational Intractability of Strategizing against Online Learners | Angelos Assos et.al. | 2503.04202 | null |
2025-03-06 | DVM-SLAM: Decentralized Visual Monocular Simultaneous Localization and Mapping for Multi-Agent Systems | Joshua Bird et.al. | 2503.04126 | null |
2025-03-05 | Pretrained LLMs as Real-Time Controllers for Robot Operated Serial Production Line | Muhammad Waseem et.al. | 2503.03889 | null |
2025-03-05 | Multi-Agent Systems Powered by Large Language Models: Applications in Swarm Intelligence | Cristian Jimenez-Romero et.al. | 2503.03800 | link |
2025-03-05 | CHOP: Mobile Operating Assistant with Constrained High-frequency Optimized Subtask Planning | Yuqi Zhou et.al. | 2503.03743 | link |
2025-03-05 | MAS-GPT: Training LLMs to Build LLM-based Multi-Agent Systems | Rui Ye et.al. | 2503.03686 | null |
2025-03-05 | Parallelized Planning-Acting for Efficient LLM-based Multi-Agent Systems | Yaoru Li et.al. | 2503.03505 | link |
2025-03-05 | CoSDH: Communication-Efficient Collaborative Perception via Supply-Demand Awareness and Intermediate-Late Hybridization | Junhao Xu et.al. | 2503.03430 | link |
2025-03-05 | Multi-Agent DRL for Queue-Aware Task Offloading in Hierarchical MEC-Enabled Air-Ground Networks | Muhammet Hevesli et.al. | 2503.03391 | null |
2025-03-05 | Exploring the Potential of Large Language Models as Predictors in Dynamic Text-Attributed Graphs | Runlin Lei et.al. | 2503.03258 | null |
2025-03-05 | MA-LoT: Multi-Agent Lean-based Long Chain-of-Thought Reasoning enhances Formal Theorem Proving | Ruida Wang et.al. | 2503.03205 | link |
2025-03-05 | Distributed Certifiably Correct Range-Aided SLAM | Alexander Thoms et.al. | 2503.03192 | link |
2025-03-06 | Dango: A Mixed-Initiative Data Wrangling System using Large Language Model | Wei-Hao Chen et.al. | 2503.03154 | null |
2025-03-04 | RAILGUN: A Unified Convolutional Policy for Multi-Agent Path Finding Across Different Environments and Tasks | Yimin Tang et.al. | 2503.02992 | null |
2025-03-04 | From Metaphor to Mechanism: How LLMs Decode Traditional Chinese Medicine Symbolic Language for Modern Clinical Relevance | Jiacheng Tang et.al. | 2503.02760 | null |
2025-03-04 | Federated Learning for Privacy-Preserving Feedforward Control in Multi-Agent Systems | Jakob Weber et.al. | 2503.02693 | link |
2025-03-04 | FinArena: A Human-Agent Collaboration Framework for Financial Market Analysis and Forecasting | Congluo Xu et.al. | 2503.02692 | null |
2025-03-04 | Unique existence of solution and Hyers-Ulam stability for a new fractional differential quasi-variational inequality with Mittag-Leffler kernel and its applications | Zeng-bao Wu et.al. | 2503.02669 | null |
2025-03-04 | Playing games with Large language models: Randomness and strategy | Alicia Vidler et.al. | 2503.02582 | null |
2025-03-04 | LTL Verification of Memoryful Neural Agents | Mehran Hosseini et.al. | 2503.02512 | link |
2025-03-05 | BRIDGE: Bootstrapping Text to Control Time-Series Generation via Multi-Agent Iterative Optimization and Diffusion Modelling | Hao Li et.al. | 2503.02445 | null |
2025-03-04 | Decentralized Reinforcement Learning for Multi-Agent Multi-Resource Allocation via Dynamic Cluster Agreements | Antonio Marino et.al. | 2503.02437 | null |
2025-03-04 | VisAgent: Narrative-Preserving Story Visualization Framework | Seungkwon Kim et.al. | 2503.02399 | null |
2025-03-04 | ReSo: A Reward-driven Self-organizing LLM-based Multi-Agent System for Reasoning Tasks | Heng Zhou et.al. | 2503.02390 | link |
2025-02-28 | Hybrid Team Tetris: A New Platform For Hybrid Multi-Agent, Multi-Human Teaming | Kaleb Mcdowell et.al. | 2502.21300 | null |
2025-02-28 | Persuasion Should be Double-Blind: A Multi-Domain Dialogue Dataset With Faithfulness Based on Causal Theory of Mind | Dingyi Zhang et.al. | 2502.21297 | null |
2025-02-28 | Towards Developing Ethical Reasoners: Integrating Probabilistic Reasoning and Decision-Making for Complex AI Systems | Nijesh Upreti et.al. | 2502.21250 | null |
2025-02-28 | ARIES: Autonomous Reasoning with LLMs on Interactive Thought Graph Environments | Pedro Gimenes et.al. | 2502.21208 | null |
2025-02-28 | The Power of Personality: A Human Simulation Perspective to Investigate Large Language Model Agents | Yifan Duan et.al. | 2502.20859 | null |
2025-02-28 | ProAI: Proactive Multi-Agent Conversational AI with Structured Knowledge Base for Psychiatric Diagnosis | Yuqi Wu et.al. | 2502.20689 | null |
2025-02-27 | Multi $^2$ : Multi-Agent Test-Time Scalable Framework for Multi-Document Processing | Juntai Cao et.al. | 2502.20592 | null |
2025-02-27 | Close-Proximity Satellite Operations through Deep Reinforcement Learning and Terrestrial Testing Environments | Henry Lei et.al. | 2502.20554 | null |
2025-02-27 | Cooperative Multi-Agent Assignment over Stochastic Graphs via Constrained Reinforcement Learning | Leopoldo Agorio et.al. | 2502.20462 | null |
2025-02-27 | Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers | Shalev Lifshitz et.al. | 2502.20379 | null |
2025-02-27 | Multi-Agent Path Planning in Complex Environments using Gaussian Belief Propagation with Global Path Finding | Jens Høigaard Jensen et.al. | 2502.20369 | link |
2025-02-27 | Trajectory-to-Action Pipeline (TAP): Automated Scenario Description Extraction for Autonomous Vehicle Behavior Comparison | Aron Harder et.al. | 2502.20353 | null |
2025-02-27 | M^3Builder: A Multi-Agent System for Automated Machine Learning in Medical Imaging | Jinghao Feng et.al. | 2502.20301 | null |
2025-02-27 | MARVEL: Multi-Agent Reinforcement Learning for constrained field-of-View multi-robot Exploration in Large-scale environments | Jimmy Chiun et.al. | 2502.20217 | link |
2025-02-27 | Collab-Overcooked: Benchmarking and Evaluating Large Language Models as Collaborative Agents | Haochen Sun et.al. | 2502.20073 | link |
2025-02-27 | A Generative Model Enhanced Multi-Agent Reinforcement Learning Method for Electric Vehicle Charging Navigation | Tianyang Qi et.al. | 2502.20068 | null |
2025-02-27 | RouteRL: Multi-agent reinforcement learning framework for urban route choice with autonomous vehicles | Ahmet Onur Akman et.al. | 2502.20065 | link |
2025-02-27 | MIND: Towards Immersive Psychological Healing with Multi-agent Inner Dialogue | Yujia Chen et.al. | 2502.19860 | null |
2025-02-27 | Exponential Topology-enabled Scalable Communication in Multi-agent Reinforcement Learning | Xinran Li et.al. | 2502.19717 | link |
2025-02-26 | EMT: A Visual Multi-Task Benchmark Dataset for Autonomous Driving in the Arab Gulf Region | Nadya Abdel Madjid et.al. | 2502.19260 | link |
2025-02-26 | Simulation of Language Evolution under Regulated Social Media Platforms: A Synergistic Approach of Large Language Models and Genetic Algorithms | Jinyu Cai et.al. | 2502.19193 | null |
2025-02-26 | Multi-Agent Security Tax: Trading Off Security and Collaboration Capabilities in Multi-Agent Systems | Pierre Peigne-Lefebvre et.al. | 2502.19145 | null |
2025-02-26 | A Temporal Planning Framework for Multi-Agent Systems via LLM-Aided Knowledge Base Management | Enrico Saccon et.al. | 2502.19135 | null |
2025-02-26 | Voting or Consensus? Decision-Making in Multi-Agent Debate | Lars Benedikt Kaesberg et.al. | 2502.19130 | link |
2025-02-26 | Nexus: A Lightweight and Scalable Multi-Agent Framework for Complex Tasks Automation | Humza Sami et.al. | 2502.19091 | link |
2025-02-26 | A Multi-Agent DRL-Based Framework for Optimal Resource Allocation and Twin Migration in the Multi-Tier Vehicular Metaverse | Nahom Abishu Hayla et.al. | 2502.19004 | null |
2025-02-26 | Multi-LLM Collaborative Search for Complex Problem Solving | Sen Yang et.al. | 2502.18873 | null |
2025-02-26 | Towards an AI co-scientist | Juraj Gottweis et.al. | 2502.18864 | null |
2025-02-26 | REALM-Bench: A Real-World Planning Benchmark for LLMs and Multi-Agent Systems | Longling Geng et.al. | 2502.18836 | link |
2025-02-25 | MAPoRL: Multi-Agent Post-Co-Training for Collaborative Large Language Models with Reinforcement Learning | Chanwoo Park et.al. | 2502.18439 | null |
2025-02-25 | Debt Collection Negotiations with Large Language Models: An Evaluation System and Optimizing Decision Making with Multi-Agent | Xiaofeng Wang et.al. | 2502.18228 | null |
2025-02-25 | ChatMotion: A Multimodal Multi-Agent for Human Motion Analysis | Li Lei et.al. | 2502.18180 | null |
2025-02-25 | ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents | Qiuchen Wang et.al. | 2502.18017 | link |
2025-02-25 | FACT-AUDIT: An Adaptive Multi-Agent Framework for Dynamic Fact-Checking Evaluation of Large Language Models | Hongzhan Lin et.al. | 2502.17924 | link |
2025-02-25 | CAML: Collaborative Auxiliary Modality Learning for Multi-Agent Systems | Rui Liu et.al. | 2502.17821 | null |
2025-02-25 | Safe Multi-Agent Navigation guided by Goal-Conditioned Safe Reinforcement Learning | Meng Feng et.al. | 2502.17813 | link |
2025-02-24 | METAL: A Multi-Agent Framework for Chart Generation with Test-Time Scaling | Bingxuan Li et.al. | 2502.17651 | null |
2025-02-24 | Hierarchical Imitation Learning of Team Behavior from Heterogeneous Demonstrations | Sangwon Seo et.al. | 2502.17618 | null |
2025-02-24 | Distributed Coordination for Heterogeneous Non-Terrestrial Networks | Jikang Deng et.al. | 2502.17366 | null |
2025-02-25 | Survey on Strategic Mining in Blockchain: A Reinforcement Learning Approach | Jichen Li et.al. | 2502.17307 | null |
2025-02-24 | Semantic-Aware Dynamic and Distributed Power Allocation: a Multi-UAV Area Coverage Use Case | Hamidreza Mazandarani et.al. | 2502.17120 | null |
2025-02-25 | Mobile-Agent-V: Learning Mobile Device Operation Through Video-Guided Multi-Agent Collaboration | Junyang Wang et.al. | 2502.17110 | null |
2025-02-24 | MA2RL: Masked Autoencoders for Generalizable Multi-Agent Reinforcement Learning | Jinyuan Feng et.al. | 2502.17046 | null |
2025-02-24 | Engineering and Validating Cyber-Physical Energy Systems: Needs, Status Quo, and Research Trends | Thomas I. Strasser et.al. | 2502.16991 | null |
2025-02-24 | Leveraging Large Language Models for Effective and Explainable Multi-Agent Credit Assignment | Kartik Nagpal et.al. | 2502.16863 | null |
2025-02-24 | Multi-Agent Autonomous Driving Systems with Large Language Models: A Survey of Recent Advances | Yaozu Wu et.al. | 2502.16804 | null |
2025-02-24 | MobileSteward: Integrating Multiple App-Oriented Agents with Self-Evolution to Automate Cross-App Instructions | Yuxuan Liu et.al. | 2502.16796 | null |
2025-02-23 | Guardians of the Agentic System: Preventing Many Shots Jailbreak with Agentic System | Saikat Barua et.al. | 2502.16750 | link |
2025-02-21 | Multi-Agent Architecture in Distributed Environment Control Systems: vision, challenges, and opportunities | Natasha Astudillo et.al. | 2502.15663 | null |
2025-02-21 | Pub-Guard-LLM: Detecting Fraudulent Biomedical Articles with Reliable Explanations | Lihu Chen et.al. | 2502.15429 | link |
2025-02-21 | TAG: A Decentralized Framework for Multi-Agent Hierarchical Reinforcement Learning | Giuseppe Paolo et.al. | 2502.15425 | link |
2025-02-21 | Learning with Limited Shared Information in Multi-agent Multi-armed Bandit | Junning Shao et.al. | 2502.15338 | null |
2025-02-21 | Real-Time Moving Flock Detection in Pedestrian Trajectories Using Sequential Deep Learning Models | Amartaivan Sanjjamts et.al. | 2502.15252 | null |
2025-02-21 | Multi-agent Multi-armed Bandits with Minimum Reward Guarantee Fairness | Piyushi Manupriya et.al. | 2502.15240 | link |
2025-02-21 | Investigating the Adaptive Robustness with Knowledge Conflicts in LLM-based Multi-Agent Systems | Tianjie Ju et.al. | 2502.15153 | link |
2025-02-20 | Voter Model Meets Rumour Spreading: A Study of Consensus Protocols on Graphs with Agnostic Nodes [Extended Version] | Marcelo Matheus Gauy et.al. | 2502.15029 | null |
2025-02-20 | Red-Teaming LLM Multi-Agent Systems via Communication Attacks | Pengfei He et.al. | 2502.14847 | null |
2025-02-20 | Optimizing Model Selection for Compound AI Systems | Lingjiao Chen et.al. | 2502.14815 | link |
2025-02-20 | Planning, scheduling, and execution on the Moon: the CADRE technology demonstration mission | Gregg Rabideau et.al. | 2502.14803 | null |
2025-02-20 | A Multi-Agent Perspective on Modern Information Retrieval | Haya Nachimovsky et.al. | 2502.14796 | null |
2025-02-20 | Tree-of-Debate: Multi-Persona Debate Trees Elicit Critical Thinking for Scientific Comparative Analysis | Priyanka Kargupta et.al. | 2502.14767 | link |
2025-02-21 | Multi-Agent Coordination across Diverse Applications: A Survey | Lijun Sun et.al. | 2502.14743 | null |
2025-02-20 | Curiosity Driven Multi-agent Reinforcement Learning for 3D Game Testing | Raihana Ferdous et.al. | 2502.14606 | link |
2025-02-20 | CORBA: Contagious Recursive Blocking Attacks on Multi-Agent Systems Based on Large Language Models | Zhenhong Zhou et.al. | 2502.14529 | link |
2025-02-20 | Enhancing Language Multi-Agent Learning with Multi-Agent Credit Re-Assignment for Interactive Environment Generalization | Zhitao He et.al. | 2502.14496 | link |
2025-02-20 | Data-Driven Cooperative Output Regulation via Distributed Internal Model | Liquan Lin et.al. | 2502.14336 | null |
2025-02-19 | Exploring Personalized Health Support through Data-Driven, Theory-Guided LLMs: A Case Study in Sleep Health | Xingbo Wang et.al. | 2502.13920 | link |
2025-02-19 | From Correctness to Comprehension: AI Agents for Personalized Error Diagnosis in Education | Yi-Fan Zhang et.al. | 2502.13789 | null |
2025-02-19 | Poster: SpiderSim: Multi-Agent Driven Theoretical Cybersecurity Simulation for Industrial Digitalization | Jiaqi Li et.al. | 2502.13778 | link |
2025-02-19 | Causes and Strategies in Multiagent Systems | Sylvia S. Kerkhove et.al. | 2502.13701 | null |
2025-02-19 | Decentralized Planning Using Probabilistic Hyperproperties | Francesco Pontiggia et.al. | 2502.13621 | null |
2025-02-19 | Vision-Based Generic Potential Function for Policy Alignment in Multi-Agent Reinforcement Learning | Hao Ma et.al. | 2502.13430 | null |
2025-02-19 | Learning Symbolic Task Decompositions for Multi-Agent Teams | Ameesh Shah et.al. | 2502.13376 | link |
2025-02-18 | Communication Strategy on Macro-and-Micro Traffic State in Cooperative Deep Reinforcement Learning for Regional Traffic Signal Control | Hankang Gu et.al. | 2502.13248 | null |
2025-02-18 | You need to MIMIC to get FAME: Solving Meeting Transcript Scarcity with a Multi-Agent Conversations | Frederic Kirstein et.al. | 2502.13001 | null |
2025-02-18 | Free Argumentative Exchanges for Explaining Image Classifiers | Avinash Kori et.al. | 2502.12995 | link |
2025-02-18 | A Survey on DRL based UAV Communications and Networking: DRL Fundamentals, Applications and Implementations | Wei Zhao et.al. | 2502.12875 | null |
2025-02-18 | Perovskite-LLM: Knowledge-Enhanced Large Language Models for Perovskite Solar Cell Research | Xiang Liu et.al. | 2502.12669 | null |
2025-02-18 | Automating Prompt Leakage Attacks on Large Language Models Using Agentic Approach | Tvrtko Sternak et.al. | 2502.12630 | link |
2025-02-18 | Hypernetwork-based approach for optimal composition design in partially controlled multi-agent systems | Kyeonghyeon Park et.al. | 2502.12605 | null |
2025-02-18 | Simulating Cooperative Prosocial Behavior with Multi-Agent LLMs: Evidence and Mechanisms for AI Agents to Inform Policy Decisions | Karthik Sreedhar et.al. | 2502.12504 | null |
2025-02-17 | Stochastic Real-Time Deception in Nash Equilibrium Seeking for Games with Quadratic Payoffs | Michael Tang et.al. | 2502.12337 | null |
2025-02-17 | HARBOR: Exploring Persona Dynamics in Multi-Agent Competition | Kenan Jiang et.al. | 2502.12149 | null |
2025-02-17 | A survey about perceptions of mobility to inform an agent-based simulator of subjective modal choice | Carole Adam et.al. | 2502.12058 | null |
2025-02-17 | Multi-agent coordination via communication partitions | Wei-Chen Lee et.al. | 2502.12042 | null |
2025-02-17 | Table-Critic: A Multi-Agent Framework for Collaborative Criticism and Refinement in Table Reasoning | Peiying Yu et.al. | 2502.11799 | link |
2025-02-17 | Changing the Rules of the Game: Reasoning about Dynamic Phenomena in Multi-Agent Systems | Rustam Galimullin et.al. | 2502.11785 | null |
2025-02-17 | Enhancing Recommendation Explanations through User-Centric Refinement | Jingsen Zhang et.al. | 2502.11721 | null |
2025-02-17 | Deviation Ratings: A General, Clone-Invariant Rating Method | Luke Marris et.al. | 2502.11645 | null |
2025-02-17 | A New Lyapunov-like Stability Inequality with an \textit{Asymmetric} Matrix and Application to Suboptimal LQ Control Design (to be corrected) | Avinash Kumar et.al. | 2502.11556 | null |
2025-02-17 | Generative Multi-Agent Collaboration in Embodied AI: A Systematic Review | Di Wu et.al. | 2502.11518 | null |
2025-02-16 | Unlocking the Potential of Generative AI through Neuro-Symbolic Architectures: Benefits and Limitations | Oualid Bougzime et.al. | 2502.11269 | null |
2025-02-14 | Reinforcement Learning in Strategy-Based and Atari Games: A Review of Google DeepMinds Innovations | Abdelrhman Shaheen et.al. | 2502.10303 | null |
2025-02-14 | Learning to Solve the Min-Max Mixed-Shelves Picker-Routing Problem via Hierarchical and Parallel Decoding | Laurin Luttmann et.al. | 2502.10233 | link |
2025-02-14 | Reinforcement Learning based Constrained Optimal Control: an Interpretable Reward Design | Jingjie Ni et.al. | 2502.10187 | null |
2025-02-14 | Cooperative Multi-Agent Planning with Adaptive Skill Synthesis | Zhiyuan Li et.al. | 2502.10148 | null |
2025-02-14 | A Survey on LLM-powered Agents for Recommender Systems | Qiyao Peng et.al. | 2502.10050 | null |
2025-02-14 | Evaluating and Improving Graph-based Explanation Methods for Multi-Agent Coordination | Siva Kailas et.al. | 2502.09889 | null |
2025-02-14 | Robust Event-Triggered Integrated Communication and Control with Graph Information Bottleneck Optimization | Ziqiong Wang et.al. | 2502.09846 | null |
2025-02-13 | Incentivize without Bonus: Provably Efficient Model-based Online Multi-agent RL for Markov Games | Tong Yang et.al. | 2502.09780 | null |
2025-02-13 | KIMAs: A Configurable Knowledge Integrated Multi-Agent System | Zitao Li et.al. | 2502.09596 | null |
2025-02-13 | Language Agents as Digital Representatives in Collective Decision-Making | Daniel Jarrett et.al. | 2502.09369 | null |
2025-02-13 | Mind the Gaps: Logical English, Prolog, and Multi-agent Systems for Autonomous Vehicles | Galileo Sartor et.al. | 2502.09216 | null |
2025-02-13 | Multi-agent systems with multiple-wise interaction: Propagation of chaos and macroscopic limit | Thierry Paul et.al. | 2502.09098 | null |
2025-02-13 | Few is More: Task-Efficient Skill-Discovery for Multi-Task Offline Multi-Agent Reinforcement Learning | Xun Wang et.al. | 2502.08985 | null |
2025-02-13 | SkyRover: A Modular Simulator for Cross-Domain Pathfinding | Wenhui Ma et.al. | 2502.08969 | null |
2025-02-13 | Single-Agent Planning in a Multi-Agent System: A Unified Framework for Type-Based Planners | Fengming Zhu et.al. | 2502.08950 | link |
2025-02-13 | PathFinder: A Multi-Modal Multi-Agent System for Medical Diagnostic Decision-Making Applied to Histopathology | Fatemeh Ghezloo et.al. | 2502.08916 | null |
2025-02-12 | If Multi-Agent Debate is the Answer, What is the Question? | Hangfan Zhang et.al. | 2502.08788 | null |
2025-02-12 | Poly-Autoregressive Prediction for Modeling Interactions | Neerja Thakkar et.al. | 2502.08646 | null |
2025-02-12 | Faithful, Unfaithful or Ambiguous? Multi-Agent Debate with Initial Stance for Summary Evaluation | Mahnaz Koupaee et.al. | 2502.08514 | link |
2025-02-12 | Resilient Quantized Consensus in Multi-Hop Relay Networks | Liwei Yuan et.al. | 2502.08455 | null |
2025-02-12 | Towards Principled Multi-Agent Task Agnostic Exploration | Riccardo Zamboni et.al. | 2502.08365 | null |
2025-02-12 | Hierarchical Multi-Agent Framework for Carbon-Efficient Liquid-Cooled Data Center Clusters | Soumyendu Sarkar et.al. | 2502.08337 | null |
2025-02-12 | Decentralised multi-agent coordination for real-time railway traffic management | Leo D’Amato et.al. | 2502.08324 | null |
2025-02-12 | Flow-of-Action: SOP Enhanced LLM-Based Multi-Agent System for Root Cause Analysis | Changhua Pei et.al. | 2502.08224 | null |
2025-02-12 | Generative AI-Enhanced Cooperative MEC of UAVs and Ground Stations for Unmanned Surface Vehicles | Jiahao You et.al. | 2502.08119 | null |
2025-02-12 | Multi-Agent Performative Prediction Beyond the Insensitivity Assumption: A Case Study for Mortgage Competition | Guanghui Wang et.al. | 2502.08063 | null |
2025-02-12 | End-to-End Predictive Planner for Autonomous Driving with Consistency Models | Anjian Li et.al. | 2502.08033 | null |
2025-02-11 | Distributed Value Decomposition Networks with Networked Agents | Guilherme S. Varela et.al. | 2502.07635 | null |
2025-02-11 | A Near-optimal, Scalable and Corruption-tolerant Framework for Stochastic Bandits: From Single-Agent to Multi-Agent and Beyond | Zicheng Hu et.al. | 2502.07514 | null |
2025-02-11 | Multi-Agent Collaboration for Multilingual Code Instruction Tuning | Jian Yang et.al. | 2502.07487 | null |
2025-02-11 | On Event-Triggered Resilient Consensus Using Auxiliary Layer | Pushkal Purohit et.al. | 2502.07470 | null |
2025-02-11 | Approximating Human Strategic Reasoning with LLM-Enhanced Recursive Reasoners Leveraging Multi-agent Hypergames | Vince Trencsenyi et.al. | 2502.07443 | null |
2025-02-11 | EvoFlow: Evolving Diverse Agentic Workflows On The Fly | Guibin Zhang et.al. | 2502.07373 | null |
2025-02-11 | KABB: Knowledge-Aware Bayesian Bandits for Dynamic Expert Coordination in Multi-Agent Systems | Jusheng Zhang et.al. | 2502.07350 | null |
2025-02-11 | Fairness in Multi-Agent AI: A Unified Framework for Ethical and Equitable Autonomous Systems | Rajesh Ranjan et.al. | 2502.07254 | null |
2025-02-11 | Don’t Just Demo, Teach Me the Principles: A Principle-Based Multi-Agent Prompting Strategy for Text Classification | Peipei Wei et.al. | 2502.07165 | null |
2025-02-10 | Who is Helping Whom? Analyzing Inter-dependencies to Evaluate Cooperation in Human-AI Teaming | Upasana Biswas et.al. | 2502.06976 | null |
2025-02-10 | KARMA: Leveraging Multi-Agent LLMs for Automated Knowledge Graph Enrichment | Yuxing Lu et.al. | 2502.06472 | link |
2025-02-10 | SIGMA: Sheaf-Informed Geometric Multi-Agent Pathfinding | Shuhao Liao et.al. | 2502.06440 | link |
2025-02-10 | Reducing Variance Caused by Communication in Decentralized Multi-agent Deep Reinforcement Learning | Changxi Zhu et.al. | 2502.06261 | null |
2025-02-10 | Amplifying Minority Voices: AI-Mediated Devil’s Advocate System for Inclusive Group Decision-Making | Soohwan Lee et.al. | 2502.06251 | null |
2025-02-10 | C-3PO: Compact Plug-and-Play Proxy Optimization to Achieve Human-like Retrieval-Augmented Generation | Guoxin Chen et.al. | 2502.06205 | null |
2025-02-10 | Towards Bio-inspired Heuristically Accelerated Reinforcement Learning for Adaptive Underwater Multi-Agents Behaviour | Antoine Vivien et.al. | 2502.06113 | null |
2025-02-09 | Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning | Bidipta Sarkar et.al. | 2502.06060 | link |
2025-02-09 | Preventing Rogue Agents Improves Multi-Agent Collaboration | Ohav Barbi et.al. | 2502.05986 | link |
2025-02-09 | Redefining Robot Generalization Through Interactive Intelligence | Sharmita Dey et.al. | 2502.05963 | null |
2025-02-09 | MetaChain: A Fully-Automated and Zero-Code Framework for LLM Agents | Jiabin Tang et.al. | 2502.05957 | link |
2025-02-07 | Near-Optimal Online Learning for Multi-Agent Submodular Coordination: Tight Approximation and Communication Efficiency | Qixin Zhang et.al. | 2502.05028 | null |
2025-02-07 | $TAR^2$ : Temporal-Agent Reward Redistribution for Optimal Policy Preservation in Multi-Agent Reinforcement Learning | Aditya Kapoor et.al. | 2502.04864 | null |
2025-02-07 | S $^2$ -MAD: Breaking the Token Barrier to Enhance Multi-Agent Debate Efficiency | Yuting Zeng et.al. | 2502.04790 | null |
2025-02-07 | SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning | Wanjia Zhao et.al. | 2502.04780 | link |
2025-02-07 | An Extended Benchmarking of Multi-Agent Reinforcement Learning Algorithms in Complex Fully Cooperative Tasks | George Papadopoulos et.al. | 2502.04773 | link |
2025-02-07 | Multi-Agent Coverage Control in Non-Convex Annulus Region with Conformal Mapping | Xun Feng et.al. | 2502.04697 | null |
2025-02-06 | Distributed Resilient Asymmetric Bipartite Consensus: A Data-Driven Event-Triggered Mechanism | Yi Zhang et.al. | 2502.04497 | null |
2025-02-06 | Multi-Agent Reinforcement Learning with Focal Diversity Optimization | Selim Furkan Tekin et.al. | 2502.04492 | link |
2025-02-06 | ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization | Yinjie Wang et.al. | 2502.04306 | link |
2025-02-06 | DECAF: Learning to be Fair in Multi-agent Resource Allocation | Ashwin Kumar et.al. | 2502.04281 | null |
2025-02-06 | Free Energy Risk Metrics for Systemically Safe AI: Gatekeeping Multi-Agent Study | Michael Walters et.al. | 2502.04249 | null |
2025-02-06 | Multi-agent Architecture Search via Agentic Supernet | Guibin Zhang et.al. | 2502.04180 | link |
2025-02-06 | Simulating the Emergence of Differential Case Marking with Communicating Neural-Network Agents | Yuchen Lian et.al. | 2502.04038 | null |
2025-02-06 | Deep Meta Coordination Graphs for Multi-agent Reinforcement Learning | Nikunj Gupta et.al. | 2502.04028 | link |
2025-02-06 | Fairness Aware Reinforcement Learning via Proximal Policy Optimization | Gabriele La Malfa et.al. | 2502.03953 | null |
2025-02-06 | Enhancing Online Learning Efficiency Through Heterogeneous Resource Integration with a Multi-Agent RAG System | Devansh Srivastav et.al. | 2502.03948 | null |
2025-02-06 | Geometric Stabilization of Virtual Nonlinear Nonholonomic Constraints | Efstratios Stratoglou et.al. | 2502.03902 | null |
2025-02-06 | Any theory that admits a Wigner’s Friend type multi-agent paradox is logically contextual | Nuriya Nurgalieva et.al. | 2502.03874 | null |
2025-02-05 | Energy-Efficient Flying LoRa Gateways: A Multi-Agent Reinforcement Learning Approach | Abdullahi Isa Ahmed et.al. | 2502.03377 | null |
2025-02-05 | Inverse Mixed Strategy Games with Generative Trajectory Models | Max Muchen Sun et.al. | 2502.03356 | null |
2025-02-05 | Double Distillation Network for Multi-Agent Reinforcement Learning | Yang Zhou et.al. | 2502.03125 | null |
2025-02-05 | Learning Efficient Flocking Control based on Gibbs Random Fields | Dengyu Zhang et.al. | 2502.02984 | null |
2025-02-05 | Heterogeneous Value Decomposition Policy Fusion for Multi-Agent Cooperation | Siying Wang et.al. | 2502.02875 | null |
2025-02-05 | Gap-Dependent Bounds for Federated $Q$ -learning | Haochen Zhang et.al. | 2502.02859 | null |
2025-02-05 | Wolfpack Adversarial Attack for Robust Multi-Agent Reinforcement Learning | Sunwoo Lee et.al. | 2502.02844 | link |
2025-02-04 | Intelligent Sensing-to-Action for Robust Autonomy at the Edge: Opportunities and Challenges | Amit Ranjan Trivedi et.al. | 2502.02692 | null |
2025-02-04 | Multi-Agent Design: Optimizing Agents with Better Prompts and Topologies | Han Zhou et.al. | 2502.02533 | null |
2025-02-04 | MAGNNET: Multi-Agent Graph Neural Network-based Efficient Task Allocation for Autonomous Vehicles with Deep Reinforcement Learning | Lavanya Ratnabala et.al. | 2502.02311 | null |
2025-02-04 | Sequential Multi-objective Multi-agent Reinforcement Learning Approach for Predictive Maintenance | Yan Chen et.al. | 2502.02071 | null |
2025-02-04 | CH-MARL: Constrained Hierarchical Multiagent Reinforcement Learning for Sustainable Maritime Logistics | Saad Alqithami et.al. | 2502.02060 | null |
2025-02-04 | Bottom-Up Reputation Promotes Cooperation with Multi-Agent Reinforcement Learning | Tianyu Ren et.al. | 2502.01971 | link |
2025-02-04 | VolleyBots: A Testbed for Multi-Drone Volleyball Game Combining Motion Control and Strategic Play | Zelai Xu et.al. | 2502.01932 | null |
2025-02-03 | An Agentic AI Workflow for Detecting Cognitive Concerns in Real-world Data | Jiazi Tian et.al. | 2502.01789 | null |
2025-02-03 | Position: Towards a Responsible LLM-empowered Multi-Agent Systems | Jinwei Hu et.al. | 2502.01714 | null |
2025-02-04 | Visual Theory of Mind Enables the Invention of Writing Systems | Benjamin A. Spiegel et.al. | 2502.01568 | null |
2025-02-05 | TwinMarket: A Scalable Behavioral and Social Simulation for Financial Markets | Yuzhe Yang et.al. | 2502.01506 | link |
2025-01-31 | Learning Contracts in Hierarchical Multi-Agent Systems | Antoine Scheid et.al. | 2501.19388 | null |
2025-01-31 | Multi-agent Multi-armed Bandit with Fully Heavy-tailed Dynamics | Xingyu Wang et.al. | 2501.19239 | null |
2025-01-31 | A parallelizable variant of HCA* | Sreenivasan Ganti et.al. | 2501.19218 | null |
2025-01-31 | Autonomous Legacy Web Application Upgrades Using a Multi-Agent System | Valtteri Ala-Salmi et.al. | 2501.19204 | link |
2025-01-31 | Prediction-Aware Learning in Multi-Agent Systems | Aymeric Capitaine et.al. | 2501.19144 | null |
2025-01-31 | O-MAPL: Offline Multi-agent Preference Learning | The Viet Bui et.al. | 2501.18944 | null |
2025-01-31 | Language Games as the Pathway to Artificial Superhuman Intelligence | Ying Wen et.al. | 2501.18924 | null |
2025-01-30 | Deceptive Sequential Decision-Making via Regularized Policy Optimization | Yerin Kim et.al. | 2501.18803 | null |
2025-01-30 | Survey and Improvement Strategies for Gene Prioritization with Large Language Models | Matthew Neeley et.al. | 2501.18794 | null |
2025-02-04 | Invisible Traces: Using Hybrid Fingerprinting to identify underlying LLMs in GenAI Apps | Devansh Bhardwaj et.al. | 2501.18712 | null |
2025-01-30 | Leveraging LLM Agents for Automated Optimization Modeling for SASP Problems: A Graph-RAG based Approach | Tianpeng Pan et.al. | 2501.18320 | null |
2025-01-30 | Model Checking for Multi-Agent Systems Modeled By Epistemic Process Calculus | Qixian Yu et.al. | 2501.18155 | null |
2025-01-30 | B3C: A Minimalist Approach to Offline Multi-Agent Reinforcement Learning | Woojun Kim et.al. | 2501.18138 | null |
2025-01-29 | Free Agent in Agent-Based Mixture-of-Experts Generative AI Framework | Jung-Hua Liu et.al. | 2501.17903 | null |
2025-01-29 | Multi-Agent Path Finding Using Conflict-Based Search and Structural-Semantic Topometric Maps | Scott Fredriksson et.al. | 2501.17661 | null |
2025-01-29 | Coalitional control: a bottom-up approach | Filiberto Fele et.al. | 2501.17614 | null |
2025-01-29 | Optimal Utility Design with Arbitrary Information Networks | Vartika Singh et.al. | 2501.17385 | null |
2025-01-28 | Anomaly Detection in Cooperative Vehicle Perception Systems under Imperfect Communication | Ashish Bastola et.al. | 2501.17329 | link |
2025-01-28 | Learning Mean Field Control on Sparse Graphs | Christian Fabian et.al. | 2501.17079 | null |
2025-01-28 | Revisit Mixture Models for Multi-Agent Simulation: Experimental Study within a Unified Framework | Longzhong Lin et.al. | 2501.17015 | null |
2025-01-28 | Towards Open-Source and Modular Space Systems with ATMOS | Pedro Roque et.al. | 2501.16973 | link |
2025-01-28 | Beyond Human Intervention: Algorithmic Collusion through Multi-Agent Learning Strategies | Suzie Grondin et.al. | 2501.16935 | null |
2025-01-28 | Optimization and Learning in Open Multi-Agent Systems | Diego Deplano et.al. | 2501.16847 | null |
2025-01-28 | RG-Attn: Radian Glue Attention for Multi-modality Multi-agent Cooperative Perception | Lantao Li et.al. | 2501.16803 | link |
2025-01-29 | MACI: Multi-Agent Collaborative Intelligence for Adaptive Reasoning and Temporal Planning | Edward Y. Chang et.al. | 2501.16689 | null |
2025-01-28 | Jupybara: Operationalizing a Design Space for Actionable Data Analysis and Storytelling with LLMs | Huichen Will Wang et.al. | 2501.16661 | null |
2025-01-28 | MCTS-SQL: An Effective Framework for Text-to-SQL with Monte Carlo Tree Search | Shuozhi Yuan et.al. | 2501.16607 | null |
2025-01-27 | Multi-Agent Geospatial Copilots for Remote Sensing Workflows | Chaehong Lee et.al. | 2501.16254 | null |
2025-01-27 | Multi-Agent Meta-Offline Reinforcement Learning for Timely UAV Path Planning and Data Collection | Eslam Eldeeb et.al. | 2501.16098 | null |
2025-01-27 | Value-oriented forecast reconciliation for renewables in electricity markets | Honglin Wen et.al. | 2501.16086 | null |
2025-01-27 | Modeling and stability analysis of live systems with time-varying dimension | Andrii Mironchenko et.al. | 2501.15991 | null |
2025-01-27 | MADP: Multi-Agent Deductive Planning for Enhanced Cognitive-Behavioral Mental Health Question Answer | Qi Chen et.al. | 2501.15826 | null |
2025-01-27 | Adaptive AI-based Decentralized Resource Management in the Cloud-Edge Continuum | Lanpei Li et.al. | 2501.15802 | null |
2025-01-27 | Harnessing Diverse Perspectives: A Multi-Agent Framework for Enhanced Error Detection in Knowledge Graphs | Yu Li et.al. | 2501.15791 | link |
2025-01-27 | LLM-powered Multi-agent Framework for Goal-oriented Learning in Intelligent Tutoring System | Tianfu Wang et.al. | 2501.15749 | link |
2025-01-27 | Selective Experience Sharing in Reinforcement Learning Enhances Interference Management | Madan Dahal et.al. | 2501.15735 | null |
2025-01-27 | Prioritized Value-Decomposition Network for Explainable AI-Enabled Network Slicing | Shavbo Salehi et.al. | 2501.15734 | null |
2025-01-24 | Hybrid Quantum-Classical Multi-Agent Pathfinding | Thore Gerlach et.al. | 2501.14568 | null |
2025-01-24 | Breaking the Pre-Planning Barrier: Real-Time Adaptive Coordination of Mission and Charging UAVs Using Graph Reinforcement Learning | Yuhan Hu et.al. | 2501.14488 | null |
2025-01-24 | MARL-OT: Multi-Agent Reinforcement Learning Guided Online Fuzzing to Detect Safety Violation in Autonomous Driving Systems | Linfeng Liang et.al. | 2501.14451 | null |
2025-01-24 | MASTER: A Multi-Agent System with LLM Specialized MCTS | Bingzheng Gan et.al. | 2501.14304 | null |
2025-01-24 | Multi-agent KTO: Reinforcing Strategic Interactions of Large Language Model in Language Game | Rong Ye et.al. | 2501.14225 | null |
2025-01-24 | Distributed Multi-Agent Coordination Using Multi-Modal Foundation Models | Saaduddin Mahmud et.al. | 2501.14189 | null |
2025-01-23 | Collaborating in a competitive world: Heterogeneous Multi-Agent Decision Making in Symbiotic Supply Chain Environments | Wan Wang et.al. | 2501.14111 | link |
2025-01-23 | Scalable Safe Multi-Agent Reinforcement Learning for Multi-Agent System | Haikuo Du et.al. | 2501.13727 | link |
2025-01-23 | WFCRL: A Multi-Agent Reinforcement Learning Benchmark for Wind Farm Control | Claire Bizon Monroc et.al. | 2501.13592 | link |
2025-01-23 | Explainable AI-aided Feature Selection and Model Reduction for DRL-based V2X Resource Allocation | Nasir Khan et.al. | 2501.13552 | null |
2025-01-23 | Knowledge-Informed Multi-Agent Trajectory Prediction at Signalized Intersections for Infrastructure-to-Everything | Huilin Yin et.al. | 2501.13461 | null |
2025-01-23 | BMG-Q: Localized Bipartite Match Graph Attention Q-Learning for Ride-Pooling Order Dispatch | Yulong Hu et.al. | 2501.13448 | null |
2025-01-23 | VulnBot: Autonomous Penetration Testing for A Multi-Agent Collaborative Framework | He Kong et.al. | 2501.13411 | link |
2025-01-23 | Do as We Do, Not as You Think: the Conformity of Large Language Models | Zhiyuan Weng et.al. | 2501.13381 | link |
2025-01-23 | Task Allocation in Customer-led Two-sided Markets with Satellite Constellation Services | Jianglin Qiao et.al. | 2501.13364 | null |
2025-01-23 | AgentRec: Agent Recommendation Using Sentence Embeddings Aligned to Human Feedback | Joshua Park et.al. | 2501.13333 | link |
2025-01-22 | SRMT: Shared Memory for Multi-agent Lifelong Pathfinding | Alsu Sagirova et.al. | 2501.13200 | link |
2025-01-22 | An Offline Multi-Agent Reinforcement Learning Framework for Radio Resource Management | Eslam Eldeeb et.al. | 2501.12991 | null |
2025-01-22 | Learning-based Distributed Model Predictive Control using Multi-Agent Bayesian Optimization | Hossein Nejatbakhsh Esfahani et.al. | 2501.12989 | null |
2025-01-22 | FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in Virtual 3D Spaces | Zhenran Xu et.al. | 2501.12909 | null |
2025-01-22 | ACEBench: Who Wins the Match Point in Tool Learning? | Chen Chen et.al. | 2501.12851 | null |
2025-01-22 | Optimal Rebate Design: Incentives, Competition and Efficiency in Auction Markets | Thibaut Mastrolia et.al. | 2501.12591 | null |
2025-01-21 | mmCooper: A Multi-agent Multi-stage Communication-efficient and Collaboration-robust Cooperative Perception Framework | Bingyi Liu et.al. | 2501.12263 | null |
2025-01-21 | Multi-Agent Feedback Motion Planning using Probably Approximately Correct Nonlinear Model Predictive Control | Mark Gonzales et.al. | 2501.12234 | null |
2025-01-21 | Experience-replay Innovative Dynamics | Tuo Zhang et.al. | 2501.12199 | null |
2025-01-21 | Tackling Uncertainties in Multi-Agent Reinforcement Learning through Integration of Agent Termination Dynamics | Somnath Hazra et.al. | 2501.12061 | link |
2025-01-21 | Equilibria under Dynamic Benchmark Consistency in Non-Stationary Multi-Agent Systems | Ludovico Crippa et.al. | 2501.11897 | null |
2025-01-21 | Policy-Adaptable Methods For Resolving Normative Conflicts Through Argumentation and Graph Colouring | Johnny Joyce et.al. | 2501.11799 | null |
2025-01-20 | Mobile-Agent-E: Self-Evolving Mobile Assistant for Complex Tasks | Zhenhailong Wang et.al. | 2501.11733 | null |
2025-01-20 | Conversation Routines: A Prompt Engineering Framework for Task-Oriented Dialog Systems | Giorgio Robino et.al. | 2501.11613 | null |
2025-01-20 | A Deep Reinforcement Learning based Scheduler for IoT Devices in Co-existence with 5G-NR | Shahida Jabeen et.al. | 2501.11574 | null |
2025-01-20 | PlotEdit: Natural Language-Driven Accessible Chart Editing in PDFs via Multimodal LLM Agents | Kanika Goswami et.al. | 2501.11233 | null |
2025-01-17 | Towards Human-Guided, Data-Centric LLM Co-Pilots | Evgeny Saveliev et.al. | 2501.10321 | null |
2025-01-17 | Deployment of an Aerial Multi-agent System for Automated Task Execution in Large-scale Underground Mining Environments | Niklas Dahlquist et.al. | 2501.10262 | null |
2025-01-17 | GAWM: Global-Aware World Model for Multi-Agent Reinforcement Learning | Zifeng Shi et.al. | 2501.10116 | null |
2025-01-16 | Crossover-BPSO Driven Multi-Agent Technology for Managing Local Energy Systems | Hafiz Majid Hussain et.al. | 2501.09832 | null |
2025-01-16 | Empowering Large Language Models in Wireless Communication: A Novel Dataset and Fine-Tuning Framework | Yushen Lin et.al. | 2501.09631 | null |
2025-01-16 | A Multi-agent System for Hybrid Optimization | Eric S. Fraga et.al. | 2501.09563 | null |
2025-01-18 | Solving the Unsolvable: Translating Case Law in Hong Kong | King-kui Sin et.al. | 2501.09444 | null |
2025-01-16 | ADAGE: A generic two-layer framework for adaptive agent based modelling | Benjamin Patrick Evans et.al. | 2501.09429 | null |
2025-01-16 | AutoCBT: An Autonomous Multi-agent Framework for Cognitive Behavioral Therapy in Psychological Counseling | Ancheng Xu et.al. | 2501.09426 | null |
2025-01-16 | Hierarchical Deep Reinforcement Learning for Adaptive Resource Management in Integrated Terrestrial and Non-Terrestrial Networks | Muhammad Ahmed Mohsin et.al. | 2501.09212 | link |
2025-01-15 | Evaluating GenAI for Simplifying Texts for Education: Improving Accuracy and Consistency for Enhanced Readability | Stephanie L. Day et.al. | 2501.09158 | null |
2025-01-15 | A Reinforcement Learning Approach to Quiet and Safe UAM Traffic Management | Surya Murthy et.al. | 2501.08941 | null |
2025-01-15 | Networked Agents in the Dark: Team Value Learning under Partial Observability | Guilherme S. Varela et.al. | 2501.08778 | null |
2025-01-15 | Application of Deep Reinforcement Learning to UAV Swarming for Ground Surveillance | Raúl Arranz et.al. | 2501.08655 | null |
2025-01-15 | AutoRestTest: A Tool for Automated REST API Testing Using LLMs and MARL | Tyler Stennett et.al. | 2501.08600 | null |
2025-01-14 | ADAM-1: AI and Bioinformatics for Alzheimer’s Detection and Microbiome-Clinical Data Integrations | Ziyuan Huang et.al. | 2501.08324 | null |
2025-01-14 | Engineering LLM Powered Multi-agent Framework for Autonomous CloudOps | Kannan Parthasarathy et.al. | 2501.08243 | null |
2025-01-14 | Dynamic Pricing in High-Speed Railways Using Multi-Agent Reinforcement Learning | Enrique Adrian Villarrubia-Martin et.al. | 2501.08234 | null |
2025-01-14 | Cooperative Patrol Routing: Optimizing Urban Crime Surveillance through Multi-Agent Reinforcement Learning | Juan Palma-Borda et.al. | 2501.08020 | link |
2025-01-14 | Flow: A Modular Approach to Automated Agentic Workflow Generation | Boye Niu et.al. | 2501.07834 | link |
2025-01-14 | Agent-Centric Projection of Prompting Techniques and Implications for Synthetic Training Data for Large Language Models | Dhruv Dhamani et.al. | 2501.07815 | null |
2025-01-14 | Talk to Right Specialists: Routing and Planning in Multi-agent System for Question Answering | Feijie Wu et.al. | 2501.07813 | null |
2025-01-14 | CodeCoR: An LLM-Based Self-Reflective Multi-Agent Framework for Code Generation | Ruwei Pan et.al. | 2501.07811 | null |
2025-01-13 | CBS with Continuous-Time Revisit | Andy Li et.al. | 2501.07744 | null |
2025-01-14 | WebWalker: Benchmarking LLMs in Web Traversal | Jialong Wu et.al. | 2501.07572 | link |
2025-01-13 | How low-cost AI universal approximators reshape market efficiency | Paolo Barucca et.al. | 2501.07489 | null |
2025-01-12 | A novel multi-agent dynamic portfolio optimization learning system based on hierarchical deep reinforcement learning | Ruoyu Sun et.al. | 2501.06832 | null |
2025-01-11 | Guided Code Generation with LLMs: A Multi-Agent Framework for Complex Code Tasks | Amr Almorsi et.al. | 2501.06625 | null |
2025-01-11 | Hierarchical Reinforcement Learning for Optimal Agent Grouping in Cooperative Systems | Liyuan Hu et.al. | 2501.06554 | null |
2025-01-11 | Cooperative Optimal Output Tracking for Discrete-Time Multiagent Systems: Stabilizing Policy Iteration Frameworks and Analysis | Dongdong Li et.al. | 2501.06510 | null |
2025-01-11 | Ptychography using Blind Multi-Mode PMACE | Qiuchen Zhai et.al. | 2501.06470 | link |
2025-01-11 | Reinforcement Learning for Enhancing Sensing Estimation in Bistatic ISAC Systems with UAV Swarms | Obed Morrison Atsu et.al. | 2501.06454 | null |
2025-01-10 | Multi-Agent Collaboration Mechanisms: A Survey of LLMs | Khanh-Tung Tran et.al. | 2501.06322 | null |
2025-01-10 | BioAgents: Democratizing Bioinformatics Analysis with Multi-Agent Systems | Nikita Mehandru et.al. | 2501.06314 | null |
2025-01-10 | A Mixed-Integer Conic Program for the Multi-Agent Moving-Target Traveling Salesman Problem | Allen George Philip et.al. | 2501.06130 | null |
2025-01-10 | Learning Flexible Heterogeneous Coordination with Capability-Aware Shared Hypernetworks | Kevin Fu et.al. | 2501.06058 | link |
2025-01-10 | Scaling Safe Multi-Agent Control for Signal Temporal Logic Specifications | Joe Eappen et.al. | 2501.05639 | link |
2025-01-09 | Control of Overpopulated Tails in Kinetic Epidemic Models | Mattia Zanella et.al. | 2501.05365 | null |
2025-01-09 | On Corrigibility and Alignment in Multi Agent Games | Edmund Dable-Heath et.al. | 2501.05360 | null |
2025-01-09 | CoDe: Communication Delay-Tolerant Multi-Agent Collaboration via Dual Alignment of Intent and Timeliness | Shoucheng Song et.al. | 2501.05207 | null |
2025-01-09 | Constrained Optimization of Charged Particle Tracking with Multi-Agent Reinforcement Learning | Tobias Kortus et.al. | 2501.05113 | null |
2025-01-07 | HIVEX: A High-Impact Environment Suite for Multi-Agent Research (extended version) | Philipp D. Siedler et.al. | 2501.04180 | null |
2025-01-09 | Collaborative Spacecraft Servicing under Partial Feedback using Lyapunov-based Deep Neural Networks | Cristian F. Nino et.al. | 2501.04160 | null |
2025-01-07 | A Unified Attack Detection Strategy for Multi-Agent Systems over Transient and Steady Stages | Jinming Gao et.al. | 2501.03496 | null |
2025-01-06 | Turn-based Multi-Agent Reinforcement Learning Model Checking | Dennis Gross et.al. | 2501.03187 | null |
2025-01-06 | CAMP: Collaborative Attention Model with Profiles for Vehicle Routing Problems | Chuanbo Hua et.al. | 2501.02977 | link |
2025-01-06 | Revisiting Communication Efficiency in Multi-Agent Reinforcement Learning from the Dimensional Analysis Perspective | Chuxiong Sun et.al. | 2501.02888 | null |
2025-01-06 | Enhancing Lifelong Multi-Agent Path Finding with Cache Mechanism | Yimin Tang et.al. | 2501.02803 | null |
2025-01-06 | Multi-Agent Path Finding under Limited Communication Range Constraint via Dynamic Leading | Hoang-Dung Bui et.al. | 2501.02770 | null |
2025-01-04 | Enhancing Workplace Productivity and Well-being Using AI Agent | Ravirajan K et.al. | 2501.02368 | null |
2025-01-04 | Stochastic Generalized Dynamic Games with Coupled Chance Constraints | Seyed Shahram Yadollahi et.al. | 2501.02279 | null |
2025-01-04 | CORD: Generalizable Cooperation via Role Diversity | Kanefumi Matsuyama et.al. | 2501.02221 | null |
2025-01-04 | TACTIC: Task-Agnostic Contrastive pre-Training for Inter-Agent Communication | Peihong Yu et.al. | 2501.02174 | null |
2025-01-03 | SMTL: A Stratified Logic for Expressive Multi-Level Temporal Specifications | Ali Baheri et.al. | 2501.02094 | null |
2025-01-03 | Multi-Agent Conversational Online Learning for Adaptive LLM Response Identification | Xiangxiang Dai et.al. | 2501.01849 | link |
2025-01-03 | Distributed Framework Construction for Affine Formation Control | Huiming Li et.al. | 2501.01817 | null |
2025-01-03 | BLAST: A Stealthy Backdoor Leverage Attack against Cooperative Multi-Agent Deep Reinforcement Learning based Systems | Yinbo Yu et.al. | 2501.01593 | null |
2025-01-02 | PIMAEX: Multi-Agent Exploration through Peer Incentivization | Michael Kölle et.al. | 2501.01266 | null |
2025-01-02 | Harnessing Multi-Agent LLMs for Complex Engineering Problem-Solving: A Framework for Senior Design Projects | Abdullah Mushtaq et.al. | 2501.01205 | null |
2025-01-02 | Communicating Unexpectedness for Out-of-Distribution Multi-Agent Reinforcement Learning | Min Whoo Lee et.al. | 2501.01140 | null |
2025-01-02 | Symmetries-enhanced Multi-Agent Reinforcement Learning | Nikolaos Bousias et.al. | 2501.01136 | null |
2025-01-02 | Cyber-physical Defense for Heterogeneous Multi-agent Systems Against Exponentially Unbounded Attacks on Signed Digraphs | Yichao Wang et.al. | 2501.00990 | null |
2025-01-01 | Defense Strategies for Autonomous Multi-agent Systems: Ensuring Safety and Resilience Under Exponentially Unbounded FDI Attacks | Yichao Wang et.al. | 2501.00973 | null |
2025-01-01 | Intent-based Radio Scheduler for RAN Slicing: Learning to deal with different network scenarios | Cleverson Nahum et.al. | 2501.00950 | link |
2025-01-03 | Large Language Model Based Multi-Agent System Augmented Complex Event Processing Pipeline for Internet of Multimedia Things | Talha Zeeshan et.al. | 2501.00906 | null |
2025-01-01 | Observer-Based Data-Driven Consensus Control for Nonlinear Multi-Agent Systems against DoS and FDI attacks | Yi Zhang et.al. | 2501.00872 | null |
2025-01-01 | LLM-Powered Multi-Agent System for Automated Crypto Portfolio Management | Yichen Luo et.al. | 2501.00826 | null |
2024-12-30 | Exploring and Controlling Diversity in LLM-Agent Conversation | KuanChao Chu et.al. | 2412.21102 | null |
2024-12-30 | Advances in Multi-agent Reinforcement Learning: Persistent Autonomy and Robot Learning Lab Report 2024 | Reza Azadeh et.al. | 2412.21088 | null |
2024-12-30 | Privacy-Aware Multi-Device Cooperative Edge Inference with Distributed Resource Bidding | Wenhao Zhuang et.al. | 2412.21069 | null |
2024-12-30 | UnrealZoo: Enriching Photo-realistic Virtual Worlds for Embodied AI | Fangwei Zhong et.al. | 2412.20977 | null |
2024-12-29 | Game Theory and Multi-Agent Reinforcement Learning : From Nash Equilibria to Evolutionary Dynamics | Neil De La Fuente et.al. | 2412.20523 | null |
2024-12-29 | Planning, Living and Judging: A Multi-agent LLM-based Framework for Cyclical Urban Planning | Hang Ni et.al. | 2412.20505 | null |
2024-12-29 | Exploiting NOMA Transmissions in Multi-UAV-assisted Wireless Networks: From Aerial-RIS to Mode-switching UAVs | Songhan Zhao et.al. | 2412.20484 | null |
2024-12-29 | SatFlow: Scalable Network Planning for LEO Mega-Constellations | Sheng Cen et.al. | 2412.20475 | null |
2024-12-29 | Learning Policies for Dynamic Coalition Formation in Multi-Robot Task Allocation | Lucas C. D. Bezerra et.al. | 2412.20397 | null |
2024-12-29 | Distributed Convex Optimization with State-Dependent (Social) Interactions over Random Networks | Seyyed Shaho Alaviani et.al. | 2412.20354 | null |
2024-12-27 | Bottom-up robust modeling for the foraging behavior of Physarum polycephalum | Damiano Reginato et.al. | 2412.19790 | null |
2024-12-27 | Bidding Games on Markov Decision Processes with Quantitative Reachability Objectives | Guy Avni et.al. | 2412.19609 | null |
2024-12-27 | Scalable Hierarchical Reinforcement Learning for Hyper Scale Multi-Robot Task Planning | Xuan Zhou et.al. | 2412.19538 | null |
2024-12-27 | Casevo: A Cognitive Agents and Social Evolution Simulator | Zexun Jiang et.al. | 2412.19498 | link |
2024-12-27 | Knowledge Graph-Based Multi-Agent Path Planning in Dynamic Environments using WAITR | Ted Edward Holmberg et.al. | 2412.19469 | null |
2024-12-27 | Online distributed algorithms for mixed equilibrium problems in dynamic environments | Hang Xu et.al. | 2412.19399 | null |
2024-12-26 | Swarm Contract: A Multi-Sovereign Agent Consensus Mechanism | Haowei Yang et.al. | 2412.19256 | null |
2024-12-26 | NADER: Neural Architecture Design via Multi-Agent Collaboration | Zekang Yang et.al. | 2412.19206 | null |
2024-12-26 | Hierarchical Multi-agent Meta-Reinforcement Learning for Cross-channel Bidding | Shenghong He et.al. | 2412.19064 | null |
2024-12-24 | Agents on the Bench: Large Language Model Based Multi Agent Framework for Trustworthy Digital Justice | Cong Jiang et.al. | 2412.18697 | null |
2024-12-24 | Multi-Agent Norm Perception and Induction in Distributed Healthcare | Chao Li et.al. | 2412.18454 | null |
2024-12-24 | Muse: A Multimodal Conversational Recommendation Dataset with Scenario-Grounded User Profiles | Zihan Wang et.al. | 2412.18416 | null |
2024-12-24 | Multi-Agents Based on Large Language Models for Knowledge-based Visual Question Answering | Zhongjian Hu et.al. | 2412.18351 | null |
2024-12-24 | MMFactory: A Universal Solution Search Engine for Vision-Language Tasks | Wan-Cyuan Fan et.al. | 2412.18072 | null |
2024-12-23 | Uncertainty-Aware Critic Augmentation for Hierarchical Multi-Agent EV Charging Control | Lo Pang-Yun Ting et.al. | 2412.18047 | null |
2024-12-23 | Multi-Agent Path Finding in Continuous Spaces with Projected Diffusion Models | Jinhao Liang et.al. | 2412.17993 | null |
2024-12-23 | Dynamic Multi-Agent Orchestration and Retrieval for Multi-Source Question-Answer Systems using Large Language Models | Antony Seabra et.al. | 2412.17964 | null |
2024-12-23 | Contrato360 2.0: A Document and Database-Driven Question-Answer System using Large Language Models and Agents | Antony Seabra et.al. | 2412.17942 | null |
2024-12-23 | ResearchTown: Simulator of Human Research Community | Haofei Yu et.al. | 2412.17767 | link |
2024-12-24 | SMAC-Hard: Enabling Mixed Opponent Strategy Script and Self-play on SMAC | Yue Deng et.al. | 2412.17707 | link |
2024-12-23 | CoSurfGS:Collaborative 3D Surface Gaussian Splatting with Distributed Learning for Large Scene Reconstruction | Yuanyuan Gao et.al. | 2412.17612 | null |
2024-12-23 | PC Agent: While You Sleep, AI Works – A Cognitive Journey into Digital World | Yanheng He et.al. | 2412.17589 | link |
2024-12-23 | DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought | Jiaan Wang et.al. | 2412.17498 | link |
2024-12-23 | A Coalition Game for On-demand Multi-modal 3D Automated Delivery System | Farzan Moosavi et.al. | 2412.17252 | null |
2024-12-22 | Multi-Agent Sampling: Scaling Inference Compute for Data Synthesis with Tree Search-Based Agentic Collaboration | Hai Ye et.al. | 2412.17061 | link |
2024-12-22 | FriendsQA: A New Large-Scale Deep Video Understanding Dataset with Fine-grained Topic Categorization for Story Videos | Zhengqian Wu et.al. | 2412.17022 | link |
2024-12-22 | Online Preference-based Reinforcement Learning with Self-augmented Feedback from Large Language Model | Songjun Tu et.al. | 2412.16878 | link |
2024-12-22 | KG4Diagnosis: A Hierarchical Multi-Agent LLM Framework with Knowledge Graph Enhancement for Medical Diagnosis | Kaiwen Zuo et.al. | 2412.16833 | null |
2024-12-20 | Towards Interpretable Radiology Report Generation via Concept Bottlenecks using a Multi-Agentic RAG | Hasan Md Tusfiqur Alam et.al. | 2412.16086 | link |
2024-12-20 | Speedup Techniques for Switchable Temporal Plan Graph Optimization | He Jiang et.al. | 2412.15908 | null |
2024-12-20 | AIR: Unifying Individual and Cooperative Exploration in Collective Multi-Agent Reinforcement Learning | Guangchong Zhou et.al. | 2412.15700 | link |
2024-12-20 | Asynchronous Vector Consensus over Matrix-Weighted Networks | P Raghavendra Rao et.al. | 2412.15681 | null |
2024-12-20 | Tacit Learning with Adaptive Information Selection for Cooperative Multi-Agent Reinforcement Learning | Lunjun Liu et.al. | 2412.15639 | null |
2024-12-20 | Understanding Individual Agent Importance in Multi-Agent System via Counterfactual Reasoning | Chen Jianming et.al. | 2412.15619 | null |
2024-12-20 | Multi Agent Reinforcement Learning for Sequential Satellite Assignment Problems | Joshua Holder et.al. | 2412.15573 | link |
2024-12-20 | Novelty-Guided Data Reuse for Efficient and Diversified Multi-Agent Reinforcement Learning | Yangkun Chen et.al. | 2412.15517 | link |
2024-12-20 | Mitigating Social Bias in Large Language Models: A Multi-Objective Approach within a Multi-Agent Framework | Zhenjie Xu et.al. | 2412.15504 | link |
2024-12-20 | An Agent-based Model for Competitive Agents | Mohammad Daneshvar et.al. | 2412.15485 | null |
2024-12-20 | Probabilistic Strategy Logic with Degrees of Observability | Chunyan Mu et.al. | 2412.15135 | null |
2024-12-19 | Agent-Temporal Credit Assignment for Optimal Policy Preservation in Sparse Multi-Agent Reinforcement Learning | Aditya Kapoor et.al. | 2412.14779 | null |
2024-12-19 | PsyDraw: A Multi-Agent Multimodal System for Mental Health Screening in Left-Behind Children | Yiqun Zhang et.al. | 2412.14769 | link |
2024-12-19 | Bel Esprit: Multi-Agent Framework for Building AI Model Pipelines | Yunsu Kim et.al. | 2412.14684 | null |
2024-12-19 | Coupling and Tensorization of Kinetic Theory and Graph Theory | Datong Zhou et.al. | 2412.14512 | null |
2024-12-18 | A Survey on Large Language Model-based Agents for Statistics and Data Science | Maojun Sun et.al. | 2412.14222 | null |
2024-12-18 | Heterogeneous Multi-Agent Reinforcement Learning for Distributed Channel Access in WLANs | Jiaming Yu et.al. | 2412.14218 | null |
2024-12-18 | Towards privacy-preserving cooperative control via encrypted distributed optimization | Philipp Binfet et.al. | 2412.13953 | null |
2024-12-18 | Meta-Reflection: A Feedback-Free Reflection Learning Framework | Yaoke Wang et.al. | 2412.13781 | null |
2024-12-18 | Heuristic Planner for Communication-Constrained Multi-Agent Multi-Goal Path Planning | Jáchym Herynek et.al. | 2412.13719 | null |
2024-12-19 | A2H: A UI Converter from Android to HarmonyOS Platform | Chen Wang et.al. | 2412.13693 | link |
2024-12-18 | Exploring Multi-Modal Integration with Tool-Augmented LLM Agents for Precise Causal Discovery | ChengAo Shen et.al. | 2412.13667 | link |
2024-12-18 | Large Language Model Federated Learning with Blockchain and Unlearning for Cross-Organizational Collaboration | Xuhan Zuo et.al. | 2412.13551 | null |
2024-12-18 | ROMAS: A Role-Based Multi-Agent System for Database monitoring and Planning | Yi Huang et.al. | 2412.13520 | link |
2024-12-18 | Gradual Vigilance and Interval Communication: Enhancing Value Alignment in Multi-Agent Debates | Rui Zou et.al. | 2412.13471 | null |
2024-12-17 | Multi-Agent Motion Planning For Differential Drive Robots Through Stationary State Search | Jingtian Yan et.al. | 2412.13359 | link |
2024-12-17 | Linear Contracts for Supermodular Functions Based on Graphs | Kanstantsin Pashkovich et.al. | 2412.13290 | null |
2024-12-18 | SafeAgentBench: A Benchmark for Safe Task Planning of Embodied LLM Agents | Sheng Yin et.al. | 2412.13178 | link |
2024-12-17 | Contract-based Design and Verification of Multi-Agent Systems with Quantitative Temporal Requirements | Rafael Dewes et.al. | 2412.13114 | null |
2024-12-17 | A Simplified Algorithm for Joint Real-Time Synchronization, NLoS Identification, and Multi-Agent Localization | Yili Deng et.al. | 2412.12677 | null |
2024-12-17 | PerSphere: A Comprehensive Framework for Multi-Faceted Perspective Retrieval and Summarization | Yun Luo et.al. | 2412.12588 | link |
2024-12-17 | ChatDiT: A Training-Free Baseline for Task-Agnostic Free-Form Chatting with Diffusion Transformers | Lianghua Huang et.al. | 2412.12571 | link |
2024-12-17 | A MARL Based Multi-Target Tracking Algorithm Under Jamming Against Radar | Ziang Wang et.al. | 2412.12547 | link |
2024-12-17 | Swarm Intelligence in Collision-free Formation Control for Multi-UAV Systems with 3D Obstacle Avoidance Maneuvers | Reza Ahmadvand et.al. | 2412.12437 | null |
2024-12-16 | Achieving Collective Welfare in Multi-Agent Reinforcement Learning via Suggestion Sharing | Yue Jin et.al. | 2412.12326 | null |
2024-12-16 | Harnessing Language for Coordination: A Framework and Benchmark for LLM-Driven Multi-Agent Control | Timothée Anne et.al. | 2412.11761 | null |
2024-12-16 | Seeker: Towards Exception Safety Code Generation with Intermediate Language Agents Framework | Xuanming Zhang et.al. | 2412.11713 | null |
2024-12-16 | Loosely Synchronized Rule-Based Planning for Multi-Agent Path Finding with Asynchronous Actions | Shuai Zhou et.al. | 2412.11678 | link |
2024-12-15 | Cultural Palette: Pluralising Culture Alignment via Multi-agent Palette | Jiahao Yuan et.al. | 2412.11167 | link |
2024-12-15 | PromptV: Leveraging LLM-powered Multi-Agent Prompting for High-quality Verilog Generation | Zhendong Mi et.al. | 2412.11014 | null |
2024-12-14 | DSRC: Learning Density-insensitive and Semantic-aware Collaborative Representation against Corruptions | Jingyu Zhang et.al. | 2412.10739 | link |
2024-12-14 | Cluster-Based Multi-Agent Task Scheduling for Space-Air-Ground Integrated Networks | Zhiying Wang et.al. | 2412.10700 | null |
2024-12-14 | Deviate or Not: Learning Coalition Structures with Multiple-bit Observations in Games | Yixuan Even Xu et.al. | 2412.10636 | null |
2024-12-13 | A systematic review of norm emergence in multi-agent systems | Carmengelys Cordova et.al. | 2412.10609 | null |
2024-12-17 | Heterogeneous Multi-Robot Graph Coverage with Proximity and Movement Constraints | Dolev Mutzari et.al. | 2412.10083 | null |
2024-12-13 | Optimized Coordination Strategy for Multi-Aerospace Systems in Pick-and-Place Tasks By Deep Neural Network | Ye Zhang et.al. | 2412.09877 | null |
2024-12-13 | AutoPatent: A Multi-Agent Framework for Automatic Patent Generation | Qiyao Wang et.al. | 2412.09796 | link |
2024-12-12 | MAC-Ego3D: Multi-Agent Gaussian Consensus for Real-Time Collaborative Ego-Motion and Photorealistic 3D Reconstruction | Xiaohao Xu et.al. | 2412.09723 | link |
2024-12-12 | TransferLight: Zero-Shot Traffic Signal Control on any Road-Network | Johann Schmidt et.al. | 2412.09719 | null |
2024-12-12 | DiverseAgentEntropy: Quantifying Black-Box LLM Uncertainty through Diverse Perspectives and Multi-Agent Interaction | Yu Feng et.al. | 2412.09572 | null |
2024-12-12 | From Intention To Implementation: Automating Biomedical Research via LLMs | Yi Luo et.al. | 2412.09429 | null |
2024-12-12 | Reinforcement Learning Within the Classical Robotics Stack: A Case Study in Robot Soccer | Adam Labiosa et.al. | 2412.09417 | null |
2024-12-13 | LMAgent: A Large-scale Multimodal Agents Society for Multi-user Simulation | Yijun Liu et.al. | 2412.09237 | null |
2024-12-12 | Reconfigurable Intelligent Surface for Internet of Robotic Things | Wanli Ni et.al. | 2412.09117 | null |
2024-12-12 | Quantum-Train-Based Distributed Multi-Agent Reinforcement Learning | Kuan-Cheng Chen et.al. | 2412.08845 | null |
2024-12-11 | Automated Soap Opera Testing Directed by LLMs and Scenario Knowledge: Feasibility, Challenges, and Road Ahead | Yanqi Su et.al. | 2412.08581 | null |
2024-12-11 | An End-to-End Collaborative Learning Approach for Connected Autonomous Vehicles in Occluded Scenarios | Leandro Parada et.al. | 2412.08562 | null |
2024-12-11 | TapeAgents: a Holistic Framework for Agent Development and Optimization | Dzmitry Bahdanau et.al. | 2412.08445 | null |
2024-12-12 | Learn How to Query from Unlabeled Data Streams in Federated Learning | Yuchang Sun et.al. | 2412.08138 | link |
2024-12-10 | Where Common Knowledge Cannot Be Formed, Common Belief Can – Planning with Multi-Agent Belief Using Group Justified Perspectives | Guang Hu et.al. | 2412.07981 | null |
2024-12-10 | Thinking Fast and Laterally: Multi-Agentic Approach for Reasoning about Uncertain Emerging Events | Stefan Dernbach et.al. | 2412.07977 | null |
2024-12-10 | Beyond Static Assumptions: the Predictive Justified Perspective Model for Epistemic Planning | Weijia Li et.al. | 2412.07941 | null |
2024-12-12 | Towards Foundation-model-based Multiagent System to Accelerate AI for Social Impact | Yunfan Zhao et.al. | 2412.07880 | null |
2024-12-10 | Finite-time Non-overshooting Leader-following Consensus Control for Multi-Agent Systems | Min Li et.al. | 2412.07855 | null |
2024-12-10 | MAGE: A Multi-Agent Engine for Automated RTL Code Generation | Yujie Zhao et.al. | 2412.07822 | link |
2024-12-10 | Offline Multi-Agent Reinforcement Learning via In-Sample Sequential Policy Optimization | Zongkai Liu et.al. | 2412.07639 | link |
2024-12-10 | Event-Triggered Memory Control for Interval Type-2 Fuzzy Heterogeneous Multi-Agent Systems | Sen Kong et.al. | 2412.07471 | null |
2024-12-10 | A Distributed Deep Koopman Learning Algorithm for Control | Wenjian Hao et.al. | 2412.07212 | null |
2024-12-10 | A linear-quadratic partially observed Stackelberg stochastic differential game with multiple followers and its application to multi-agent formation control | Yichun Li et.al. | 2412.07159 | null |
2024-12-09 | Reasoning about Strategic Abilities in Stochastic Multi-agent Systems | Yedi Zhang et.al. | 2412.06509 | null |
2024-12-09 | Augmenting the action space with conventions to improve multi-agent cooperation in Hanabi | F. Bredell et.al. | 2412.06333 | link |
2024-12-09 | Vision-Based Deep Reinforcement Learning of UAV Autonomous Navigation Using Privileged Information | Junqiao Wang et.al. | 2412.06313 | null |
2024-12-09 | Enhanced Multi-Object Tracking Using Pose-based Virtual Markers in 3x3 Basketball | Li Yin et.al. | 2412.06258 | null |
2024-12-09 | AgentAlign: Misalignment-Adapted Multi-Agent Perception for Resilient Inter-Agent Sensor Correlations | Zonglin Meng et.al. | 2412.06142 | null |
2024-12-09 | A Logic for Paraconsistent Belief Revision based on Epistemic Entrenchment | Marcelo E. Coniglio et.al. | 2412.06117 | null |
2024-12-08 | Towards Modeling Human-Agentic Collaborative Workflows: A BPMN Extension | Adem Ait et.al. | 2412.05958 | link |
2024-12-08 | A Collaborative Multi-Agent Approach to Retrieval-Augmented Generation Across Diverse Data | Aniruddha Salve et.al. | 2412.05838 | null |
2024-12-06 | Towards Effective GenAI Multi-Agent Collaboration: Design and Evaluation for Enterprise Applications | Raphael Shu et.al. | 2412.05449 | link |
2024-12-06 | TeamCraft: A Benchmark for Multi-Modal Multi-Agent Systems in Minecraft | Qian Long et.al. | 2412.05255 | link |
2024-12-06 | Who Speaks Next? Multi-party AI Discussion Leveraging the Systematics of Turn-taking in Murder Mystery Games | Ryota Nonomura et.al. | 2412.04937 | link |
2024-12-06 | Breaking Event Rumor Detection via Stance-Separated Multi-Agent Debate | Mingqing Zhang et.al. | 2412.04859 | null |
2024-12-05 | GenMAC: Compositional Text-to-Video Generation with Multi-Agent Collaboration | Kaiyi Huang et.al. | 2412.04440 | null |
2024-12-05 | Intersection-Aware Assessment of EMS Accessibility in NYC: A Data-Driven Approach | Haoran Su et.al. | 2412.04369 | null |
2024-12-06 | Reinforcement Learning for Freeway Lane-Change Regulation via Connected Vehicles | Ke Sun et.al. | 2412.04341 | null |
2024-12-05 | Transient Multi-Agent Path Finding for Lifelong Navigation in Dense Environments | Jonathan Morag et.al. | 2412.04256 | null |
2024-12-05 | HyperMARL: Adaptive Hypernetworks for Multi-Agent RL | Kale-ab Abebe Tessera et.al. | 2412.04233 | link |
2024-12-05 | Dimension Reduction via Random Projection for Privacy in Multi-Agent Systems | Puspanjali Ghoshal et.al. | 2412.04031 | null |
2024-12-05 | Is FISHER All You Need in The Multi-AUV Underwater Target Tracking Task? | Jingzehua Xu et.al. | 2412.03959 | null |
2024-12-05 | Traffic Co-Simulation Framework Empowered by Infrastructure Camera Sensing and Reinforcement Learning | Talha Azfar et.al. | 2412.03925 | null |
2024-12-05 | A Multi-agent Simulation for the Mass School Shootings | Wei Dai et.al. | 2412.03882 | null |
2024-12-05 | Educational-Psychological Dialogue Robot Based on Multi-Agent Collaboration | Shiwen Ni et.al. | 2412.03847 | null |
2024-12-04 | WiS Platform: Enhancing Evaluation of LLM-Based Multi-Agent Systems Through Game-Based Analysis | Chengwei Hu et.al. | 2412.03359 | null |
2024-12-04 | Coordinated Multi-Armed Bandits for Improved Spatial Reuse in Wi-Fi | Francesc Wilhelmi et.al. | 2412.03076 | null |
2024-12-04 | Preference-based opponent shaping in differentiable games | Xinyu Qiao et.al. | 2412.03072 | null |
2024-12-03 | Mobile Cell-Free Massive MIMO with Multi-Agent Reinforcement Learning: A Scalable Framework | Ziheng Liu et.al. | 2412.02581 | null |
2024-12-03 | General Resetting Theory for Group Avoidance | Juhee Lee et.al. | 2412.02524 | null |
2024-12-03 | A Multi-Agent Framework for Extensible Structured Text Generation in PLCs | Donghao Yang et.al. | 2412.02410 | null |
2024-12-03 | Distributed Task Allocation for Multi-Agent Systems: A Submodular Optimization Approach | Jing Liu et.al. | 2412.02146 | null |
2024-12-03 | The Problem of Social Cost in Multi-Agent General Reinforcement Learning: Survey and Synthesis | Kee Siong Ng et.al. | 2412.02091 | null |
2024-12-03 | Evolution of Collective AI Beyond Individual Optimization | Ryosuke Takata et.al. | 2412.02085 | null |
2024-12-03 | Comparative Analysis of Multi-Agent Reinforcement Learning Policies for Crop Planning Decision Support | Anubha Mahajan et.al. | 2412.02057 | null |
2024-12-02 | Who’s Gaming the System? A Causally-Motivated Approach for Detecting Strategic Adaptation | Trenton Chang et.al. | 2412.02000 | link |
2024-12-02 | ChatCollab: Exploring Collaboration Between Humans and AI Agents in Software Teams | Benjamin Klieger et.al. | 2412.01992 | null |
2024-12-02 | MALT: Improving Reasoning with Multi-Agent LLM Training | Sumeet Ramesh Motwani et.al. | 2412.01928 | null |
2024-11-29 | RMIO: A Model-Based MARL Framework for Scenarios with Observation Loss in Some Agents | Shi Zifeng et.al. | 2411.19639 | null |
2024-12-02 | Fixed-relative-switch strategies for learning based event-triggered control of nonlinear multiagent systems | Ziming Wang et.al. | 2411.19571 | null |
2024-11-29 | A Local Information Aggregation based Multi-Agent Reinforcement Learning for Robot Swarm Dynamic Task Allocation | Yang Lv et.al. | 2411.19526 | null |
2024-11-29 | Two Timescale EXTRA for Smooth Non-convex Distributed Optimization Problems | Zeyu Peng et.al. | 2411.19483 | null |
2024-11-28 | Integrating Transit Signal Priority into Multi-Agent Reinforcement Learning based Traffic Signal Control | Dickness Kakitahi Kwesiga et.al. | 2411.19359 | null |
2024-11-28 | Mars-PO: Multi-Agent Reasoning System Preference Optimization | Xiaoxuan Lou et.al. | 2411.19039 | null |
2024-11-28 | Backward Linear-Quadratic Mean Field Stochastic Differential Games: A Direct Method | Yu Si et.al. | 2411.18891 | null |
2024-11-27 | Collective decision making by embodied neural agents | Nicolas Coucke et.al. | 2411.18498 | link |
2024-11-27 | Is my Meeting Summary Good? Estimating Quality with a Multi-LLM Evaluator | Frederic Kirstein et.al. | 2411.18444 | null |
2024-11-28 | A Multi-Agent Dual Dialogue System to Support Mental Health Care Providers | Onno P. Kampman et.al. | 2411.18429 | null |
2024-11-30 | InterHub: A Naturalistic Trajectory Dataset with Dense Interaction for Autonomous Driving | Xiyan Jiang et.al. | 2411.18302 | link |
2024-11-27 | Exploration of LLM Multi-Agent Application Implementation Based on LangGraph+CrewAI | Zhihua Duan et.al. | 2411.18241 | null |
2024-11-27 | DMVC-Tracker: Distributed Multi-Agent Trajectory Planning for Target Tracking Using Dynamic Buffered Voronoi and Inter-Visibility Cells | Yunwoo Lee et.al. | 2411.18086 | null |
2024-11-26 | Joint Resource Optimization, Computation Offloading and Resource Slicing for Multi-Edge Traffic-Cognitive Networks | Ting Xiaoyang et.al. | 2411.17782 | null |
2024-11-26 | MALMM: Multi-Agent Large Language Models for Zero-Shot Robotics Manipulation | Harsh Singh et.al. | 2411.17636 | null |
2024-11-26 | Ensuring Safety in Target Pursuit Control: A CBF-Safe Reinforcement Learning Approach | Yaosheng Deng et.al. | 2411.17552 | null |
2024-11-26 | A “Breathing” Mobile Communication Network | Chao Ge et.al. | 2411.17290 | null |
2024-11-26 | Creative Agents: Simulating the Systems Model of Creativity with Generative Agents | Naomi Imasato et.al. | 2411.17065 | null |
2024-11-25 | Avoiding Deadlocks Is Not Enough: Analysis and Resolution of Blocked Airplanes | Shuhao Qi et.al. | 2411.16911 | null |
2024-11-25 | MAGiC-SLAM: Multi-Agent Gaussian Globally Consistent SLAM | Vladimir Yugay et.al. | 2411.16785 | null |
2024-11-25 | Barriers on the EDGE: A scalable CBF architecture over EDGE for safe aerial-ground multi-agent coordination | Viswa Narayanan Sankaranarayanan et.al. | 2411.16608 | null |
2024-11-25 | Online Guidance Graph Optimization for Lifelong Multi-Agent Path Finding | Hongzhi Zang et.al. | 2411.16506 | link |
2024-11-25 | A Multi-agent Framework for Materials Laws Discovery | Bo Hu et.al. | 2411.16416 | null |
2024-11-25 | Enhancing Multi-Agent Consensus through Third-Party LLM Integration: Analyzing Uncertainty and Mitigating Hallucinations in Large Language Models | Zhihua Duan et.al. | 2411.16189 | null |
2024-11-25 | Multi-Robot Reliable Navigation in Uncertain Topological Environments with Graph Attention Networks | Zhuoyuan Yu et.al. | 2411.16134 | link |
2024-11-24 | PIANIST: Learning Partially Observable World Models with LLMs for Multi-Agent Decision Making | Jonathan Light et.al. | 2411.15998 | null |
2024-11-24 | Lattice $φ^{4}$ field theory as a multi-agent system of financial markets | Dimitrios Bachtis et.al. | 2411.15813 | link |
2024-11-24 | DrugAgent: Automating AI-aided Drug Discovery Programming through LLM Multi-Agent Collaboration | Sizhe Liu et.al. | 2411.15692 | null |
2024-11-23 | Instruct or Interact? Exploring and Eliciting LLMs’ Capability in Code Snippet Adaptation Through Prompt Engineering | Tanghaoran Zhang et.al. | 2411.15501 | link |
2024-11-23 | Multi-Agent Disk Inspection | James Conley et.al. | 2411.15391 | null |
2024-11-22 | On Multi-Agent Inverse Reinforcement Learning | Till Freihaut et.al. | 2411.15046 | null |
2024-11-22 | Safe Multi-Agent Reinforcement Learning with Convergence to Generalized Nash Equilibrium | Zeyang Li et.al. | 2411.15036 | null |
2024-11-22 | Enhancing Clinical Trial Patient Matching through Knowledge Augmentation with Multi-Agents | Hanwen Shi et.al. | 2411.14637 | null |
2024-11-21 | A Systematic Study of Multi-Agent Deep Reinforcement Learning for Safe and Robust Autonomous Highway Ramp Entry | Larry Schester et.al. | 2411.14593 | null |
2024-11-21 | Energy Efficient Automated Driving as a GNEP: Vehicle-in-the-loop Experiments | Viranjan Bhattacharyya et.al. | 2411.14567 | null |
2024-11-21 | Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models | Yuhao Dong et.al. | 2411.14432 | link |
2024-11-21 | Multi-Agent Environments for Vehicle Routing Problems | Ricardo Gama et.al. | 2411.14411 | link |
2024-11-21 | Explainable Multi-Agent Reinforcement Learning for Extended Reality Codec Adaptation | Pedro Enrique Iturria-Rivera et.al. | 2411.14264 | null |
2024-11-23 | Learning Two-agent Motion Planning Strategies from Generalized Nash Equilibrium for Model Predictive Control | Hansung Kim et.al. | 2411.13983 | link |
2024-11-23 | Cooperative Grasping and Transportation using Multi-agent Reinforcement Learning with Ternary Force Representation | Ing-Sheng Bernard-Tiong et.al. | 2411.13942 | null |
2024-11-21 | LLMs as Continuous Learners: Improving the Reproduction of Defective Code in Software Issues | Yalan Lin et.al. | 2411.13941 | null |
2024-11-21 | Learning to Cooperate with Humans using Generative Agents | Yancheng Liang et.al. | 2411.13934 | link |
2024-11-21 | XAgents: A Framework for Interpretable Rule-Based Multi-Agents Cooperation | Hailong Yang et.al. | 2411.13932 | null |
2024-11-21 | PIORS: Personalized Intelligent Outpatient Reception based on Large Language Model with Multi-Agents Medical Scenario Simulation | Zhijie Bao et.al. | 2411.13902 | link |
2024-11-21 | Weak synchronization in heterogeneous multi-agent systems | Anton A. Stoorvogel et.al. | 2411.13806 | null |
2024-11-20 | WHALES: A Multi-agent Scheduling Dataset for Enhanced Cooperation in Autonomous Driving | Siwei Chen et.al. | 2411.13340 | link |
2024-11-20 | Cyborg Insect Factory: Automatic Assembly System to Build up Insect-computer Hybrid Robot Based on Vision-guided Robotic Arm Manipulation of Custom Bipolar Electrodes | Qifeng Lin et.al. | 2411.13164 | null |
2024-11-19 | Human-In-the-Loop Software Development Agents | Wannita Takerngsaksiri et.al. | 2411.12924 | null |
2024-11-19 | C $^{2}$ INet: Realizing Incremental Trajectory Prediction with Prior-Aware Continual Causal Intervention | Xiaohe Li et.al. | 2411.12313 | null |
2024-11-19 | Efficient Training in Multi-Agent Reinforcement Learning: A Communication-Free Framework for the Box-Pushing Problem | David Ge et.al. | 2411.12246 | null |
2024-11-19 | Safe Navigation in Dynamic Environments using Density Functions | Sriram S. K. S Narayanan et.al. | 2411.12206 | link |
2024-11-19 | A More Advanced Group Polarization Measurement Approach Based on LLM-Based Agents and Graphs | Zixin Liu et.al. | 2411.12196 | null |
2024-11-19 | Adversarial Multi-Agent Reinforcement Learning for Proactive False Data Injection Detection | Kejun Chen et.al. | 2411.12130 | null |
2024-11-18 | Competing Bandits in Decentralized Large Contextual Matching Markets | Satush Parikh et.al. | 2411.11794 | null |
2024-11-18 | The Power of Many: Multi-Agent Multimodal Models for Cultural Image Captioning | Longju Bai et.al. | 2411.11758 | link |
2024-11-19 | Signaling and Social Learning in Swarms of Robots | Leo Cazenille et.al. | 2411.11616 | null |
2024-11-18 | Syllabus: Portable Curricula for Reinforcement Learning Agents | Ryan Sullivan et.al. | 2411.11318 | link |
2024-11-17 | Emergent Structure in Multi-agent Systems Using Geometric Embeddings | Dimitria Silveria et.al. | 2411.11142 | null |
2024-11-17 | Mitigating Relative Over-Generalization in Multi-Agent Reinforcement Learning | Ting Zhu et.al. | 2411.11099 | null |
2024-11-17 | Joint Precoding and AP Selection for Energy Efficient RIS-aided Cell-Free Massive MIMO Using Multi-agent Reinforcement Learning | Enyu Shi et.al. | 2411.11070 | null |
2024-11-17 | Reinforcing Competitive Multi-Agents for Playing So Long Sucker | Medant Sharan et.al. | 2411.11057 | null |
2024-11-16 | Existence of $ε$ -Nash Equilibria in Nonzero-Sum Borel Stochastic Games and Equilibria of Quantized Models | Naci Saldi et.al. | 2411.10805 | null |
2024-11-16 | Distributed Optimization Method Based On Optimal Control | Ziyuan Guo et.al. | 2411.10658 | null |
2024-11-15 | Evaluating Creativity and Deception in Large Language Models: A Simulation Framework for Multi-Agent Balderdash | Parsa Hejabi et.al. | 2411.10422 | link |
2024-11-15 | Towards Sample-Efficiency and Generalization of Transfer and Inverse Reinforcement Learning: A Comprehensive Literature Review | Hossein Hassani et.al. | 2411.10268 | null |
2024-11-15 | Agentic LLMs in the Supply Chain: Towards Autonomous Multi-Agent Consensus-Seeking | Valeria Jannelli et.al. | 2411.10184 | null |
2024-11-15 | Enforcing Cooperative Safety for Reinforcement Learning-based Mixed-Autonomy Platoon Control | Jingyuan Zhou et.al. | 2411.10031 | null |
2024-11-15 | Reaching Resilient Leader-Follower Consensus in Time-Varying Networks via Multi-Hop Relays | Liwei Yuan et.al. | 2411.09954 | null |
2024-11-15 | InvestESG: A multi-agent reinforcement learning benchmark for studying climate investment as a social dilemma | Xiaoxuan Hou et.al. | 2411.09856 | link |
2024-11-14 | Strategic Sacrifice: Self-Organized Robot Swarm Localization for Inspection Productivity | Sneha Ramshanker et.al. | 2411.09493 | null |
2024-11-13 | FinRobot: AI Agent for Equity Research and Valuation with Large Language Models | Tianyu Zhou et.al. | 2411.08804 | link |
2024-11-13 | BAMAX: Backtrack Assisted Multi-Agent Exploration using Reinforcement Learning | Geetansh Kalra et.al. | 2411.08400 | null |
2024-11-13 | Communication Efficient Decentralization for Smoothed Online Convex Optimization | Neelkamal Bhuyan et.al. | 2411.08355 | null |
2024-11-13 | DNN Task Assignment in UAV Networks: A Generative AI Enhanced Multi-Agent Reinforcement Learning Approach | Xin Tang et.al. | 2411.08299 | null |
2024-11-13 | Collaborative Participatory Research with LLM Agents in South Asia: An Empirically-Grounded Methodological Initiative and Agenda from Field Evidence in Sri Lanka | Xinjie Zhao et.al. | 2411.08294 | null |
2024-11-12 | Collision-Free Multi-Agent Coverage Control for Non-Cooperating Swarms: Preliminary Results | Karolina Schmidt et.al. | 2411.08190 | null |
2024-11-12 | Multi-Agent Stochastic Bandits Robust to Adversarial Corruptions | Fatemeh Ghaffari et.al. | 2411.08167 | null |
2024-11-12 | Adaptive Meta-Learning for Robust Deepfake Detection: A Multi-Agent Framework to Data Drift and Model Generalization | Dinesh Srivasthav P et.al. | 2411.08148 | link |
2024-11-12 | Incentive Design with Spillovers | Krishna Dasaratha et.al. | 2411.08026 | null |
2024-11-12 | Mitigating Bias in Queer Representation within Large Language Models: A Collaborative Agent Approach | Tianyi Huang et.al. | 2411.07656 | link |
2024-11-12 | Exploring Multi-Agent Reinforcement Learning for Unrelated Parallel Machine Scheduling | Maria Zampella et.al. | 2411.07634 | null |
2024-11-12 | A Simple Multi-agent Joint Prediction Method for Autonomous Driving | Mingyi Wang et.al. | 2411.07612 | null |
2024-11-12 | Stability for a stochastic fractional differential variational inequality with Lévy jump | Yue Zeng et.al. | 2411.07557 | null |
2024-11-12 | BudgetMLAgent: A Cost-Effective LLM Multi-Agent system for Automating Machine Learning Tasks | Shubham Gandhi et.al. | 2411.07464 | null |
2024-11-11 | Using Generative AI and Multi-Agents to Provide Automatic Feedback | Shuchen Guo et.al. | 2411.07407 | null |
2024-11-11 | Factorised Active Inference for Strategic Multi-Agent Interactions | Jaime Ruiz-Serra et.al. | 2411.07362 | null |
2024-11-11 | RoundTable: Investigating Group Decision-Making Mechanism in Multi-Agent Collaboration | Young-Min Cho et.al. | 2411.07161 | null |
2024-11-11 | Learning Multi-Agent Collaborative Manipulation for Long-Horizon Quadrupedal Pushing | Chuye Hong et.al. | 2411.07104 | null |
2024-11-11 | A Multi-Agent Approach for REST API Testing with Semantic Graphs and LLM-Driven Inputs | Myeongsoo Kim et.al. | 2411.07098 | null |
2024-11-11 | Learning Collective Dynamics of Multi-Agent Systems using Event-based Vision | Minah Lee et.al. | 2411.07039 | null |
2024-11-11 | Distributed Graph Augmentation Protocols to Achieve Strong Connectivity in Multi-Agent Networks | Guilherme Ramos et.al. | 2411.06880 | link |
2024-11-11 | Ambient AI Scribing Support: Comparing the Performance of Specialized AI Agentic Architecture to Leading Foundational Models | Chanseo Lee et.al. | 2411.06713 | null |
2024-11-10 | OffLight: An Offline Multi-Agent Reinforcement Learning Framework for Traffic Signal Control | Rohit Bokade et.al. | 2411.06601 | null |
2024-11-10 | MA-DV2F: A Multi-Agent Navigation Framework using Dynamic Velocity Vector Field | Yining Ma et.al. | 2411.06404 | null |
2024-11-10 | Do you want to play a game? Learning to play Tic-Tac-Toe in Hypermedia Environments | Katharine Beaumont et.al. | 2411.06398 | null |
2024-11-10 | Optimal Execution with Reinforcement Learning | Yadh Hafsi et.al. | 2411.06389 | null |
2024-11-08 | Data-Driven Distributed Common Operational Picture from Heterogeneous Platforms using Multi-Agent Reinforcement Learning | Indranil Sur et.al. | 2411.05683 | null |
2024-11-08 | Expectation vs. Reality: Towards Verification of Psychological Games | Marta Kwiatkowska et.al. | 2411.05599 | null |
2024-11-08 | Parameterized Voter Relevance in Facility Location Games with Tree-Shaped Invitation Graphs | Ryoto Ando et.al. | 2411.05574 | null |
2024-11-08 | Time-to-reach Bounds for Verification of Dynamical Systems Using the Koopman Spectrum | Jianqiang Ding et.al. | 2411.05554 | null |
2024-11-08 | Emergent Cooperative Strategies for Multi-Agent Shepherding via Reinforcement Learning | Italo Napolitano et.al. | 2411.05454 | null |
2024-11-08 | VISTA: Visual Integrated System for Tailored Automation in Math Problem Generation Using LLM | Jeongwoo Lee et.al. | 2411.05423 | null |
2024-11-08 | LLM-PySC2: Starcraft II learning environment for Large Language Models | Zongyuan Li et.al. | 2411.05348 | link |
2024-11-07 | Performative Reinforcement Learning with Linear Markov Decision Process | Debmalya Mandal et.al. | 2411.05234 | null |
2024-11-07 | Maximizing User Connectivity in AI-Enabled Multi-UAV Networks: A Distributed Strategy Generalized to Arbitrary User Distributions | Bowei Li et.al. | 2411.05205 | null |
2024-11-07 | PentestAgent: Incorporating LLM Agents to Automated Penetration Testing | Xiangmin Shen et.al. | 2411.05185 | null |
2024-11-07 | StoryAgent: Customized Storytelling Video Generation via Multi-Agent Collaboration | Panwen Hu et.al. | 2411.04925 | null |
2024-11-07 | Think Smart, Act SMARL! Analyzing Probabilistic Logic Driven Safety in Multi-Agent Reinforcement Learning | Satchit Chatterji et.al. | 2411.04867 | link |
2024-11-07 | Enhancing Investment Analysis: Optimizing AI-Agent Collaboration in Financial Research | Xuewen Han et.al. | 2411.04788 | link |
2024-11-07 | Learning from Demonstration with Hierarchical Policy Abstractions Toward High-Performance and Courteous Autonomous Racing | Chanyoung Chung et.al. | 2411.04735 | null |
2024-11-07 | A dynamical model of platform choice and online segregation | Sven Banisch et.al. | 2411.04681 | null |
2024-11-07 | CaPo: Cooperative Plan Optimization for Efficient Embodied Multi-Agent Cooperation | Jie Liu et.al. | 2411.04679 | link |
2024-11-07 | Semantic-Aware Resource Management for C-V2X Platooning via Multi-Agent Reinforcement Learning | Zhiyu Shao et.al. | 2411.04672 | link |
2024-11-07 | Multi-Agents are Social Groups: Investigating Social Influence of Multiple Agents in Human-Agent Interactions | Tianqi Song et.al. | 2411.04578 | null |
2024-11-07 | Magentic-One: A Generalist Multi-Agent System for Solving Complex Tasks | Adam Fourney et.al. | 2411.04468 | null |
2024-11-06 | AdaSociety: An Adaptive Environment with Social Structures for Multi-Agent Decision-Making | Yizhe Huang et.al. | 2411.03865 | link |
2024-11-06 | Privacy-Preserving Resilient Vector Consensus | Bing Liu et.al. | 2411.03633 | null |
2024-11-06 | CPEG: Leveraging Consistency Policy with Consensus Guidance for Multi-agent Exploration | Yuqian Fu et.al. | 2411.03603 | null |
2024-11-05 | AI Metropolis: Scaling Large Language Model-based Multi-Agent Simulation with Out-of-order Execution | Zhiqiang Xie et.al. | 2411.03519 | null |
2024-11-05 | SAUCE: Synchronous and Asynchronous User-Customizable Environment for Multi-Agent LLM Interaction | Shlomo Neuberger et.al. | 2411.03397 | link |
2024-11-05 | SMoA: Improving Multi-agent Large Language Models with Sparse Mixture-of-Agents | Dawei Li et.al. | 2411.03284 | link |
2024-11-05 | Spontaneous Emergence of Agent Individuality through Social Interactions in LLM-Based Communities | Ryosuke Takata et.al. | 2411.03252 | null |
2024-11-05 | Polyhedral study of a temporal rural postman problem: application in inspection of railway track without disturbing train schedules | Somnath Buriuly et.al. | 2411.02822 | null |
2024-11-05 | DroidSpeak: Enhancing Cross-LLM Communication | Yuhan Liu et.al. | 2411.02820 | null |
2024-11-04 | Multi-Agent Decision Transformers for Dynamic Dispatching in Material Handling Systems Leveraging Enterprise Big Data | Xian Yeow Lee et.al. | 2411.02584 | null |
2024-11-05 | Revisiting Game-Theoretic Control in Socio-Technical Networks: Emerging Design Frameworks and Contemporary Applications | Quanyan Zhu et.al. | 2411.01794 | null |
2024-11-05 | Lyapunov-guided Multi-Agent Reinforcement Learning for Delay-Sensitive Wireless Scheduling | Cheng Zhang et.al. | 2411.01766 | null |
2024-11-03 | Large-Scale Multi-Robot Coverage Path Planning on Grids with Path Deconfliction | Jingtao Tang et.al. | 2411.01707 | link |
2024-11-03 | Learning to Construct Implicit Communication Channel | Han Wang et.al. | 2411.01553 | null |
2024-11-03 | HiMemFormer: Hierarchical Memory-Aware Transformer for Multi-Agent Action Anticipation | Zirui Wang et.al. | 2411.01455 | null |
2024-11-03 | Online Relational Inference for Evolving Multi-agent Interacting Systems | Beomseok Kang et.al. | 2411.01442 | link |
2024-11-02 | Guiding Multi-agent Multi-task Reinforcement Learning by a Hierarchical Framework with Logical Reward Shaping | Chanjuan Liu et.al. | 2411.01184 | null |
2024-11-02 | Role Play: Learning Adaptive Role-Specific Strategies in Multi-Agent Interactions | Weifan Long et.al. | 2411.01166 | null |
2024-11-01 | Sample-Efficient Regret-Minimizing Double Oracle in Extensive-Form Games | Xiaohang Tang et.al. | 2411.00954 | null |
2024-11-01 | LogiCity: Advancing Neuro-Symbolic AI with Abstract Urban Simulation | Bowen Li et.al. | 2411.00773 | link |
2024-10-31 | Language-Driven Policy Distillation for Cooperative Driving in Multi-Agent Reinforcement Learning | Jiaqi Liu et.al. | 2410.24152 | null |
2024-10-31 | Navigating the Unknown: A Chat-Based Collaborative Interface for Personalized Exploratory Tasks | Yingzhe Peng et.al. | 2410.24032 | null |
2024-10-31 | QuACK: A Multipurpose Queuing Algorithm for Cooperative $k$ -Armed Bandits | Benjamin Howson et.al. | 2410.23867 | null |
2024-10-31 | Edges’ Riemannian energy analysis for synchronization of multi-agent nonlinear systems over undirected weighted graphs | Vincent Andrieu et.al. | 2410.23700 | null |
2024-10-31 | Anytime-Constrained Multi-Agent Reinforcement Learning | Jeremy McMahan et.al. | 2410.23637 | null |
2024-10-31 | Adaptive Distributed Observer-based Model Predictive Control for Multi-agent Formation with Resilience to Communication Link Faults | Binyan Xu et.al. | 2410.23592 | null |
2024-10-30 | Adaptive Network Intervention for Complex Systems: A Hierarchical Graph Reinforcement Learning Approach | Qiliang Chen et.al. | 2410.23396 | null |
2024-10-30 | Resource Governance in Networked Systems via Integrated Variational Autoencoders and Reinforcement Learning | Qiliang Chen et.al. | 2410.23393 | null |
2024-10-30 | Relational Weight Optimization for Enhancing Team Performance in Multi-Agent Multi-Armed Bandits | Monish Reddy Kotturu et.al. | 2410.23379 | null |
2024-11-01 | Multi-Agent Large Language Models for Conversational Task-Solving | Jonas Becker et.al. | 2410.22932 | null |
2024-10-30 | Self-optimization in distributed manufacturing systems using Modular State-based Stackelberg Games | Steve Yuwono et.al. | 2410.22912 | null |
2024-10-30 | Federated UCBVI: Communication-Efficient Federated Regret Minimization with Heterogeneous Agents | Safwan Labbi et.al. | 2410.22908 | null |
2024-10-30 | Modelling vehicle and pedestrian collective dynamics: Challenges and advancements | Cécile Appert-Rolland et.al. | 2410.22896 | null |
2024-10-30 | $\textbf{EMOS}$: $\textbf{E}$mbodiment-aware Heterogeneous $\textbf{M}$ulti-robot $\textbf{O}$perating $\textbf{S}$ ystem with LLM Agents | Junting Chen et.al. | 2410.22662 | null |
2024-10-29 | Energy-Aware Multi-Agent Reinforcement Learning for Collaborative Execution in Mission-Oriented Drone Networks | Ying Li et.al. | 2410.22578 | null |
2024-10-29 | Flow-DPO: Improving LLM Mathematical Reasoning through Online Multi-Agent Learning | Yihe Deng et.al. | 2410.22304 | null |
2024-10-29 | EconoJax: A Fast & Scalable Economic Simulation in Jax | Koen Ponse et.al. | 2410.22165 | link |
2024-10-29 | Improving Performance of Commercially Available AI Products in a Multi-Agent Configuration | Cory Hymel et.al. | 2410.22129 | null |
2024-10-29 | Inverse Attention Agent for Multi-Agent System | Qian Long et.al. | 2410.21794 | link |
2024-10-29 | MARCO: Multi-Agent Real-time Chat Orchestration | Anubhav Shrimal et.al. | 2410.21784 | null |
2024-10-29 | Enhancing Financial Question Answering with a Multi-Agent Reflection Framework | Sorouralsadat Fatemi et.al. | 2410.21741 | null |
2024-10-28 | A Multi-Agent Reinforcement Learning Testbed for Cognitive Radio Applications | Sriniketh Vangaru et.al. | 2410.21521 | null |
2024-10-28 | You Can’t Always Get What You Want : Games of Ordered Preference | Dong Ho Lee et.al. | 2410.21447 | null |
2024-10-28 | Deploying Ten Thousand Robots: Scalable Imitation Learning for Lifelong Multi-Agent Path Finding | He Jiang et.al. | 2410.21415 | null |
2024-10-28 | CT2C-QA: Multimodal Question Answering over Chinese Text, Table and Chart | Bowen Zhao et.al. | 2410.21414 | null |
2024-10-28 | LiGAR: LiDAR-Guided Hierarchical Transformer for Multi-Modal Group Activity Recognition | Naga Venkata Sai Raviteja Chappa et.al. | 2410.21108 | null |
2024-10-28 | CRAT: A Multi-Agent Framework for Causality-Enhanced Reflective and Retrieval-Augmented Translation with Large Language Models | Meiqi Chen et.al. | 2410.21067 | null |
2024-10-28 | FairStream: Fair Multimedia Streaming Benchmark for Reinforcement Learning Agents | Jannis Weil et.al. | 2410.21029 | link |
2024-10-28 | Bilevel Model for Electricity Market Mechanism Optimisation via Quantum Computing Enhanced Reinforcement Learning | Shuyang Zhu et.al. | 2410.20968 | null |
2024-10-27 | Observability of Linear Time-Invariant Systems with Relative Measurements: A Geometric Approach | Ioannis Raptis et.al. | 2410.20637 | null |
2024-10-29 | AutoKaggle: A Multi-Agent Framework for Autonomous Data Science Competitions | Ziming Li et.al. | 2410.20424 | link |
2024-10-27 | Logarithmically Quantized Distributed Optimization over Dynamic Multi-Agent Networks | Mohammadreza Doostmohammadian et.al. | 2410.20345 | null |
2024-10-26 | Who is Responsible? Explaining Safety Violations in Multi-Agent Cyber-Physical Systems | Luyao Niu et.al. | 2410.20288 | null |
2024-10-26 | SWE-Search: Enhancing Software Agents with Monte Carlo Tree Search and Iterative Refinement | Antonis Antoniades et.al. | 2410.20285 | link |
2024-10-26 | A Digital Twin-based Intelligent Network Architecture for Underwater Acoustic Sensor Networks | Shanshan Song et.al. | 2410.20151 | null |
2024-10-25 | Evolving Neural Networks Reveal Emergent Collective Behavior from Minimal Agent Interactions | Guilherme S. Y. Giardini et.al. | 2410.19718 | null |
2024-10-25 | The Sound of Silence in Social Networks | Jesús Aranda et.al. | 2410.19685 | null |
2024-10-25 | MetaTrading: An Immersion-Aware Model Trading Framework for Vehicular Metaverse Services | Hongjia Wu et.al. | 2410.19665 | null |
2024-10-25 | Planning-Aware Diffusion Networks for Enhanced Motion Forecasting in Autonomous Driving | Liu Yunhao et.al. | 2410.19639 | null |
2024-10-25 | PMM-Net: Single-stage Multi-agent Trajectory Prediction with Patching-based Embedding and Explicit Modal Modulation | Huajian Liu et.al. | 2410.19544 | link |
2024-10-25 | Offline-to-Online Multi-Agent Reinforcement Learning with Offline Value Function Memory and Sequential Exploration | Hai Zhong et.al. | 2410.19450 | null |
2024-10-28 | Multi-Agent Reinforcement Learning with Selective State-Space Models | Jemma Daniel et.al. | 2410.19382 | null |
2024-10-25 | Toward Finding Strong Pareto Optimal Policies in Multi-Agent Reinforcement Learning | Bang Giang Le et.al. | 2410.19372 | link |
2024-10-25 | Joint User Scheduling and Precoding for RIS-Aided MU-MISO Systems: A MADRL Approach | Yangjing Wang et.al. | 2410.19359 | null |
2024-10-25 | VisionCoder: Empowering Multi-Agent Auto-Programming for Image Processing with Hybrid LLMs | Zixiao Zhao et.al. | 2410.19245 | null |
2024-10-24 | Schema-Guided Culture-Aware Complex Event Simulation with Multi-Agent Role-Play | Sha Li et.al. | 2410.18935 | null |
2024-10-24 | Multi-agent cooperation through learning-aware policy gradients | Alexander Meulemans et.al. | 2410.18636 | null |
2024-10-24 | Leveraging Graph Neural Networks and Multi-Agent Reinforcement Learning for Inventory Control in Supply Chains | Niki Kotecha et.al. | 2410.18631 | null |
2024-10-24 | Evolutionary Dispersal of Ecological Species via Multi-Agent Deep Reinforcement Learning | Wonhyung Choi et.al. | 2410.18621 | null |
2024-10-24 | LLM as a code generator in Agile Model Driven Development | Ahmed R. Sadik et.al. | 2410.18489 | null |
2024-10-24 | Observer-Based Event-Triggered Secure Consensus Control for Multi-Agent Systems | Jingyao Wang et.al. | 2410.18440 | null |
2024-10-23 | PyTSC: A Unified Platform for Multi-Agent Reinforcement Learning in Traffic Signal Control | Rohit Bokade et.al. | 2410.18202 | link |
2024-10-23 | GraphTeam: Facilitating Large Language Model-based Graph Analysis via Multi-Agent Collaboration | Xin Li et.al. | 2410.18032 | link |
2024-10-23 | On Regularity and Normalization in Sequential Screening | Ian Ball et.al. | 2410.17962 | null |
2024-10-23 | Guide for Defense (G4D): Dynamic Guidance for Robust and Balanced Defense in Large Language Models | He Cao et.al. | 2410.17922 | link |
2024-10-23 | Scalable Offline Reinforcement Learning for Mean Field Games | Axel Brunnbauer et.al. | 2410.17898 | link |
2024-10-23 | TranSPORTmer: A Holistic Approach to Trajectory Understanding in Multi-Agent Sports | Guillem Capellera et.al. | 2410.17785 | null |
2024-10-23 | Navigate Complex Physical Worlds via Geometrically Constrained LLM | Yongqiang Huang et.al. | 2410.17529 | null |
2024-10-22 | Evolution with Opponent-Learning Awareness | Yann Bouteiller et.al. | 2410.17466 | link |
2024-10-22 | Decoding Time Series with LLMs: A Multi-Agent Framework for Cross-Domain Annotation | Minhua Lin et.al. | 2410.17462 | null |
2024-10-22 | Cooperative Multi-Agent Constrained Stochastic Linear Bandits | Amirhossein Afsharrad et.al. | 2410.17382 | null |
2024-10-22 | Episodic Future Thinking Mechanism for Multi-agent Reinforcement Learning | Dongsu Lee et.al. | 2410.17373 | null |
2024-10-22 | Responsibility in a Multi-Value Strategic Setting | Timothy Parker et.al. | 2410.17229 | null |
2024-10-22 | Scalable spectral representations for network multiagent control | Zhaolin Ren et.al. | 2410.17221 | null |
2024-10-22 | Layered LA-MAPF: a decomposition of large agent MAPF instance to accelerate solving without compromising solvability | Zhuo Yao et.al. | 2410.17160 | link |
2024-10-22 | Delay-Constrained Grant-Free Random Access in MIMO Systems: Distributed Pilot Allocation and Power Control | Jianan Bai et.al. | 2410.17068 | link |
2024-10-22 | Self-Evolving Multi-Agent Collaboration Networks for Software Development | Yue Hu et.al. | 2410.16946 | null |
2024-10-22 | SERN: Simulation-Enhanced Realistic Navigation for Multi-Agent Robotic Systems in Contested Environments | Jumman Hossain et.al. | 2410.16686 | null |
2024-10-22 | Convex Markov Games: A Framework for Fairness, Imitation, and Creativity in Multi-Agent Learning | Ian Gemp et.al. | 2410.16600 | null |
2024-10-21 | The Social Cost of Growth: Evaluating GMV-Centric and Welfare-Centric Strategies in Online Food Delivery Platforms | Yukun Zhang et.al. | 2410.16566 | link |
2024-10-21 | Distributed Online Life-Long Learning (DOL3) for Multi-agent Trust and Reputation Assessment in E-commerce | Hariprasauth Ramamoorthy et.al. | 2410.16529 | null |
2024-10-21 | RGMDT: Return-Gap-Minimizing Decision Tree Extraction in Non-Euclidean Metric Space | Jingdi Chen et.al. | 2410.16517 | null |
2024-10-21 | IBGP: Imperfect Byzantine Generals Problem for Zero-Shot Robustness in Communicative Multi-Agent Systems | Yihuan Mao et.al. | 2410.16237 | null |
2024-10-21 | A Troublemaker with Contagious Jailbreak Makes Chaos in Honest Towns | Tianyi Men et.al. | 2410.16155 | null |
2024-10-21 | A New Approach to Solving SMAC Task: Generating Decision Tree Code from Large Language Models | Yue Deng et.al. | 2410.16024 | link |
2024-10-21 | Analyzing Closed-loop Training Techniques for Realistic Traffic Agent Models in Autonomous Highway Driving Simulations | Matthias Bitzer et.al. | 2410.15987 | null |
2024-10-21 | FlickerFusion: Intra-trajectory Domain Generalizing Multi-Agent RL | Woosung Koh et.al. | 2410.15876 | null |
2024-10-21 | Towards Efficient Collaboration via Graph Modeling in Reinforcement Learning | Wenzhe Fan et.al. | 2410.15841 | null |
2024-10-22 | Three connected problems: principal with multiple agents in cooperation, Principal–Agent with Mckean–Vlasov dynamics and multitask Principal–Agent | Mao Fabrice Djete et.al. | 2410.15818 | null |
2024-10-21 | Hierarchical Search-Based Cooperative Motion Planning | Yuchen Wu et.al. | 2410.15710 | link |
2024-10-21 | Students Rather Than Experts: A New AI For Education Pipeline To Model More Human-Like And Personalised Early Adolescences | Yiping Ma et.al. | 2410.15701 | null |
2024-10-21 | NetSafe: Exploring the Topological Safety of Multi-agent Networks | Miao Yu et.al. | 2410.15686 | null |
2024-10-18 | Teaching Models to Balance Resisting and Accepting Persuasion | Elias Stengel-Eskin et.al. | 2410.14596 | link |
2024-10-18 | Toolshed: Scale Tool-Equipped Agents with Advanced RAG-Tool Fusion and Tool Knowledge Bases | Elias Lumer et.al. | 2410.14594 | null |
2024-10-18 | MARLIN: Multi-Agent Reinforcement Learning Guided by Language-Based Inter-Robot Negotiation | Toby Godfrey et.al. | 2410.14383 | null |
2024-10-18 | A Model Checker for Natural Strategic Ability | Marco Aruta et.al. | 2410.14374 | null |
2024-10-18 | CoMAL: Collaborative Multi-Agent Large Language Models for Mixed-Autonomy Traffic | Huaiyuan Yao et.al. | 2410.14368 | link |
2024-10-18 | Good Parenting is all you need – Multi-agentic LLM Hallucination Mitigation | Edward et.al. | 2410.14262 | null |
2024-10-18 | Synthesizing Post-Training Data for LLMs through Multi-Agent Simulation | Shuo Tang et.al. | 2410.14251 | link |
2024-10-18 | Agents4PLC: Automating Closed-loop PLC Code Generation and Verification in Industrial Control Systems using LLM-based Agents | Zihan Liu et.al. | 2410.14209 | link |
2024-10-17 | On Diffusion Models for Multi-Agent Partial Observability: Shared Attractors, Error Bounds, and Composite Flow | Tonghan Wang et.al. | 2410.13953 | null |
2024-10-17 | AgentOccam: A Simple Yet Strong Baseline for LLM-Based Web Agents | Ke Yang et.al. | 2410.13825 | null |
2024-10-17 | Rapid and Automated Alloy Design with Graph Neural Network-Powered LLM-Driven Multi-Agent Systems | Alireza Ghafarollahi et.al. | 2410.13768 | null |
2024-10-17 | Real Eventual Exponential Positivity of Complex-valued Laplacians: Applications to Consensus in Multi-agent Systems | Aditi Saxena et.al. | 2410.13700 | null |
2024-10-17 | Byzantine-Resilient Output Optimization of Multiagent via Self-Triggered Hybrid Detection Approach | Chenhang Yan et.al. | 2410.13454 | null |
2024-10-16 | Facilitating Multi-turn Function Calling for LLMs via Compositional Instruction Tuning | Mingyang Chen et.al. | 2410.12952 | link |
2024-10-16 | JudgeBench: A Benchmark for Evaluating LLM-based Judges | Sijun Tan et.al. | 2410.12784 | link |
2024-10-16 | HEnRY: A Multi-Agent System Framework for Multi-Domain Contexts | Emmanuele Lacavalla et.al. | 2410.12720 | link |
2024-10-16 | Hybrid Decision Making for Scalable Multi-Agent Navigation: Integrating Semantic Maps, Discrete Coordination, and Model Predictive Control | Koen de Vos et.al. | 2410.12651 | null |
2024-10-16 | Zeroth-Order Feedback Optimization in Multi-Agent Systems: Tackling Coupled Constraints | Yingpeng Duan et.al. | 2410.12647 | null |
2024-10-16 | A Communication Consistent Approach to Signal Temporal Logic Task Decomposition in Multi-Agent Systems | Gregorio Marchesini et.al. | 2410.12563 | null |
2024-10-16 | Counterfactual Effect Decomposition in Multi-Agent Sequential Decision Making | Stelios Triantafyllou et.al. | 2410.12539 | link |
2024-10-17 | MedAide: Towards an Omni Medical Aide via Specialized LLM-based Multi-Agent Collaboration | Jinjie Wei et.al. | 2410.12532 | null |
2024-10-17 | Aegis:An Advanced LLM-Based Multi-Agent for Intelligent Functional Safety Engineering | Lu Shi et.al. | 2410.12475 | null |
2024-10-17 | Enhancing LLM Trading Performance with Fact-Subjectivity Aware Reasoning | Qian Wang et.al. | 2410.12464 | link |
2024-10-16 | Corridor Generating Algorithm for Multi-Agent Pathfinding | Arseniy Pertzovsky et.al. | 2410.12397 | null |
2024-10-15 | G-Designer: Architecting Multi-agent Communication Topologies via Graph Neural Networks | Guibin Zhang et.al. | 2410.11782 | null |
2024-10-15 | Markov-Nash equilibria in mean-field games under model uncertainty | Johannes Langner et.al. | 2410.11652 | null |
2024-10-15 | AGENTiGraph: An Interactive Knowledge Graph Platform for LLM-based Chatbots Utilizing Private Data | Xinjie Zhao et.al. | 2410.11531 | null |
2024-10-15 | Agent-Based Modelling of Older Adult Needs for Autonomous Mobility-on-Demand: A Case Study in Winnipeg, Canada | Manon Prédhumeau et.al. | 2410.11416 | null |
2024-10-15 | Strategic and Fair Aggregator Interactions in Energy Markets: Mutli-agent Dynamics and Quasiconcave Games | Jiayi Li et.al. | 2410.11296 | null |
2024-10-15 | Multi-Objective-Optimization Multi-AUV Assisted Data Collection Framework for IoUT Based on Offline Reinforcement Learning | Yimian Ding et.al. | 2410.11282 | null |
2024-10-15 | Biologically Inspired Swarm Dynamic Target Tracking and Obstacle Avoidance | Lucas Page et.al. | 2410.11237 | null |
2024-10-14 | DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model | Yuqi Wang et.al. | 2410.10738 | null |
2024-10-14 | Consensus in Multiagent Systems with lack of connection | Mohamed Bentaibi et.al. | 2410.10486 | null |
2024-10-14 | Compositional Shielding and Reinforcement Learning for Multi-Agent Systems | Asger Horn Brorholt et.al. | 2410.10460 | null |
2024-10-14 | Content Caching-Assisted Vehicular Edge Computing Using Multi-Agent Graph Attention Reinforcement Learning | Jinjin Shen et.al. | 2410.10071 | null |
2024-10-13 | GRRIS: a real-time intra-site observation scheduling scheme for distributed survey telescope arrays | Yajie Zhang et.al. | 2410.09881 | null |
2024-10-13 | Transformers as Game Players: Provable In-context Game-playing Capabilities of Pre-trained Models | Chengshuai Shi et.al. | 2410.09701 | null |
2024-10-12 | CAMPHOR: Collaborative Agents for Multi-input Planning and High-Order Reasoning On Device | Yicheng Fu et.al. | 2410.09407 | null |
2024-10-12 | Two Heads Are Better Than One: A Multi-Agent System Has the Potential to Improve Scientific Idea Generation | Haoyang Su et.al. | 2410.09403 | link |
2024-10-12 | LLM-SmartAudit: Advanced Smart Contract Vulnerability Detection | Zhiyuan Wei et.al. | 2410.09381 | link |
2024-10-11 | PEAR: A Robust and Flexible Automation Framework for Ptychography Enabled by Multiple Large Language Model Agents | Xiangyu Yin et.al. | 2410.09034 | link |
2024-10-11 | The Dynamics of Social Conventions in LLM populations: Spontaneous Emergence, Collective Biases and Tipping Points | Ariel Flint Ashery et.al. | 2410.08948 | null |
2024-10-11 | Deep Learning Algorithms for Mean Field Optimal Stopping in Finite Space and Discrete Time | Lorenzo Magnino et.al. | 2410.08850 | null |
2024-10-11 | Efficiently Scanning and Resampling Spatio-Temporal Tasks with Irregular Observations | Bryce Ferenczi et.al. | 2410.08681 | null |
2024-10-11 | Derivation of macroscopic epidemic models from multi-agent systems | Mattia Zanella et.al. | 2410.08610 | null |
2024-10-11 | Kaleidoscope: Learnable Masks for Heterogeneous Multi-agent Reinforcement Learning | Xinran Li et.al. | 2410.08540 | link |
2024-10-11 | Distributed Adaptive Consensus with Obstacle and Collision Avoidance for Networks of Heterogeneous Multi-Agent Systems | Armel Koulong et.al. | 2410.08440 | null |
2024-10-10 | Large Legislative Models: Towards Efficient AI Policymaking in Economic Simulations | Henry Gasztowtt et.al. | 2410.08345 | link |
2024-10-10 | Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent System | Weize Chen et.al. | 2410.08115 | null |
2024-10-10 | Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining | Tianyi Bai et.al. | 2410.08102 | link |
2024-10-10 | Variational Inequality Methods for Multi-Agent Reinforcement Learning: Performance and Stability Gains | Baraah A. M. Sidahmed et.al. | 2410.07976 | null |
2024-10-10 | Dynamic Programming based Local Search approaches for Multi-Agent Path Finding problems on Directed Graphs | Irene Saccani et.al. | 2410.07954 | null |
2024-10-10 | Learning to Balance Altruism and Self-interest Based on Empathy in Mixed-Motive Games | Fanqi Kong et.al. | 2410.07863 | null |
2024-10-10 | MACPO: Weak-to-Strong Alignment via Multi-Agent Contrastive Preference Optimization | Yougang Lyu et.al. | 2410.07672 | null |
2024-10-10 | AI-Press: A Multi-Agent News Generating and Feedback Simulation System Powered by Large Language Models | Xiawei Liu et.al. | 2410.07561 | null |
2024-10-10 | COMMA: A Communicative Multimodal Multi-Agent Benchmark | Timothy Ossowski et.al. | 2410.07553 | null |
2024-10-09 | CAFEEN: A Cooperative Approach for Energy Efficient NoCs with Multi-Agent Reinforcement Learning | Kamil Khan et.al. | 2410.07426 | null |
2024-10-09 | A Rapid Trajectory Optimization and Control Framework for Resource-Constrained Applications | Deep Parikh et.al. | 2410.07413 | null |
2024-10-09 | I Want to Break Free! Anti-Social Behavior and Persuasion Ability of LLMs in Multi-Agent Settings with Social Hierarchy | Gian Maria Campedelli et.al. | 2410.07109 | link |
2024-10-09 | MOOSE-Chem: Large Language Models for Rediscovering Unseen Chemistry Scientific Hypotheses | Zonglin Yang et.al. | 2410.07076 | link |
2024-10-09 | Seeker: Enhancing Exception Handling in Code with LLM-based Multi-Agent Approach | Xuanming Zhang et.al. | 2410.06949 | link |
2024-10-09 | Variations in Multi-Agent Actor-Critic Frameworks for Joint Optimizations in UAV Swarm Networks: Recent Evolution, Challenges, and Directions | Muhammad Morshed Alam et.al. | 2410.06627 | null |
2024-10-09 | Cooperative Multi-Target Positioning for Cell-Free Massive MIMO with Multi-Agent Reinforcement Learning | Ziheng Liu et.al. | 2410.06506 | null |
2024-10-08 | Cooperative and Asynchronous Transformer-based Mission Planning for Heterogeneous Teams of Mobile Robots | Milad Farjadnasab et.al. | 2410.06372 | link |
2024-10-10 | An Algorithm for Distributed Computation of Reachable Sets for Multi-Agent Systems | Omanshu Thapliyal et.al. | 2410.06321 | null |
2024-10-08 | Multimodal Situational Safety | Kaiwen Zhou et.al. | 2410.06172 | null |
2024-10-08 | Coevolving with the Other You: Fine-Tuning LLM with Sequential Cooperative Multi-Agent Reinforcement Learning | Hao Ma et.al. | 2410.06101 | link |
2024-10-08 | Learning Equilibria in Adversarial Team Markov Games: A Nonconvex-Hidden-Concave Min-Max Optimization Problem | Fivos Kalogiannis et.al. | 2410.05673 | null |
2024-10-07 | GLEE: A Unified Framework and Benchmark for Language-based Economic Environments | Eilam Shapira et.al. | 2410.05254 | link |
2024-10-07 | Scalable and Accurate Graph Reasoning with LLM-based Multi-Agents | Yuwei Hu et.al. | 2410.05130 | null |
2024-10-07 | Cloud-Based Scheduling Mechanism for Scalable and Resource-Efficient Centralized Controllers | Achilleas Santi Seisa et.al. | 2410.04920 | null |
2024-10-07 | Distributed Collaborative User Positioning for Cell-Free Massive MIMO with Multi-Agent Reinforcement Learning | Ziheng Liu et.al. | 2410.04871 | null |
2024-10-07 | Adversarial Multi-Agent Evaluation of Large Language Models through Iterative Debates | Chaithanya Bandi et.al. | 2410.04663 | null |
2024-10-06 | The Role of Social Support and Influencers in Social Media Communities | Junwei Su et.al. | 2410.04619 | null |
2024-10-06 | Social Choice for Heterogeneous Fairness in Recommendation | Amanda Aird et.al. | 2410.04551 | null |
2024-10-06 | MindScope: Exploring cognitive biases in large language models through Multi-Agent Systems | Zhentao Xie et.al. | 2410.04452 | link |
2024-10-05 | Trajectory Design and Resource Allocation for Multi-UAV-Assisted Sensing, Communication, and Edge Computing Integration | Sicong Peng et.al. | 2410.04151 | null |
2024-10-05 | Compositional Planning for Logically Constrained Multi-Agent Markov Decision Processes | Krishna C. Kalagarla et.al. | 2410.04004 | null |
2024-10-04 | Steering Large Language Models between Code Execution and Textual Reasoning | Yongchao Chen et.al. | 2410.03524 | null |
2024-10-03 | AutoML-Agent: A Multi-Agent LLM Framework for Full-Pipeline AutoML | Patara Trirat et.al. | 2410.02958 | link |
2024-10-03 | Grounded Answers for Multi-agent Decision-making Problem through Generative World Model | Zeyang Liu et.al. | 2410.02664 | null |
2024-10-03 | Towards Implicit Bias Detection and Mitigation in Multi-Agent LLM Interactions | Angana Borah et.al. | 2410.02584 | link |
2024-10-03 | Boosting Sample Efficiency and Generalization in Multi-agent Reinforcement Learning via Equivariance | Joshua McClellan et.al. | 2410.02581 | null |
2024-10-03 | ColaCare: Enhancing Electronic Health Record Modeling through Large Language Model-Driven Multi-Agent Collaboration | Zixiang Wang et.al. | 2410.02551 | null |
2024-10-03 | Learning Emergence of Interaction Patterns across Independent RL Agents in Multi-Agent Environments | Vasanth Reddy Baddam et.al. | 2410.02516 | null |
2024-10-03 | Choices are More Important than Efforts: LLM Enables Efficient Multi-Agent Exploration | Yun Qu et.al. | 2410.02511 | link |
2024-10-03 | Can Large Language Models Grasp Legal Theories? Enhance Legal Reasoning with Insights from Multi-Agent Collaboration | Weikang Yuan et.al. | 2410.02507 | link |
2024-10-03 | Cut the Crap: An Economical Communication Pipeline for LLM-based Multi-Agent Systems | Guibin Zhang et.al. | 2410.02506 | null |
2024-10-03 | An Online Feasible Point Method for Benign Generalized Nash Equilibrium Problems | Sarah Sachs et.al. | 2410.02400 | null |
2024-10-03 | Agent-Oriented Planning in Multi-Agent Systems | Ao Li et.al. | 2410.02189 | link |
2024-10-02 | Windowed MAPF with Completeness Guarantees | Rishi Veerapaneni et.al. | 2410.01798 | null |
2024-10-02 | Social coordination perpetuates stereotypic expectations and behaviors across generations in deep multi-agent reinforcement learning | Rebekah A. Gelpí et.al. | 2410.01763 | null |
2024-10-02 | Performant, Memory Efficient and Scalable Multi-Agent Reinforcement Learning | Omayma Mahjoub et.al. | 2410.01706 | null |
2024-10-02 | Agent-Driven Large Language Models for Mandarin Lyric Generation | Hong-Hsiang Liu et.al. | 2410.01450 | null |
2024-10-02 | MARLens: Understanding Multi-agent Reinforcement Learning for Traffic Signal Control via Visual Analytics | Yutian Zhang et.al. | 2410.01364 | null |
2024-10-02 | FanCric : Multi-Agentic Framework for Crafting Fantasy 11 Cricket Teams | Mohit Bhatnagar et.al. | 2410.01307 | null |
2024-10-01 | Exploiting Structure in Offline Multi-Agent RL: The Benefits of Low Interaction Rank | Wenhao Zhan et.al. | 2410.01101 | null |
2024-10-01 | From Facts to Insights: A Study on the Generation and Evaluation of Analytical Reports for Deciphering Earnings Calls | Tomas Goldsack et.al. | 2410.01039 | null |
2024-10-01 | Fast-and-flexible decision-making with modulatory interactions | Rodrigo Moreno-Morton et.al. | 2410.00798 | null |
2024-10-01 | Human-Robot Collaborative Minimum Time Search through Sub-priors in Ant Colony Optimization | Oscar Gil Viyuela et.al. | 2410.00517 | null |
2024-09-30 | LaMMA-P: Generalizable Multi-Agent Long-Horizon Task Allocation and Planning with LM-Driven PDDL Planner | Xiaopan Zhang et.al. | 2409.20560 | null |
2024-09-30 | MARLadona – Towards Cooperative Team Play Using Multi-Agent Reinforcement Learning | Zichong Li et.al. | 2409.20326 | null |
2024-09-30 | Distributed NeRF Learning for Collaborative Multi-Robot Perception | Hongrui Zhao et.al. | 2409.20289 | null |
2024-09-30 | Can We Break the Curse of Multiagency in Robust Multi-Agent Reinforcement Learning? | Laixi Shi et.al. | 2409.20067 | null |
2024-09-30 | Fuel tax loss in a world of electric mobility: A window of opportunity for congestion pricing | Thi Ngoc Nguyen et.al. | 2409.20033 | null |
2024-09-30 | Variational Auto-encoder Based Solutions to Interactive Dynamic Influence Diagrams | Yinghui Pan et.al. | 2409.19965 | null |
2024-10-01 | TRANSAGENT: An LLM-Based Multi-Agent System for Code Translation | Zhiqiang Yuan et.al. | 2409.19894 | null |
2024-09-30 | Enabling Multi-Robot Collaboration from Single-Human Guidance | Zhengran Ji et.al. | 2409.19831 | null |
2024-10-02 | T2Vs Meet VLMs: A Scalable Multimodal Dataset for Visual Harmfulness Recognition | Chen Yeh et.al. | 2409.19734 | link |
2024-09-29 | An action language-based formalisation of an abstract argumentation framework | Yann Munro et.al. | 2409.19625 | null |
2024-09-27 | Mean-Field Control Barrier Functions: A Framework for Real-Time Swarm Control | Samy Wu Fung et.al. | 2409.18945 | null |
2024-09-27 | Safe Decentralized Multi-Agent Control using Black-Box Predictors, Conformal Decision Policies, and Control Barrier Functions | Sacha Huriot et.al. | 2409.18862 | null |
2024-09-27 | Enhancing Spectrum Efficiency in 6G Satellite Networks: A GAIL-Powered Policy Learning via Asynchronous Federated Inverse Reinforcement Learning | Sheikh Salman Hassan et.al. | 2409.18718 | null |
2024-09-27 | DP-SCC-PL:Differentially Private Decentralized Byzantine-Resilient Stochastic Optimization via Self-Centered Clipping Under Polyak-Łojasiewicz Condition | Jinhui Hu et.al. | 2409.18632 | null |
2024-09-27 | Multi-agent Reinforcement Learning for Dynamic Dispatching in Material Handling Systems | Xian Yeow Lee et.al. | 2409.18435 | null |
2024-09-26 | Inverse Reinforcement Learning with Multiple Planning Horizons | Jiayu Yao et.al. | 2409.18051 | null |
2024-09-26 | Reasoning Multi-Agent Behavioral Topology for Interactive Autonomous Driving | Haochen Liu et.al. | 2409.18031 | link |
2024-09-26 | Compositional Hardness of Code in Large Language Models – A Probabilistic Perspective | Yotam Wolf et.al. | 2409.18028 | null |
2024-09-26 | Filtering-Linearization: A First-Order Method for Nonconvex Trajectory Optimization with Filter-Based Warm-Starting | Minsen Yuan et.al. | 2409.17944 | null |
2024-09-26 | AssistantX: An LLM-Powered Proactive Assistant in Collaborative Human-Populated Environment | Nan Sun et.al. | 2409.17655 | null |
2024-09-26 | Cat-and-Mouse Satellite Dynamics: Divergent Adversarial Reinforcement Learning for Contested Multi-Agent Space Operations | Cameron Mehlman et.al. | 2409.17443 | null |
2024-09-25 | Language Grounded Multi-agent Communication for Ad-hoc Teamwork | Huao Li et.al. | 2409.17348 | null |
2024-09-25 | Communication Backbone Reconfiguration with Connectivity Maintenance | Leonardo Santos et.al. | 2409.16851 | null |
2024-09-25 | Asynchronous Fractional Multi-Agent Deep Reinforcement Learning for Age-Minimal Mobile Edge Computing | Lyudong Jin et.al. | 2409.16832 | null |
2024-09-25 | Dashing for the Golden Snitch: Multi-Drone Time-Optimal Motion Planning with Multi-Agent Reinforcement Learning | Xian Wang et.al. | 2409.16720 | link |
2024-09-24 | MBC: Multi-Brain Collaborative Control for Quadruped Robots | Hang Liu et.al. | 2409.16460 | null |
2024-09-24 | A Multi-Agent Multi-Environment Mixed Q-Learning for Partially Decentralized Wireless Network Optimization | Talha Bozkus et.al. | 2409.16450 | link |
2024-09-25 | Multi-UAV Pursuit-Evasion with Online Planning in Unknown Environments by Deep Reinforcement Learning | Jiayu Chen et.al. | 2409.15866 | null |
2024-09-24 | A Robust, Task-Agnostic and Fully-Scalable Voxel Mapping System for Large Scale Environments | Jinche La et.al. | 2409.15779 | null |
2024-09-24 | Linear Contextual Bandits with Interference | Yang Xu et.al. | 2409.15682 | null |
2024-09-23 | Revolutionizing Biomarker Discovery: Leveraging Generative AI for Bio-Knowledge-Embedded Continuous Space Exploration | Wangyang Ying et.al. | 2409.15612 | null |
2024-09-23 | SymAware: A Software Development Framework for Trustworthy Multi-Agent Systems with Situational Awareness | Ernesto Casablanca et.al. | 2409.14833 | null |
2024-09-18 | Residual Descent Differential Dynamic Game (RD3G) – A Fast Newton Solver for Constrained General Sum Games | Zhiyuan Zhang et.al. | 2409.12152 | null |
2024-09-18 | MAgICoRe: Multi-Agent, Iterative, Coarse-to-Fine Refinement for Reasoning | Justin Chih-Yao Chen et.al. | 2409.12147 | link |
2024-09-18 | Putting Data at the Centre of Offline Multi-Agent Reinforcement Learning | Claude Formanek et.al. | 2409.12001 | null |
2024-09-18 | XP-MARL: Auxiliary Prioritization in Multi-Agent Reinforcement Learning to Address Non-Stationarity | Jianye Xu et.al. | 2409.11852 | link |
2024-09-18 | HARP: Human-Assisted Regrouping with Permutation Invariant Critic for Multi-Agent Reinforcement Learning | Huawen Hu et.al. | 2409.11741 | null |
2024-09-17 | Hyper-SAMARL: Hypergraph-based Coordinated Task Allocation and Socially-aware Navigation for Multi-Robot Systems | Weizheng Wang et.al. | 2409.11561 | null |
2024-09-17 | Improving LLM Reasoning with Multi-Agent Tree-of-Thought Validator Agent | Fatemeh Haji et.al. | 2409.11527 | link |
2024-09-19 | The Art of Storytelling: Multi-Agent Generative AI for Dynamic Multimodal Narratives | Samee Arif et.al. | 2409.11261 | link |
2024-09-17 | Reactive Environments for Active Inference Agents with RxEnvironments.jl | Wouter W. L. Nuijten et.al. | 2409.11087 | link |
2024-09-17 | Distributed Optimization for Traffic Light Control and Connected Automated Vehicle Coordination in Mixed-Traffic Intersections | Viet-Anh Le et.al. | 2409.10864 | null |
2024-09-16 | AutoSafeCoder: A Multi-Agent Framework for Securing LLM Code Generation through Static Analysis and Fuzz Testing | Ana Nunez et.al. | 2409.10737 | link |
2024-09-16 | Multi-agent Path Finding in Continuous Environment | Kristýna Janovská et.al. | 2409.10680 | null |
2024-09-16 | Decentralized and Asymmetric Multi-Agent Learning in Construction Sites | Yakov Miron et.al. | 2409.10375 | null |
2024-09-16 | Multi-Agent Obstacle Avoidance using Velocity Obstacles and Control Barrier Functions | Alejandro Sánchez Roncero et.al. | 2409.10117 | null |
2024-09-16 | A Social Force Model for Multi-Agent Systems With Application to Robots Traversal in Cluttered Environments | Chenxi Li et.al. | 2409.10049 | null |
2024-09-16 | Bearing-Distance Based Flocking with Zone-Based Interactions | Hossein B. Jond et.al. | 2409.10047 | null |
2024-09-15 | Decentralized Safe and Scalable Multi-Agent Control under Limited Actuation | Vrushabh Zinage et.al. | 2409.09573 | null |
2024-09-14 | Learning Nudges for Conditional Cooperation: A Multi-Agent Reinforcement Learning Model | Shatayu Kulkarni et.al. | 2409.09509 | null |
2024-09-14 | Learning Keypoints for Multi-Agent Behavior Analysis using Self-Supervision | Daniel Khalil et.al. | 2409.09455 | null |
2024-09-14 | Capability Augmentation for Heterogeneous Dynamic Teaming with Temporal Logic Tasks | Carter Berlind et.al. | 2409.09285 | null |
2024-09-13 | Measure Preserving Flows for Ergodic Search in Convoluted Environments | Albert Xu et.al. | 2409.09164 | null |
2024-09-13 | HOLA-Drone: Hypergraphic Open-ended Learning for Zero-Shot Multi-Drone Cooperative Pursuit | Yang Li et.al. | 2409.08767 | null |
2024-09-13 | Average Consensus over Directed Networks in Open Multi-Agent Systems with Acknowledgement Feedback | Evagoras Makridis et.al. | 2409.08634 | null |
2024-09-12 | Knowledge Tagging with Large Language Model based Multi-Agent System | Hang Li et.al. | 2409.08406 | null |
2024-09-12 | Covariance Intersection-based Invariant Kalman Filtering(DInCIKF) for Distributed Pose Estimation | Haoying Li et.al. | 2409.07933 | null |
2024-09-12 | Reinforcement Learning Discovers Efficient Decentralized Graph Path Search Strategies | Alexei Pisacane et.al. | 2409.07932 | link |
2024-09-12 | Tera-SpaceCom: GNN-based Deep Reinforcement Learning for Joint Resource Allocation and Task Offloading in TeraHertz Band Space Networks | Zhifeng Hu et.al. | 2409.07911 | null |
2024-09-12 | Mapping Technical Safety Research at AI Companies: A literature review and incentives analysis | Oscar Delaney et.al. | 2409.07878 | null |
2024-09-12 | A Spatiotemporal Stealthy Backdoor Attack against Cooperative Multi-Agent Deep Reinforcement Learning | Yinbo Yu et.al. | 2409.07775 | null |
2024-09-12 | Accelerated Multi-Time-Scale Stochastic Approximation: Optimal Complexity and Applications in Reinforcement Learning and Multi-Agent Games | Sihan Zeng et.al. | 2409.07767 | null |
2024-09-12 | CollaMamba: Efficient Collaborative Perception with Cross-Agent Spatial-Temporal State Space Model | Yang Li et.al. | 2409.07714 | null |
2024-09-11 | “My Grade is Wrong!”: A Contestable AI Framework for Interactive Feedback in Evaluating Student Essays | Shengxin Hong et.al. | 2409.07453 | null |
2024-09-11 | Explanation, Debate, Align: A Weak-to-Strong Framework for Language Model Generalization | Mehrdad Zakershahrak et.al. | 2409.07335 | null |
2024-09-11 | Propaganda to Hate: A Multimodal Analysis of Arabic Memes with Multi-Agent LLMs | Firoj Alam et.al. | 2409.07246 | link |
2024-09-11 | DCMAC: Demand-aware Customized Multi-Agent Communication via Upper Bound Training | Dongkun Huo et.al. | 2409.07127 | null |
2024-09-10 | A Quality Diversity Approach to Automatically Generate Multi-Agent Path Finding Benchmark Maps | Cheng Qian et.al. | 2409.06888 | null |
2024-09-10 | Can Agents Spontaneously Form a Society? Introducing a Novel Architecture for Generative Multi-Agents to Elicit Social Emergence | H. Zhang et.al. | 2409.06750 | null |
2024-09-10 | Fixed-budget and Multiple-issue Quadratic Voting | Laura Georgescu et.al. | 2409.06614 | null |
2024-09-10 | Think-on-Process: Dynamic Process Generation for Collaborative Development of Multi-Agent System | Leilei Lin et.al. | 2409.06568 | link |
2024-09-10 | Coordinated Motion Planning: Multi-Agent Path Finding in a Densely Packed, Bounded Domain | Sándor P. Fekete et.al. | 2409.06486 | null |
2024-09-10 | Exploring the Integration of Large Language Models in Industrial Test Maintenance Processes | Ludvig Lemner et.al. | 2409.06416 | null |
2024-09-10 | MAGDA: Multi-agent guideline-driven diagnostic assistance | David Bani-Harouni et.al. | 2409.06351 | null |
2024-09-10 | Foragax: An Agent Based Modelling framework based on JAX | Siddharth Chaturvedi et.al. | 2409.06345 | link |
2024-09-10 | Towards Agentic AI on Particle Accelerators | Antonin Sulc et.al. | 2409.06336 | null |
2024-09-10 | Automate Strategy Finding with LLM in Quant investment | Zhizhuo Kou et.al. | 2409.06289 | null |
2024-09-09 | When Learning Meets Dynamics: Distributed User Connectivity Maximization in UAV-Based Communication Networks | Bowei Li et.al. | 2409.06010 | null |
2024-09-09 | Cooperative Decision-Making for CAVs at Unsignalized Intersections: A MARL Approach with Attention and Hierarchical Game Priors | Jiaqi Liu et.al. | 2409.05712 | null |
2024-09-09 | SciAgents: Automating scientific discovery through multi-agent intelligent graph reasoning | Alireza Ghafarollahi et.al. | 2409.05556 | link |
2024-09-09 | Adaptive Multi-Layer Deployment for A Digital Twin Empowered Satellite-Terrestrial Integrated Network | Yihong Tao et.al. | 2409.05480 | null |
2024-09-09 | Distributed Robust Continuous-Time Optimization Algorithms for Time-Varying Constrained Cost | Zeinab Ebrahimi et.al. | 2409.05293 | null |
2024-09-08 | Difference Between Cyclic and Distributed Approach in Stochastic Optimization for Multi-agent System | Jiahao Shi et.al. | 2409.05155 | null |
2024-09-08 | Nonlinear Cooperative Output Regulation with Input Delay Compensation | Shiqi Zheng et.al. | 2409.05113 | null |
2024-09-08 | Decentralized Control of Multi-Agent Systems Under Acyclic Spatio-Temporal Task Dependencies | Gregorio Marchesini et.al. | 2409.05106 | null |
2024-09-08 | Towards Multi-agent Policy-based Directed Hypergraph Learning for Traffic Signal Control | Kang Wang et.al. | 2409.05037 | null |
2024-09-08 | Cooperative Learning-Based Framework for VNF Caching and Placement Optimization over Low Earth Orbit Satellite Networks | Khai Doan et.al. | 2409.05025 | null |
2024-09-07 | Adaptation Procedure in Misinformation Games | Konstantinos Varsos et.al. | 2409.04854 | link |
2024-09-06 | Using Large Language Models to Generate Authentic Multi-agent Knowledge Work Datasets | Desiree Heim et.al. | 2409.04286 | null |
2024-09-06 | Advancing Multi-Organ Disease Care: A Hierarchical Multi-Agent Reinforcement Learning Framework | Daniel J. Tan et.al. | 2409.04224 | null |
2024-09-06 | Tighter Analysis for Decentralized Stochastic Gradient Method: Impact of Data Homogeneity | Qiang Li et.al. | 2409.04092 | null |
2024-09-05 | DRAL: Deep Reinforcement Adaptive Learning for Multi-UAVs Navigation in Unknown Indoor Environment | Kangtong Mo et.al. | 2409.03930 | null |
2024-09-05 | On the Convergence Rates of Federated Q-Learning across Heterogeneous Environments | Muxing Wang et.al. | 2409.03897 | null |
2024-09-05 | Multi-agent Path Finding for Mixed Autonomy Traffic Coordination | Han Zheng et.al. | 2409.03881 | null |
2024-09-05 | PARCO: Learning Parallel Autoregressive Policies for Efficient Multi-Agent Combinatorial Optimization | Federico Berto et.al. | 2409.03811 | link |
2024-09-06 | LLM-based multi-agent poetry generation in non-cooperative environments | Ran Zhang et.al. | 2409.03659 | link |
2024-09-05 | From MOOC to MAIC: Reshaping Online Teaching and Learning through LLM-driven Agents | Jifan Yu et.al. | 2409.03512 | null |
2024-09-05 | Predefined-time distributed non-convex optimization via a time-base generator | Qinlong Lin et.al. | 2409.03188 | null |
2024-09-04 | An Introduction to Centralized Training for Decentralized Execution in Cooperative Multi-Agent Reinforcement Learning | Christopher Amato et.al. | 2409.03052 | null |
2024-09-04 | Creating a Gen-AI based Track and Trace Assistant MVP (SuperTracy) for PostNL | Mohammad Reshadati et.al. | 2409.02711 | null |
2024-09-04 | Generalized Individual Q-learning for Polymatrix Games with Partial Observations | Ahmed Said Donmez et.al. | 2409.02663 | null |
2024-09-04 | A Survey on Emergent Language | Jannik Peters et.al. | 2409.02645 | null |
2024-09-04 | Context-Aware Agent-based Model for Smart Long Distance Transport System | Muhammad Raees et.al. | 2409.02434 | null |
2024-09-03 | Multi-Agent Reinforcement Learning for Joint Police Patrol and Dispatch | Matthew Repasky et.al. | 2409.02246 | null |
2024-09-03 | What are the Essential Factors in Crafting Effective Long Context Multi-Hop Instruction Datasets? Insights and Best Practices | Zhi Chen et.al. | 2409.01893 | link |
2024-09-03 | Bridging the Gap Between Central and Local Decision-Making: The Efficacy of Collaborative Equilibria in Altruistic Congestion Games | Bryce L Ferguson et.al. | 2409.01525 | null |
2024-09-02 | Performance-Aware Self-Configurable Multi-Agent Networks: A Distributed Submodular Approach for Simultaneous Coordination and Network Design | Zirui Xu et.al. | 2409.01411 | link |
2024-09-02 | Two-Timescale Synchronization and Migration for Digital Twin Networks: A Multi-Agent Deep Reinforcement Learning Approach | Wenshuai Liu et.al. | 2409.01092 | null |
2024-09-02 | Multiagent Reinforcement Learning Enhanced Decision-making of Crew Agents During Floor Construction Process | Bin Yang et.al. | 2409.01060 | null |
2024-08-30 | All You Need is Group Actions: Advancing Robust Autonomous Planning | Vincenzo Basco et.al. | 2408.17295 | null |
2024-08-29 | Learning Multi-agent Multi-machine Tending by Mobile Robots | Abdalwhab Abdalwhab et.al. | 2408.16875 | null |
2024-08-29 | 3D Topological Modeling and Multi-Agent Movement Simulation for Viral Infection Risk Analysis | Wassim Jabi et.al. | 2408.16417 | null |
2024-09-04 | Efficient Multi-agent Navigation with Lightweight DRL Policy | Xingrong Diao et.al. | 2408.16370 | null |
2024-08-29 | Guided Reasoning: A Non-Technical Introduction | Gregor Betz et.al. | 2408.16331 | link |
2024-08-29 | Action potential dynamics on heterogenous neural networks: from kinetic to macroscopic equations | Marzia Bisi et.al. | 2408.16214 | null |
2024-08-28 | WebPilot: A Versatile and Autonomous Multi-Agent System for Web Task Execution with Strategic Exploration | Yao Zhang et.al. | 2408.15978 | null |
2024-08-28 | BattleAgentBench: A Benchmark for Evaluating Cooperation and Competition Capabilities of Language Models in Multi-Agent Systems | Wei Wang et.al. | 2408.15971 | null |
2024-09-02 | Persuasion Games using Large Language Models | Ganesh Prasath Ramani et.al. | 2408.15879 | null |
2024-08-28 | TrafficGamer: Reliable and Flexible Traffic Simulation for Safety-Critical Scenarios with Game-Theoretic Oracles | Guanren Qiao et.al. | 2408.15538 | link |
2024-08-27 | Graph Attention Inference of Network Topology in Multi-Agent Systems | Akshay Kolli et.al. | 2408.15449 | null |
2024-08-27 | Fast and Modular Autonomy Software for Autonomous Racing Vehicles | Andrew Saba et.al. | 2408.15425 | null |
2024-08-27 | On Stateful Value Factorization in Multi-Agent Reinforcement Learning | Enrico Marchesini et.al. | 2408.15381 | null |
2024-08-27 | A Multi-Agent Reinforcement Learning Scheme for SFC Placement in Edge Computing Networks | Congzhou Li et.al. | 2408.15337 | null |
2024-08-27 | Exploiting Approximate Symmetry for Efficient Multi-Agent Reinforcement Learning | Batuhan Yardim et.al. | 2408.15173 | null |
2024-08-27 | Applications in CityLearn Gym Environment for Multi-Objective Control Benchmarking in Grid-Interactive Buildings and Districts | Kingsley Nweye et.al. | 2408.15170 | null |
2024-08-27 | AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems | Chi-Min Chan et.al. | 2408.14972 | link |
2024-08-27 | Decentralized Unlabeled Multi-agent Pathfinding Via Target And Priority Swapping (With Supplementary) | Stepan Dergachev et.al. | 2408.14948 | link |
2024-08-26 | Emergent Language in Open-Ended Environments | Cornelius Wolff et.al. | 2408.14649 | null |
2024-08-26 | On Centralized Critics in Multi-Agent Reinforcement Learning | Xueguang Lyu et.al. | 2408.14597 | link |
2024-08-26 | Multi-Agent Path Finding with Real Robot Dynamics and Interdependent Tasks for Automated Warehouses | Vassilissa Lehoux-Lebacque et.al. | 2408.14527 | null |
2024-08-25 | CoT Rerailer: Enhancing the Reliability of Large Language Models in Complex Reasoning Tasks through Error Detection and Correction | Guangya Wan et.al. | 2408.13940 | null |
2024-08-25 | Demo: Generative Open xG Network Simulation with Multi-Agent LLM and ns-3 (GenOnet) | Farhad Rezazadeh et.al. | 2408.13781 | null |
2024-08-25 | MASQ: Multi-Agent Reinforcement Learning for Single Quadruped Robot Locomotion | Qi Liu et.al. | 2408.13759 | null |
2024-08-25 | Multi-Agent Target Assignment and Path Finding for Intelligent Warehouse: A Cooperative Multi-Agent Deep Reinforcement Learning Perspective | Qi Liu et.al. | 2408.13750 | null |
2024-08-24 | Reaching New Heights in Multi-Agent Collective Construction | Martin Rameš et.al. | 2408.13615 | null |
2024-08-24 | Hybrid Training for Enhanced Multi-task Generalization in Multi-agent Reinforcement Learning | Mingliang Zhang et.al. | 2408.13567 | null |
2024-08-24 | Unleashing Collaborative Computing for Adaptive Video Streaming with Multi-objective Optimization in Satellite Terrestrial Networks | Zhishu Shen et.al. | 2408.13512 | null |
2024-08-23 | Optimizing Collaboration of LLM based Agents for Finite Element Analysis | Chuan Tian et.al. | 2408.13406 | null |
2024-08-23 | DrugAgent: Explainable Drug Repurposing Agent with Large Language Model-based Reasoning | Yoshitaka Inoue et.al. | 2408.13378 | null |
2024-08-23 | Optimally Solving Simultaneous-Move Dec-POMDPs: The Sequential Central Planning Approach | Johan Peralez et.al. | 2408.13139 | null |
2024-08-23 | Diffusion-based Episodes Augmentation for Offline Multi-Agent Reinforcement Learning | Jihwan Oh et.al. | 2408.13092 | null |
2024-08-22 | Can LLMs Understand Social Norms in Autonomous Driving Games? | Boxuan Wang et.al. | 2408.12680 | null |
2024-08-25 | MuMA-ToM: Multi-modal Multi-Agent Theory of Mind | Haojun Shi et.al. | 2408.12574 | link |
2024-08-22 | MEDCO: Medical Education Copilots Based on A Multi-Agent Framework | Hao Wei et.al. | 2408.12496 | null |
2024-08-22 | Multi Agent Framework for Collective Intelligence Research | Alexandru Dochian et.al. | 2408.12391 | link |
2024-08-22 | Recursive Distributed Collaborative Aided Inertial Navigation | Roland Jung et.al. | 2408.12360 | link |
2024-08-22 | MDD-5k: A New Diagnostic Conversation Dataset for Mental Disorders Synthesized via Neuro-Symbolic LLM Agents | Congchi Yin et.al. | 2408.12142 | link |
2024-08-22 | Distributed Noncoherent Joint Transmission Based on Multi-Agent Reinforcement Learning for Dense Small Cell MISO Systems | Shaozhuang Bai et.al. | 2408.12067 | null |
2024-08-21 | Empirical Equilibria in Agent-based Economic systems with Learning agents | Kshama Dwarakanath et.al. | 2408.12038 | null |
2024-08-21 | Leveraging Chemistry Foundation Models to Facilitate Structure Focused Retrieval Augmented Generation in Multi-Agent Workflows for Catalyst and Materials Design | Nathaniel H. Park et.al. | 2408.11793 | null |
2024-08-21 | DreamFactory: Pioneering Multi-Scene Long Video Generation with a Multi-Agent Framework | Zhifei Xie et.al. | 2408.11788 | null |
2024-08-23 | Consensus over Clustered Networks Using Intermittent and Asynchronous Output Feedback | Federico M. Zegers et.al. | 2408.11752 | null |
2024-08-21 | Bayesian Optimization Framework for Efficient Fleet Design in Autonomous Multi-Robot Exploration | David Molina Concha et.al. | 2408.11751 | null |
2024-08-21 | Less is more: AI Decision-Making using Dynamic Deep Neural Networks for Short-Term Stock Index Prediction | CJ Finnegan et.al. | 2408.11740 | null |
2024-08-21 | Optimizing QoS in HD Map Updates: Cross-Layer Multi-Agent with Hierarchical and Independent Learning | Jeffrey Redondo et.al. | 2408.11605 | null |
2024-08-21 | Drama Engine: A Framework for Narrative Agents | Martin Pichlmair et.al. | 2408.11574 | null |
2024-08-21 | Subgoal-based Hierarchical Reinforcement Learning for Multi-Agent Collaboration | Cheng Xu et.al. | 2408.11416 | link |
2024-08-21 | Swarm Intelligence in Geo-Localization: A Multi-Agent Large Vision-Language Model Collaborative Framework | Xiao Han et.al. | 2408.11312 | null |
2024-08-20 | CooPre: Cooperative Pretraining for V2X Cooperative Perception | Seth Z. Zhao et.al. | 2408.11241 | null |
2024-08-20 | The Evolution of Reinforcement Learning in Quantitative Finance | Nikolaos Pippas et.al. | 2408.10932 | null |
2024-08-20 | DBHP: Trajectory Imputation in Multi-Agent Sports Using Derivative-Based Hybrid Prediction | Hanjun Choi et.al. | 2408.10878 | null |
2024-08-20 | Multi-agent Multi-armed Bandits with Stochastic Sharable Arm Capacities | Hong Xie et.al. | 2408.10865 | null |
2024-08-20 | Multi-Agent Based Simulation for Decentralized Electric Vehicle Charging Strategies and their Impacts | Kristoffer Christensen et.al. | 2408.10790 | null |
2024-08-20 | Multi-agent based modeling for investigating excess heat utilization from electrolyzer production to district heating network | Kristoffer Christensen et.al. | 2408.10783 | null |
2024-08-20 | Multi-Agent Based Simulation for Investigating Centralized Charging Strategies and their Impact on Electric Vehicle Home Charging Ecosystem | Kristoffer Christensen et.al. | 2408.10773 | null |
2024-08-20 | Strategist: Learning Strategic Skills by LLMs via Bi-Level Tree Search | Jonathan Light et.al. | 2408.10635 | null |
2024-08-20 | Synchronization behind Learning in Periodic Zero-Sum Games Triggers Divergence from Nash equilibrium | Yuma Fujimoto et.al. | 2408.10595 | null |
2024-08-20 | Hokoff: Real Game Dataset from Honor of Kings and its Offline Reinforcement Learning Benchmarks | Yun Qu et.al. | 2408.10556 | link |
2024-08-19 | Tax Credits and Household Behavior: The Roles of Myopic Decision-Making and Liquidity in a Simulated Economy | Jialin Dong et.al. | 2408.10391 | null |
2024-08-19 | Synthesis of Reward Machines for Multi-Agent Equilibrium Design (Full Version) | Muhammad Najib et.al. | 2408.10074 | null |
2024-08-20 | MegaAgent: A Practical Framework for Autonomous Cooperation in Large-Scale LLM Agent Systems | Qian Wang et.al. | 2408.09955 | link |
2024-08-19 | Mitigating the Stability-Plasticity Dilemma in Adaptive Train Scheduling with Curriculum-Driven Continual DQN Expansion | Achref Jaziri et.al. | 2408.09838 | null |
2024-08-19 | GoNoGo: An Efficient LLM-based Multi-Agent System for Streamlining Automotive Software Release Decision-Making | Arsham Gholamzadeh Khoee et.al. | 2408.09785 | null |
2024-08-19 | Algorithmic Contract Design with Reinforcement Learning Agents | David Molina Concha et.al. | 2408.09686 | null |
2024-08-19 | Multi-Agent Reinforcement Learning for Autonomous Driving: A Survey | Ruiqi Zhang et.al. | 2408.09675 | link |
2024-08-18 | Prescribed-time Convergent Distributed Multiobjective Optimization with Dynamic Event-triggered Communication | Tengyang Gong et.al. | 2408.09602 | null |
2024-08-18 | Beyond Local Views: Global State Inference with Diffusion Models for Cooperative Multi-Agent Reinforcement Learning | Zhiwei Xu et.al. | 2408.09501 | null |
2024-08-16 | On the Completeness of Conflict-Based Search: Temporally-Relative Duplicate Pruning | Thayne T Walker et.al. | 2408.09028 | null |
2024-08-16 | The Fellowship of the LLMs: Multi-Agent Workflows for Synthetic Preference Optimization Dataset Generation | Samee Arif et.al. | 2408.08688 | link |
2024-08-16 | AgentSimulator: An Agent-based Approach for Data-driven Business Process Simulation | Lukas Kirchdorfer et.al. | 2408.08571 | link |
2024-08-15 | A semi-centralized multi-agent RL framework for efficient irrigation scheduling | Bernard T. Agyeman et.al. | 2408.08442 | null |
2024-08-15 | Stochastic Semi-Gradient Descent for Learning Mean Field Games with Population-Aware Function Approximation | Chenyu Zhang et.al. | 2408.08192 | null |
2024-08-15 | Independent Policy Mirror Descent for Markov Potential Games: Scaling to Large Number of Players | Pragnya Alatur et.al. | 2408.08075 | null |
2024-08-15 | Text2BIM: Generating Building Models Using a Large Language Model-based Multi-Agent Framework | Changyu Du et.al. | 2408.08054 | link |
2024-08-16 | MAG-SQL: Multi-Agent Generative Approach with Soft Schema Linking and Iterative Sub-SQL Refinement for Text-to-SQL | Wenxuan Xie et.al. | 2408.07930 | link |
2024-08-14 | SustainDC – Benchmarking for Sustainable Data Center Control | Avisek Naug et.al. | 2408.07841 | link |
2024-08-14 | SigmaRL: A Sample-Efficient and Generalizable Multi-Agent Reinforcement Learning Framework for Motion Planning | Jianye Xu et.al. | 2408.07644 | link |
2024-08-14 | Development of a Multi-Agent Clinical Decision Support System for Korean Triage and Acuity Scale (KTAS)-Based Triage and Treatment Planning in Emergency Departments | Seungjun Han et.al. | 2408.07531 | null |
2024-08-14 | Bridging Training and Execution via Dynamic Directed Graph-Based Communication in Cooperative Multi-Agent Systems | Zhuohui Zhang et.al. | 2408.07397 | null |
2024-08-14 | Improving Global Parameter-sharing in Physically Heterogeneous Multi-agent Reinforcement Learning with Unified Action Space | Xiaoyang Yu et.al. | 2408.07395 | null |
2024-08-12 | QTypeMix: Enhancing Multi-Agent Cooperative Strategies through Heterogeneous and Homogeneous Value Decomposition | Songchen Fu et.al. | 2408.07098 | link |
2024-08-13 | Robust Model Predictive Control for Aircraft Intent-Aware Collision Avoidance | Arash Bahari Kordabad et.al. | 2408.06999 | null |
2024-08-13 | Multi-Agent Continuous Control with Generative Flow Networks | Shuang Luo et.al. | 2408.06920 | link |
2024-08-13 | MAPPO-PIS: A Multi-Agent Proximal Policy Optimization Method with Prior Intent Sharing for CAVs’ Cooperative Decision-Making | Yicheng Guo et.al. | 2408.06656 | link |
2024-08-12 | Decentralized Cooperation in Heterogeneous Multi-Agent Reinforcement Learning via Graph Neural Network-Based Intrinsic Motivation | Jahir Sadik Monon et.al. | 2408.06503 | link |
2024-08-12 | Learning in Time-Varying Monotone Network Games with Dynamic Populations | Feras Al Taha et.al. | 2408.06253 | null |
2024-08-12 | Multi-Agent Deep Reinforcement Learning Framework for Wireless MAC Protocol Design and Optimization | Navid Keshtiarast et.al. | 2408.05884 | null |
2024-08-11 | DeepAir: A Multi-Agent Deep Reinforcement Learning Based Scheme for an Unknown User Location Problem | Baris Yamansavascilar et.al. | 2408.05712 | null |
2024-08-10 | Multi-agent Planning using Visual Language Models | Michele Brienza et.al. | 2408.05478 | null |
2024-08-09 | Expected $1.x$ -Makespan-Optimal MAPF on Grids in Low-Poly Time | Teng Guo et.al. | 2408.05385 | link |
2024-08-09 | Modeling Transit in a Fully Integrated Agent-Based Framework: Methodology and Large-Scale Application | Omer Verbas et.al. | 2408.05176 | null |
2024-08-08 | A Multi-Scale Cognitive Interaction Model of Instrument Operations at the Linac Coherent Light Source | Jonathan Segal et.al. | 2408.04734 | null |
2024-08-08 | Emergence in Multi-Agent Systems: A Safety Perspective | Philipp Altmann et.al. | 2408.04514 | link |
2024-08-08 | Can LLMs Beat Humans in Debating? A Dynamic Multi-agent Framework for Competitive Debate | Yiqun Zhang et.al. | 2408.04472 | link |
2024-08-08 | KnowPC: Knowledge-Driven Programmatic Reinforcement Learning for Zero-shot Coordination | Yin Gu et.al. | 2408.04336 | null |
2024-08-08 | Assigning Credit with Partial Reward Decoupling in Multi-Agent Proximal Policy Optimization | Aditya Kapoor et.al. | 2408.04295 | link |
2024-08-08 | Cooperative Multi-Agent Deep Reinforcement Learning in Content Ranking Optimization | Zhou Qin et.al. | 2408.04251 | null |
2024-08-07 | From Data to Story: Towards Automatic Animated Data Video Creation with LLM-based Multi-Agent Systems | Leixian Shen et.al. | 2408.03876 | null |
2024-08-07 | Automated Code Fix Suggestions for Accessibility Issues in Mobile Apps | Forough Mehralian et.al. | 2408.03827 | null |
2024-08-07 | A time-dependent symplectic network for non-convex path planning problems with linear and nonlinear dynamics | Zhen Zhang et.al. | 2408.03785 | null |
2024-08-07 | MS-Mapping: An Uncertainty-Aware Large-Scale Multi-Session LiDAR Mapping System | Xiangcheng Hu et.al. | 2408.03723 | link |
2024-08-07 | Asynchronous Credit Assignment Framework for Multi-Agent Reinforcement Learning | Yongheng Liang et.al. | 2408.03692 | null |
2024-08-07 | AgentsCoMerge: Large Language Model Empowered Collaborative Decision Making for Ramp Merging | Senkang Hu et.al. | 2408.03624 | null |
2024-08-06 | Combining Diverse Information for Coordinated Action: Stochastic Bandit Algorithms for Heterogeneous Agents | Lucia Gordon et.al. | 2408.03405 | link |
2024-08-06 | Social Behavior as a Key to Learning-based Multi-Agent Pathfinding Dilemmas | Chengyang He et.al. | 2408.03063 | null |
2024-08-06 | Anytime Multi-Agent Path Finding with an Adaptive Delay-Based Heuristic | Thomy Phan et.al. | 2408.02960 | link |
2024-08-06 | Reinforcement Learning based Workflow Scheduling in Cloud and Edge Computing Environments: A Taxonomy, Review and Future Directions | Amanda Jayanetti et.al. | 2408.02938 | null |
2024-08-05 | Heterogeneous graph attention network improves cancer multiomics integration | Sina Tabakhi et.al. | 2408.02845 | link |
2024-08-05 | Evaluating and Enhancing LLMs Agent based on Theory of Mind in Guandan: A Multi-Player Cooperative Game under Imperfect Information | Yauwai Yim et.al. | 2408.02559 | null |
2024-08-05 | Spatio-Temporal Communication Compression in Distributed Prime-Dual Flows | Zihao Ren et.al. | 2408.02332 | null |
2024-08-05 | Data time travel and consistent market making: taming reinforcement learning in multi-agent systems with anonymous data | Vincent Ragel et.al. | 2408.02322 | null |
2024-08-05 | ReDel: A Toolkit for LLM-Powered Recursive Multi-Agent Systems | Andrew Zhu et.al. | 2408.02248 | link |
2024-08-04 | Environment Complexity and Nash Equilibria in a Sequential Social Dilemma | Mustafa Yasir et.al. | 2408.02148 | null |
2024-08-04 | Shaping Rewards, Shaping Routes: On Multi-Agent Deep Q-Networks for Routing in Satellite Constellation Networks | Manuel M. H. Roth et.al. | 2408.01979 | null |
2024-08-04 | MAO: A Framework for Process Model Generation with Multi-Agent Orchestration | Leilei Lin et.al. | 2408.01916 | null |
2024-08-03 | MALADE: Orchestration of LLM-powered Agents with Retrieval Augmented Generation for Pharmacovigilance | Jihye Choi et.al. | 2408.01869 | link |
2024-08-03 | A Robust Compressed Push-Pull Method for Decentralized Nonconvex Optimization | Yiwei Liao et.al. | 2408.01727 | null |
2024-08-03 | The Drama Machine: Simulating Character Development with LLM Agents | Liam Magee et.al. | 2408.01725 | null |
2024-08-02 | PsybORG+: Cognitive Modeling for Triggering and Detection of Cognitive Biases of Advanced Persistent Threats | Shuo Huang et.al. | 2408.01310 | null |
2024-08-02 | Game Theory Based Community-Aware Opinion Dynamics | Shanfan Zhang et.al. | 2408.01196 | link |
2024-08-02 | On Game Based Distributed Decision Approach for Multi-agent Optimal Coverage Problem with Application to Constellations Reconfiguration | Zixin Feng et.al. | 2408.01193 | null |
2024-08-05 | Agentic LLM Workflows for Generating Patient-Friendly Medical Reports | Malavikha Sudarshan et.al. | 2408.01112 | link |
2024-08-02 | A Survey on Self-play Methods in Reinforcement Learning | Ruize Zhang et.al. | 2408.01072 | null |
2024-08-02 | Learning with Linear Function Approximations in Mean-Field Control | Erhan Bayraktar et.al. | 2408.00991 | null |
2024-08-02 | On the Resilience of Multi-Agent Systems with Malicious Agents | Jen-tse Huang et.al. | 2408.00989 | link |
2024-08-01 | A Policy-Gradient Approach to Solving Imperfect-Information Games with Iterate Convergence | Mingyang Liu et.al. | 2408.00751 | null |
2024-08-01 | Jailbreaking Text-to-Image Models with LLM-Based Agents | Yingkai Dong et.al. | 2408.00523 | null |
2024-08-01 | A Novel Edge Laplacian-based Approach for Adaptive Formation Control of Uncertain Multi-agent Systems with Unified Relative Error Performance | Kun Li et.al. | 2408.00323 | null |
2024-07-31 | CREW: Facilitating Human-AI Teaming Research | Lingyu Zhang et.al. | 2408.00170 | link |
2024-07-31 | Artificial Intelligence Approaches for Energy Efficiency: A Review | Alberto Pasqualetto et.al. | 2407.21726 | null |
2024-07-31 | MART: MultiscAle Relational Transformer Networks for Multi-agent Trajectory Prediction | Seongju Lee et.al. | 2407.21635 | link |
2024-07-31 | Multi-agent reinforcement learning for the control of three-dimensional Rayleigh-Bénard convection | Joel Vasanth et.al. | 2407.21565 | link |
2024-07-31 | Multi-agent Assessment with QoS Enhancement for HD Map Updates in a Vehicular Network | Jeffrey Redondo et.al. | 2407.21460 | null |
2024-07-31 | MetaOpenFOAM: an LLM-based multi-agent framework for CFD | Yuxuan Chena et.al. | 2407.21320 | link |
2024-08-02 | MSMA: Multi-agent Trajectory Prediction in Connected and Autonomous Vehicle Environment with Multi-source Data Integration | Xi Chen et.al. | 2407.21310 | link |
2024-07-30 | Amelia: A Large Model and Dataset for Airport Surface Movement Forecasting | Ingrid Navarro et.al. | 2407.21185 | null |
2024-07-30 | Securing Proof of Stake Blockchains: Leveraging Multi-Agent Reinforcement Learning for Detecting and Mitigating Malicious Nodes | Faisal Haque Bappy et.al. | 2407.20983 | null |
2024-07-30 | Distributed Adaptive Time-Varying Optimization with Global Asymptotic Convergence | Liangze Jiang et.al. | 2407.20897 | null |
2024-07-30 | Breaking Agents: Compromising Autonomous LLM Agents Through Malfunction Amplification | Boyang Zhang et.al. | 2407.20859 | null |
2024-07-30 | Architectural Influence on Variational Quantum Circuits in Multi-Agent Reinforcement Learning: Evolutionary Strategies for Optimization | Michael Kölle et.al. | 2407.20739 | null |
2024-07-30 | Implementation of Formal Standard for Interoperability in M&S/System of Systems Integration with DEVS/SOA | Saurabh Mittal et.al. | 2407.20696 | null |
2024-07-30 | Interpreting and Mitigating Hallucination in MLLMs through Multi-agent Debate | Zheng Lin et.al. | 2407.20505 | link |
2024-07-29 | Finite-Time Analysis of Asynchronous Multi-Agent TD Learning | Nicolò Dal Fabbro et.al. | 2407.20441 | null |
2024-07-29 | MindSearch: Mimicking Human Minds Elicits Deep AI Searcher | Zehui Chen et.al. | 2407.20183 | link |
2024-07-29 | Counterfactual rewards promote collective transport using individually controlled swarm microrobots | Veit-Lorenz Heuthe et.al. | 2407.20041 | null |
2024-07-29 | Minimum Time Consensus of Multi-agent System under Fuel Constraints | Akansha Rautela et.al. | 2407.19927 | null |
2024-07-29 | Sustainable Task Offloading in Secure UAV-assisted Smart Farm Networks: A Multi-Agent DRL with Action Mask Approach | Tingnan Bao et.al. | 2407.19657 | null |
2024-07-28 | Conversational AI Multi-Agent Interoperability, Universal Open APIs for Agentic Natural Language Multimodal Communications | Diego Gosmar et.al. | 2407.19438 | null |
2024-07-27 | Collaborative Adaptation for Recovery from Unforeseen Malfunctions in Discrete and Continuous MARL Domains | Yasin Findik et.al. | 2407.19144 | null |
2024-07-27 | Relational Q-Functionals: Multi-Agent Learning to Recover from Unforeseen Robot Malfunctions in Continuous Action Domains | Yasin Findik et.al. | 2407.19128 | null |
2024-07-26 | Solving Robotics Problems in Zero-Shot with Vision-Language Models | Zidan Wang et.al. | 2407.19094 | link |
2024-07-26 | Multi-Robot System Architecture design in SysML and BPMN | Ahmed R. Sadik et.al. | 2407.18749 | null |
2024-07-26 | Multi-Agent Deep Reinforcement Learning for Energy Efficient Multi-Hop STAR-RIS-Assisted Transmissions | Pei-Hsiang Liao et.al. | 2407.18627 | null |
2024-07-26 | Reinforcement Learning for Sustainable Energy: A Survey | Koen Ponse et.al. | 2407.18597 | null |
2024-07-26 | REAPER: Reasoning based Retrieval Planning for Complex RAG Systems | Ashutosh Joshi et.al. | 2407.18553 | null |
2024-07-29 | Multi-Agent Trajectory Prediction with Difficulty-Guided Feature Enhancement Network | Guipeng Xin et.al. | 2407.18551 | link |
2024-07-25 | Multi-Agent Deep Reinforcement Learning for Resilience Optimization in 5G RAN | Soumeya Kaada et.al. | 2407.18066 | null |
2024-07-25 | Long-term Fairness in Ride-Hailing Platform | Yufan Kang et.al. | 2407.17839 | null |
2024-07-25 | Advanced deep-reinforcement-learning methods for flow control: group-invariant and positional-encoding networks improve learning speed and quality | Joogoo Jeon et.al. | 2407.17822 | link |
2024-07-25 | Very Large-Scale Multi-Agent Simulation in AgentScope | Xuchen Pan et.al. | 2407.17789 | link |
2024-07-25 | Strategic Pseudo-Goal Perturbation for Deadlock-Free Multi-Agent Navigation in Social Mini-Games | Abhishek Jha et.al. | 2407.17766 | null |
2024-07-24 | CityX: Controllable Procedural Content Generation for Unbounded 3D Cities | Shougao Zhang et.al. | 2407.17572 | null |
2024-07-24 | A process algebraic framework for multi-agent dynamic epistemic systems | Alessandro Aldini et.al. | 2407.17537 | null |
2024-07-24 | The Möbius Game: A Quantum-Inspired Test of General Relativity | Eleftherios-Ermis Tselentis et.al. | 2407.17203 | null |
2024-07-24 | Reinforced Prompt Personalization for Recommendation with Large Language Models | Wenyu Mao et.al. | 2407.17115 | link |
2024-07-24 | AI-Gadget Kit: Integrating Swarm User Interfaces with LLM-driven Agents for Rich Tabletop Game Applications | Yijie Guo et.al. | 2407.17086 | null |
2024-07-24 | Applications of Multi-Agent Deep Reinforcement Learning Communication in Network Management: A Survey | Yue Pi et.al. | 2407.17030 | null |
2024-07-23 | Topology-Guided ORCA: Smooth Multi-Agent Motion Planning in Constrained Environments | Fatemeh Cheraghi Pouria et.al. | 2407.16771 | null |
2024-07-23 | RedAgent: Red Teaming Large Language Models with Context-aware Autonomous Language Agent | Huiyu Xu et.al. | 2407.16667 | null |
2024-07-24 | Velocity Driven Vision: Asynchronous Sensor Fusion Birds Eye View Models for Autonomous Vehicles | Seamie Hayes et.al. | 2407.16636 | null |
2024-07-23 | Evaluating Uncertainties in Electricity Markets via Machine Learning and Quantum Computing | Shuyang Zhu et.al. | 2407.16404 | null |
2024-07-23 | MOMAland: A Set of Benchmarks for Multi-Objective Multi-Agent Reinforcement Learning | Florian Felten et.al. | 2407.16312 | link |
2024-07-22 | Faster Optimal Coalition Structure Generation via Offline Coalition Selection and Graph-Based Search | Redha Taguelmimt et.al. | 2407.16092 | null |
2024-07-22 | Efficient Replay Memory Architectures in Multi-Agent Reinforcement Learning for Traffic Congestion Control | Mukul Chodhary et.al. | 2407.16034 | null |
2024-07-21 | B2MAPO: A Batch-by-Batch Multi-Agent Policy Optimization to Balance Performance and Efficiency | Wenjing Zhang et.al. | 2407.15077 | null |
2024-07-21 | Multi-Agent Causal Discovery Using Large Language Models | Hao Duong Le et.al. | 2407.15073 | null |
2024-07-20 | POGEMA: A Benchmark Platform for Cooperative Multi-Agent Navigation | Alexey Skrynnik et.al. | 2407.14931 | link |
2024-07-19 | Value Internalization: Learning and Generalizing from Social Reward | Frieda Rong et.al. | 2407.14681 | link |
2024-07-19 | The Vision of Autonomic Computing: Can LLMs Make It a Reality? | Zhiyang Zhang et.al. | 2407.14402 | null |
2024-07-19 | KoMA: Knowledge-driven Multi-agent Framework for Autonomous Driving with Large Language Models | Kemou Jiang et.al. | 2407.14239 | null |
2024-07-18 | Multi-agent Coverage Control: From Discrete Assignments to Continuous Multi-agent Distribution Matching | Solmaz Kia et.al. | 2407.13890 | null |
2024-07-22 | Geometric Data Fusion for Collaborative Attitude Estimation | Yixiao Ge et.al. | 2407.13176 | null |
2024-07-18 | Reconfigurable Intelligent Surface Aided Vehicular Edge Computing: Joint Phase-shift Optimization and Multi-User Power Allocation | Kangwei Qi et.al. | 2407.13123 | link |
2024-07-17 | Agent-E: From Autonomous Web Navigation to Foundational Design Principles in Agentic Systems | Tamer Abuelsaad et.al. | 2407.13032 | link |
2024-07-17 | Multiple Access Integrated Adaptive Finite Blocklength for Ultra-Low Delay in 6G Wireless Networks | Yixin Zhang et.al. | 2407.12706 | null |
2024-07-17 | Mechanism Design via the Interim Relaxation | Kshipra Bhawalkar et.al. | 2407.12699 | null |
2024-07-17 | IICPilot: An Intelligent Integrated Circuit Backend Design Framework Using Open EDA | Zesong Jiang et.al. | 2407.12576 | null |
2024-07-17 | Navigating the Smog: A Cooperative Multi-Agent RL for Accurate Air Pollution Mapping through Data Assimilation | Ichrak Mokhtari et.al. | 2407.12539 | null |
2024-07-17 | Towards Collaborative Intelligence: Propagating Intentions and Reasoning for Multi-Agent Coordination with Large Language Models | Xihe Qiu et.al. | 2407.12532 | null |
2024-07-18 | PersLLM: A Personified Training Approach for Large Language Models | Zheni Zeng et.al. | 2407.12393 | link |
2024-07-16 | Learning to Imitate Spatial Organization in Multi-robot Systems | Ayomide O. Agunloye et.al. | 2407.11592 | null |
2024-07-16 | Distributed Prescribed-Time Convex Optimization: Cascade Design and Time-Varying Gain Approach | Gewei Zuo et.al. | 2407.11413 | null |
2024-07-16 | Prescribed-time Cooperative Output Regulation of Linear Heterogeneous Multi-agent Systems | Gewei Zuo et.al. | 2407.11408 | null |
2024-07-16 | InvAgent: A Large Language Model based Multi-Agent System for Inventory Management in Supply Chains | Yinzhu Quan et.al. | 2407.11384 | link |
2024-07-16 | Digital Twin Vehicular Edge Computing Network: Task Offloading and Resource Allocation | Yu Xie et.al. | 2407.11310 | link |
2024-07-16 | Sibyl: Simple yet Effective Agent Framework for Complex Real-world Reasoning | Yulong Wang et.al. | 2407.10718 | link |
2024-07-16 | Back to Newton’s Laws: Learning Vision-based Agile Flight via Differentiable Physics | Yuang Zhang et.al. | 2407.10648 | null |
2024-07-15 | Cooperative Reward Shaping for Multi-Agent Pathfinding | Zhenyu Song et.al. | 2407.10403 | null |
2024-07-14 | Ontology-driven Reinforcement Learning for Personalized Student Support | Ryan Hare et.al. | 2407.10332 | null |
2024-07-14 | Consensus and Flocking under Communication Failure | Fabio Ancona et.al. | 2407.10306 | null |
2024-07-14 | Learning to Steer Markovian Agents under Model Uncertainty | Jiawei Huang et.al. | 2407.10207 | link |
2024-07-14 | Long-Horizon Planning for Multi-Agent Robots in Partially Observable Environments | Siddharth Nayak et.al. | 2407.10031 | link |
2024-07-13 | AtomAgents: Alloy design and discovery through physics-aware multi-modal multi-agent artificial intelligence | Alireza Ghafarollahi et.al. | 2407.10022 | null |
2024-07-13 | Cohesive Conversations: Enhancing Authenticity in Multi-Agent Simulated Dialogues | KuanChao Chu et.al. | 2407.09897 | null |
2024-07-13 | Synergistic Multi-Agent Framework with Trajectory Learning for Knowledge-Intensive Tasks | Shengbin Yue et.al. | 2407.09893 | link |
2024-07-12 | Benchmarking Large Neighborhood Search for Multi-Agent Path Finding | Jiaqi Tan et.al. | 2407.09451 | link |
2024-07-12 | Security Matrix for Multimodal Agents on Mobile Devices: A Systematic and Proof of Concept Study | Yulong Yang et.al. | 2407.09295 | null |
2024-07-12 | GNN with Model-based RL for Multi-agent Systems | Hanxiao Chen et.al. | 2407.09249 | null |
2024-07-12 | Decentralized multi-agent reinforcement learning algorithm using a cluster-synchronized laser network | Shun Kotoku et.al. | 2407.09124 | null |
2024-07-12 | Fast and Accurate Multi-Agent Trajectory Prediction For Crowded Unknown Scenes | Xiuye Tao et.al. | 2407.09068 | null |
2024-07-12 | Communication-Aware Reinforcement Learning for Cooperative Adaptive Cruise Control | Sicong Jiang et.al. | 2407.08964 | null |
2024-07-15 | Large Language Models as Biomedical Hypothesis Generators: A Comprehensive Evaluation | Biqing Qi et.al. | 2407.08940 | link |
2024-07-11 | United We Stand: Decentralized Multi-Agent Planning With Attrition | Nhat Nguyen et.al. | 2407.08254 | null |
2024-07-11 | ARCO:Adaptive Multi-Agent Reinforcement Learning-Based Hardware/Software Co-Optimization Compiler for Improved Performance in DNN Accelerator Design | Arya Fayyazi et.al. | 2407.08192 | null |
2024-07-11 | Hierarchical Consensus-Based Multi-Agent Reinforcement Learning for Multi-Robot Cooperation Tasks | Pu Feng et.al. | 2407.08164 | null |
2024-07-10 | Field Deployment of Multi-Agent Reinforcement Learning Based Variable Speed Limit Controllers | Yuhang Zhang et.al. | 2407.08021 | null |
2024-07-10 | Flooding Spread of Manipulated Knowledge in LLM-Based Multi-Agent Communities | Tianjie Ju et.al. | 2407.07791 | link |
2024-07-10 | Resource Allocation for Twin Maintenance and Computing Task Processing in Digital Twin Vehicular Edge Computing Network | Yu Xie et.al. | 2407.07575 | link |
2024-07-10 | Long-Term Fairness in Sequential Multi-Agent Selection with Positive Reinforcement | Bhagyashree Puranik et.al. | 2407.07350 | link |
2024-07-09 | Hypothetical Minds: Scaffolding Theory of Mind for Multi-Agent Tasks with Large Language Models | Logan Cross et.al. | 2407.07086 | link |
2024-07-10 | Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence | Weize Chen et.al. | 2407.07061 | link |
2024-07-10 | PEER: Expertizing Domain-Specific Tasks with a Multi-Agent Framework and Tuning Methods | Yiying Wang et.al. | 2407.06985 | link |
2024-07-09 | Richelieu: Self-Evolving LLM-Based Agents for AI Diplomacy | Zhenyu Guan et.al. | 2407.06813 | link |
2024-07-10 | FinCon: A Synthesized LLM Multi-Agent System with Conceptual Verbal Reinforcement for Enhanced Financial Decision Making | Yangyang Yu et.al. | 2407.06567 | null |
2024-07-09 | Fast Distributed Optimization over Directed Graphs under Malicious Attacks using Trust | Arif Kerem Dayı et.al. | 2407.06541 | null |
2024-07-09 | Semantic Communication in Multi-team Dynamic Games: A Mean Field Perspective | Shubham Aggarwal et.al. | 2407.06528 | null |
2024-07-08 | Cyber Physical Games | Warisa Sritriratanarak et.al. | 2407.05817 | link |
2024-07-08 | FedMRL: Data Heterogeneity Aware Federated Multi-agent Deep Reinforcement Learning for Medical Imaging | Pranab Sahoo et.al. | 2407.05800 | link |
2024-07-08 | Multi-agent Reinforcement Learning-based Network Intrusion Detection System | Amine Tellache et.al. | 2407.05766 | null |
2024-07-06 | Deception in Nash Equilibrium Seeking | Michael Tang et.al. | 2407.05168 | null |
2024-07-06 | Multi-agent Off-policy Actor-Critic Reinforcement Learning for Partially Observable Environments | Ainur Zhaikhan et.al. | 2407.04974 | null |
2024-07-05 | Maximizing utility in multi-agent environments by anticipating the behavior of other learners | Angelos Assos et.al. | 2407.04889 | null |
2024-07-05 | A Defeasible Deontic Calculus for Resolving Norm Conflicts | Taylor Olson et.al. | 2407.04869 | null |
2024-07-05 | Simple method for efficiently solving dynamic models with continuous actions using policy gradient | Takeshi Fukasawa et.al. | 2407.04227 | null |
2024-07-09 | Solving Zebra Puzzles Using Constraint-Guided Multi-Agent Systems | Shmuel Berman et.al. | 2407.03956 | null |
2024-07-04 | MobileExperts: A Dynamic Tool-Enabled Agent Team in Mobile Devices | Jiayi Zhang et.al. | 2407.03913 | null |
2024-07-04 | VDMA: Video Question Answering with Dynamically Generated Multi-Agents | Noriyuki Kugo et.al. | 2407.03610 | null |
2024-07-03 | Algorithmic Collusion And The Minimum Price Markov Game | Igor Sadoune et.al. | 2407.03521 | null |
2024-07-03 | A Review of the Applications of Deep Learning-Based Emergent Communication | Brendon Boldt et.al. | 2407.03302 | null |
2024-07-03 | Cooperative Multi-Agent Deep Reinforcement Learning Methods for UAV-aided Mobile Edge Computing Networks | Mintae Kim et.al. | 2407.03280 | null |
2024-07-03 | Hierarchical Large Scale Multirobot Path (Re)Planning | Lishuo Pan et.al. | 2407.02777 | null |
2024-07-03 | Multi-Scenario Combination Based on Multi-Agent Reinforcement Learning to Optimize the Advertising Recommendation System | Yang Zhao et.al. | 2407.02759 | null |
2024-07-03 | MentalAgora: A Gateway to Advanced Personalized Care in Mental Health through Multi-Agent Debating and Attribute Control | Yeonji Lee et.al. | 2407.02736 | null |
2024-07-02 | Wildfire Autonomous Response and Prediction Using Cellular Automata (WARP-CA) | Abdelrahman Ramadan et.al. | 2407.02613 | null |
2024-07-01 | Optimizing Age of Information in Vehicular Edge Computing with Federated Graph Neural Network Multi-Agent Reinforcement Learning | Wenhua Wang et.al. | 2407.02342 | link |
2024-07-01 | A Deep Reinforcement Learning Approach to Battery Management in Dairy Farming via Proximal Policy Optimization | Nawazish Ali et.al. | 2407.01653 | null |
2024-07-01 | CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents | Tianqi Xu et.al. | 2407.01511 | link |
2024-07-01 | Online Learning of Temporal Dependencies for Sustainable Foraging Problem | John Payne et.al. | 2407.01501 | null |
2024-07-01 | Coordination Failure in Cooperative Offline MARL | Callum Rhys Tilbury et.al. | 2407.01343 | null |
2024-07-01 | HGNET: A Hierarchical Feature Guided Network for Occupancy Flow Field Prediction | Zhan Chen et.al. | 2407.01097 | null |
2024-07-01 | Data on the Move: Traffic-Oriented Data Trading Platform Powered by AI Agent with Common Sense | Yi Yu et.al. | 2407.00995 | null |
2024-06-30 | Guarding a Target Area from a Heterogeneous Group of Cooperative Attackers | Yoonjae Lee et.al. | 2407.00762 | null |
2024-07-03 | Diffusion Models for Offline Multi-agent Reinforcement Learning with Safety Constraints | Jianuo Huang et.al. | 2407.00741 | null |
2024-06-30 | Multi-Agent Training for Pommerman: Curriculum Learning and Population-based Self-Play Approach | Nhat-Minh Huynh et.al. | 2407.00662 | null |
2024-07-02 | BMW Agents – A Framework For Task Automation Through Multi-Agent Collaboration | Noel Crawford et.al. | 2406.20041 | null |
2024-06-28 | Learning Branching-Time Properties in CTL and ATL via Constraint Solving | Benjamin Bordais et.al. | 2406.19890 | null |
2024-06-28 | MetaDesigner: Advancing Artistic Typography through AI-Driven, User-Centric, and Multilingual WordArt Synthesis | Jun-Yan He et.al. | 2406.19859 | null |
2024-06-28 | Unlocking Varied Perspectives: A Persona-Based Multi-Agent Framework with Debate-Driven Text Planning for Argument Generation | Zhe Hu et.al. | 2406.19643 | link |
2024-06-27 | Multi-Agent Search-Type Problems on Polygons | Konstantinos Georgiou et.al. | 2406.19495 | null |
2024-06-27 | Multi-agent Cooperative Games Using Belief Map Assisted Training | Qinwei Huang et.al. | 2406.19477 | link |
2024-06-27 | Simulating Classroom Education with LLM-Empowered Agents | Zheyuan Zhang et.al. | 2406.19226 | null |
2024-06-27 | Formation Under Communication Constraints: Control Performance Meets Channel Capacity | Yaru Chen et.al. | 2406.18961 | null |
2024-06-27 | LayoutCopilot: An LLM-powered Multi-agent Collaborative Framework for Interactive Analog Layout Design | Bingyang Liu et.al. | 2406.18873 | null |
2024-06-26 | Algebraic Connectivity Control and Maintenance in Multi-Agent Networks under Attack | Wenjie Zhao et.al. | 2406.18467 | null |
2024-06-26 | Intrinsic Action Tendency Consistency for Cooperative Multi-Agent Reinforcement Learning | Junkai Zhang et.al. | 2406.18152 | null |
2024-06-25 | The Overcooked Generalisation Challenge | Constantin Ruhdorfer et.al. | 2406.17949 | link |
2024-06-25 | Pixel-weighted Multi-pose Fusion for Metal Artifact Reduction in X-ray Computed Tomography | Diyu Yang et.al. | 2406.17897 | null |
2024-06-25 | Temporal Prototype-Aware Learning for Active Voltage Control on Power Distribution Networks | Feiyang Xu et.al. | 2406.17818 | link |
2024-06-25 | CuDA2: An approach for Incorporating Traitor Agents into Cooperative Multi-Agent Systems | Zhen Chen et.al. | 2406.17425 | null |
2024-06-25 | Hierarchical Framework for Optimizing Wildfire Surveillance and Suppression using Human-Autonomous Teaming | Mahdi Al-Husseini et.al. | 2406.17189 | null |
2024-06-24 | Quantum Multi-Agent Reinforcement Learning for Cooperative Mobile Access in Space-Air-Ground Integrated Networks | Gyu Seon Kim et.al. | 2406.16994 | null |
2024-06-24 | An Active Search Strategy with Multiple Unmanned Aerial Systems for Multiple Targets | Chuanxiang Gao et.al. | 2406.16370 | null |
2024-06-24 | YouDream: Generating Anatomically Controllable Consistent Text-to-3D Animals | Sandeep Mishra et.al. | 2406.16273 | null |
2024-06-22 | Decentralized Transformers with Centralized Aggregation are Sample-Efficient Multi-Agent World Models | Yang Zhang et.al. | 2406.15836 | link |
2024-06-22 | Fault-tolerant control of random switching topology multi-agent system based on event triggering | Ouyang Lingcong et.al. | 2406.15770 | null |
2024-06-21 | Contextual Sprint Classification in Soccer Based on Deep Learning | Hyunsung Kim et.al. | 2406.15659 | null |
2024-06-21 | Effects of non-uniform number of actions by Hawkes process on spatial cooperation | Daiki Miyagawa et.al. | 2406.15036 | null |
2024-06-21 | Autonomous Agents for Collaborative Task under Information Asymmetry | Wei Liu et.al. | 2406.14928 | link |
2024-06-21 | Decentralized Concurrent Learning with Coordinated Momentum and Restart | Daniel E. Ochoa et.al. | 2406.14802 | null |
2024-06-20 | Multi-Task Lane-Free Driving Strategy for Connected and Automated Vehicles: A Multi-Agent Deep Reinforcement Learning Approach | Mehran Berahman et.al. | 2406.14766 | null |
2024-06-20 | Singular knee identification to support emergence recognition in physical swarm and cellular automata trajectories | Imraan A. Faruque et.al. | 2406.14652 | null |
2024-06-20 | CooHOI: Learning Cooperative Human-Object Interaction with Manipulated Object Dynamics | Jiawei Gao et.al. | 2406.14558 | null |
2024-06-20 | Vectorized Representation Dreamer (VRD): Dreaming-Assisted Multi-Agent Motion-Forecasting | Hunter Schofield et.al. | 2406.14415 | null |
2024-06-20 | Artificial Leviathan: Exploring Social Evolution of LLM Agents Through the Lens of Hobbesian Social Contract Theory | Gordon Dai et.al. | 2406.14373 | null |
2024-06-20 | EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms | Siyu Yuan et.al. | 2406.14228 | link |
2024-06-20 | Tractable Equilibrium Computation in Markov Games through Risk Aversion | Eric Mazumdar et.al. | 2406.14156 | null |
2024-06-20 | Primal-Dual Strategy (PDS) for Composite Optimization Over Directed graphs | Sajad Zandi et.al. | 2406.14011 | null |
2024-06-20 | Robust Cooperative Multi-Agent Reinforcement Learning:A Mean-Field Type Game Perspective | Muhammad Aneeq uz Zaman et.al. | 2406.13992 | null |
2024-06-20 | Soft-QMIX: Integrating Maximum Entropy For Monotonic Value Function Factorization | Wentse Chen et.al. | 2406.13930 | link |
2024-06-19 | CLAMP: Majorized Plug-and-Play for Coherent 3D LIDAR Imaging | Tony G. Allen et.al. | 2406.13651 | null |
2024-06-19 | CoDreamer: Communication-Based Decentralised World Models | Edan Toledo et.al. | 2406.13600 | null |
2024-06-18 | MAGIC: Generating Self-Correction Guideline for In-Context Text-to-SQL | Arian Askari et.al. | 2406.12692 | link |
2024-06-18 | Ask-before-Plan: Proactive Language Agents for Real-World Planning | Xuan Zhang et.al. | 2406.12639 | link |
2024-06-18 | Large Language Models based Multi-Agent Framework for Objective Oriented Control Design in Power Electronics | Chenggang Cui et.al. | 2406.12628 | null |
2024-06-18 | Problem-Solving in Language Model Networks | Ciaran Regan et.al. | 2406.12374 | link |
2024-06-18 | Leveraging Large Language Model for Heterogeneous Ad Hoc Teamwork Collaboration | Xinzhu Liu et.al. | 2406.12224 | null |
2024-06-18 | Debate as Optimization: Adaptive Conformal Prediction and Diverse Retrieval for Event Extraction | Sijia Wang et.al. | 2406.12197 | null |
2024-06-17 | Improving Multi-Agent Debate with Sparse Communication Topology | Yunxuan Li et.al. | 2406.11776 | null |
2024-06-17 | Communication-Efficient MARL for Platoon Stability and Energy-efficiency Co-optimization in Cooperative Adaptive Cruise Control of CAVs | Min Hua et.al. | 2406.11653 | null |
2024-06-17 | Counterfactual Debating with Preset Stances for Hallucination Elimination of LLMs | Yi Fang et.al. | 2406.11514 | link |
2024-06-17 | Decentralized Collaborative Pricing and Shunting for Multiple EV Charging Stations Based on Multi-Agent Reinforcement Learning | Tianhao Bu et.al. | 2406.11496 | null |
2024-06-17 | Can AI with High Reasoning Ability Replicate Human-like Decision Making in Economic Experiments? | Ayato Kitadai et.al. | 2406.11426 | null |
2024-06-17 | KAOS: Large Model Multi-Agent Operating System | Zhao Zhuo et.al. | 2406.11342 | null |
2024-06-17 | Reconfigurable Intelligent Surface Assisted VEC Based on Multi-Agent Reinforcement Learning | Kangwei Qi et.al. | 2406.11318 | link |
2024-06-17 | Balancing Performance and Cost for Two-Hop Cooperative Communications: Stackelberg Game and Distributed Multi-Agent Reinforcement Learning | Yuanzhe Geng et.al. | 2406.11265 | link |
2024-06-17 | STEVE Series: Step-by-Step Construction of Agent Systems in Minecraft | Zhonghan Zhao et.al. | 2406.11247 | null |
2024-06-17 | The Benefits of Power Regularization in Cooperative Reinforcement Learning | Michelle Li et.al. | 2406.11240 | null |
2024-06-14 | Gradient-based Learning in State-based Potential Games for Self-Learning Production Systems | Steve Yuwono et.al. | 2406.10015 | null |
2024-06-14 | Consistent Update Synthesis via Privatized Beliefs | Thomas Schlögl et.al. | 2406.10010 | null |
2024-06-14 | Think Deep and Fast: Learning Neural Nonlinear Opinion Dynamics from Inverse Dynamic Games for Split-Second Interactions | Haimin Hu et.al. | 2406.09810 | null |
2024-06-14 | Mix Q-learning for Lane Changing: A Collaborative Decision-Making Method in Multi-Agent Deep Reinforcement Learning | Xiaojun Bi et.al. | 2406.09755 | null |
2024-06-13 | Characterising Interventions in Causal Games | Manuj Mishra et.al. | 2406.09318 | null |
2024-06-13 | Applying Multi-Agent Negotiation to Solve the Production Routing Problem With Privacy Preserving | Luiza Pellin Biasoto et.al. | 2406.09214 | null |
2024-06-13 | Dispelling the Mirage of Progress in Offline MARL through Standardised Baselines and Evaluation | Claude Formanek et.al. | 2406.09068 | link |
2024-06-13 | Multi-Agent Software Development through Cross-Team Collaboration | Zhuoyun Du et.al. | 2406.08979 | link |
2024-06-13 | Equilibrium Selection for Multi-agent Reinforcement Learning: A Unified Framework | Runyu Zhang et.al. | 2406.08844 | null |
2024-06-13 | Batch-Instructed Gradient for Prompt Evolution:Systematic Prompt Optimization for Enhanced Text-to-Image Synthesis | Xinrui Yang et.al. | 2406.08713 | link |
2024-06-12 | AlphaZeroES: Direct score maximization outperforms planning loss minimization | Carlos Martin et.al. | 2406.08687 | null |
2024-06-12 | CrowdEgress: A Multi-Agent Simulation Platform for Pedestrian Crowd | Peng Wang et.al. | 2406.08190 | null |
2024-06-12 | Efficient Adaptation in Mixed-Motive Environments via Hierarchical Opponent Modeling and Planning | Yizhe Huang et.al. | 2406.08002 | null |
2024-06-12 | A Federated Online Restless Bandit Framework for Cooperative Resource Allocation | Jingwen Tong et.al. | 2406.07992 | null |
2024-06-13 | Carbon Market Simulation with Adaptive Mechanism Design | Han Wang et.al. | 2406.07875 | link |
2024-06-12 | Multi-agent Reinforcement Learning with Deep Networks for Diverse Q-Vectors | Zhenglong Luo et.al. | 2406.07848 | null |
2024-06-11 | Multi-objective optimization for multi-agent injection strategies in subsurface CO $_2$ storage | Per Pettersson et.al. | 2406.07711 | null |
2024-06-11 | Scalable Optimal Motion Planning for Multi-Agent Systems by Cosserat Theory of Rods | Amirreza Fahim Golestaneh et.al. | 2406.07684 | null |
2024-06-11 | Choreographing the Rhythms of Observation: Dynamics for Ranged Observer Bipartite-Unipartite SpatioTemporal (ROBUST) Networks | Ted Edward Holmberg et.al. | 2406.07473 | null |
2024-06-11 | Federated Multi-Agent DRL for Radio Resource Management in Industrial 6G in-X subnetworks | Bjarke Madsen et.al. | 2406.07383 | null |
2024-06-11 | EdgeTimer: Adaptive Multi-Timescale Scheduling in Mobile Edge Computing with Deep Reinforcement Learning | Yijun Hao et.al. | 2406.07342 | null |
2024-06-11 | Scaling Large-Language-Model-based Multi-Agent Collaboration | Chen Qian et.al. | 2406.07155 | link |
2024-06-11 | CoEvol: Constructing Better Responses for Instruction Finetuning through Multi-Agent Cooperation | Renhao Li et.al. | 2406.07054 | link |
2024-06-11 | Arbitrary-Order Distributed Finite-Time Differentiator for Multi-Agent Systems | Weile Chen et.al. | 2406.07031 | null |
2024-06-11 | DNN Partitioning, Task Offloading, and Resource Allocation in Dynamic Vehicular Networks: A Lyapunov-Guided Diffusion-Based Reinforcement Learning Approach | Zhang Liu et.al. | 2406.06986 | null |
2024-06-10 | Locally Interdependent Multi-Agent MDP: Theoretical Framework for Decentralized Agents with Dynamic Dependencies | Alex DeWeese et.al. | 2406.06823 | null |
2024-06-10 | Adaptive Opponent Policy Detection in Multi-Agent MDPs: Real-Time Strategy Switch Identification Using Running Error Estimation | Mohidul Haque Mridul et.al. | 2406.06500 | null |
2024-06-11 | Reasoning in Token Economies: Budget-Aware Evaluation of LLM Reasoning Strategies | Junlin Wang et.al. | 2406.06461 | null |
2024-06-11 | iMotion-LLM: Motion Prediction Instruction Tuning | Abdulwahab Felemban et.al. | 2406.06211 | null |
2024-06-10 | Deep Multi-Objective Reinforcement Learning for Utility-Based Infrastructural Maintenance Optimization | Jesse van Remmerden et.al. | 2406.06184 | link |
2024-06-10 | Risk Sensitivity in Markov Games and Multi-Agent Reinforcement Learning: A Systematic Review | Hafez Ghaemi et.al. | 2406.06041 | null |
2024-06-09 | Multi-UAV Trajectory Design for Fair and Secure Communication | Hongjiang Lei et.al. | 2406.05936 | null |
2024-06-11 | Deception Analysis with Artificial Intelligence: An Interdisciplinary Perspective | Stefan Sarkadi et.al. | 2406.05724 | null |
2024-06-09 | VillagerAgent: A Graph-Based Multi-Agent Framework for Coordinating Complex Task Dependencies in Minecraft | Yubo Dong et.al. | 2406.05720 | link |
2024-06-09 | A Superalignment Framework in Autonomous Driving with Large Language Models | Xiangrui Kong et.al. | 2406.05651 | null |
2024-06-09 | Cross Language Soccer Framework: An Open Source Framework for the RoboCup 2D Soccer Simulation | Nader Zare et.al. | 2406.05621 | link |
2024-06-07 | LLM-Vectorizer: LLM-based Verified Loop Vectorizer | Jubi Taneja et.al. | 2406.04693 | null |
2024-06-07 | Mean-field limit of non-exchangeable multi-agent systems over hypergraphs with unbounded rank | Nathalie Ayi et.al. | 2406.04691 | null |
2024-06-07 | meSch: Multi-Agent Energy-Aware Scheduling for Task Persistence | Kaleb Ben Naveed et.al. | 2406.04560 | null |
2024-06-06 | Online Joint Fine-tuning of Multi-Agent Flows | Paul Mineiro et.al. | 2406.04516 | link |
2024-06-06 | Optimizing Autonomous Driving for Safety: A Human-Centric Approach with LLM-Enhanced RLHF | Yuan Sun et.al. | 2406.04481 | null |
2024-06-06 | Multi-Agent Imitation Learning: Value is Easy, Regret is Hard | Jingwu Tang et.al. | 2406.04219 | null |
2024-06-06 | MARLander: A Local Path Planning for Drone Swarms using Multiagent Deep Reinforcement Learning | Demetros Aschu et.al. | 2406.04159 | null |
2024-06-06 | Online Learning in Betting Markets: Profit versus Prediction | Haiqing Zhu et.al. | 2406.04062 | null |
2024-06-06 | Mini Honor of Kings: A Lightweight Environment for Multi-Agent Reinforcement Learning | Lin Liu et.al. | 2406.03978 | link |
2024-06-05 | AD-H: Autonomous Driving with Hierarchical Agents | Zaibin Zhang et.al. | 2406.03474 | null |
2024-06-05 | CommonPower: Supercharging Machine Learning for Smart Grids | Michael Eichelbeck et.al. | 2406.03231 | link |
2024-06-05 | Towards Detecting LLMs Hallucination via Markov Chain-based Multi-agent Debate Framework | Xiaoxi Sun et.al. | 2406.03075 | null |
2024-06-05 | Representation Learning For Efficient Deep Multi-Agent Reinforcement Learning | Dom Huh et.al. | 2406.02890 | null |
2024-06-04 | Chain of Agents: Large Language Models Collaborating on Long-Context Tasks | Yusen Zhang et.al. | 2406.02818 | null |
2024-06-04 | FightLadder: A Benchmark for Competitive Multi-Agent Reinforcement Learning | Wenzhe Li et.al. | 2406.02081 | null |
2024-06-04 | Large Language Model-Enabled Multi-Agent Manufacturing Systems | Jonghan Lim et.al. | 2406.01893 | null |
2024-06-03 | Multi-Agent Reinforcement Learning Meets Leaf Sequencing in Radiotherapy | Riqiang Gao et.al. | 2406.01853 | null |
2024-06-03 | ZAPP! Zonotope Agreement of Prediction and Planning for Continuous-Time Collision Avoidance with Discrete-Time Dynamics | Luca Paparusso et.al. | 2406.01814 | null |
2024-06-03 | Leader-Follower Density Control of Spatial Dynamics in Large-Scale Multi-Agent Systems | Gian Carlo Maffettone et.al. | 2406.01804 | link |
2024-06-03 | Multi-agent assignment via state augmented reinforcement learning | Leopoldo Agorio et.al. | 2406.01782 | null |
2024-06-03 | AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation | Junhao Cheng et.al. | 2406.01388 | link |
2024-06-03 | Multi-Agent Transfer Learning via Temporal Contrastive Learning | Weihao Zeng et.al. | 2406.01377 | null |
2024-06-03 | BELLS: A Framework Towards Future Proof Benchmarks for the Evaluation of LLM Safeguards | Diego Dorn et.al. | 2406.01364 | null |
2024-06-03 | CodeR: Issue Resolving with Multi-Agent and Task Graphs | Dong Chen et.al. | 2406.01304 | link |
2024-06-03 | Fusion-PSRO: Nash Policy Fusion for Policy Space Response Oracles | Jiesong Lian et.al. | 2405.21027 | null |
2024-05-31 | Scalable Distance-based Multi-Agent Relative State Estimation via Block Multiconvex Optimization | Tianyue Wu et.al. | 2405.20883 | null |
2024-05-31 | CSDO: Enhancing Efficiency and Success in Large-Scale Multi-Vehicle Trajectory Planning | Yibin Yang et.al. | 2405.20858 | link |
2024-05-31 | InsightSee: Advancing Multi-agent Vision-Language Models for Enhanced Visual Understanding | Huaxiang Zhang et.al. | 2405.20795 | null |
2024-05-31 | No-Regret Learning for Fair Multi-Agent Social Welfare Optimization | Mengxiao Zhang et.al. | 2405.20678 | null |
2024-05-30 | Quality of Non-Convergent Best Response Processes in Multi-Agent Systems through Sink Equilibrium | Rohit Konda et.al. | 2405.20426 | null |
2024-05-30 | Towards Hierarchical Multi-Agent Workflows for Zero-Shot Prompt Optimization | Yuchi Liu et.al. | 2405.20252 | link |
2024-05-30 | Safe Multi-agent Reinforcement Learning with Natural Language Constraints | Ziyan Wang et.al. | 2405.20018 | null |
2024-05-30 | LAGMA: LAtent Goal-guided Multi-Agent Reinforcement Learning | Hyungho Na et.al. | 2405.19998 | link |
2024-05-30 | A Deep Reinforcement Learning Approach for Trading Optimization in the Forex Market with Multi-Agent Asynchronous Distribution | Davoud Sarani et.al. | 2405.19982 | null |
2024-05-30 | From Words to Actions: Unveiling the Theoretical Underpinnings of LLM-Driven Autonomous Systems | Jianliang He et.al. | 2405.19883 | null |
2024-05-30 | Approximate Global Convergence of Independent Learning in Multi-Agent Systems | Ruiyang Jin et.al. | 2405.19811 | null |
2024-05-29 | Distributed Online Planning for Min-Max Problems in Networked Markov Games | Alexandros E. Tzikas et.al. | 2405.19570 | link |
2024-05-29 | Decentralized Optimization in Time-Varying Networks with Arbitrary Delays | Tomas Ortega et.al. | 2405.19513 | link |
2024-05-29 | Adaptive In-conversation Team Building for Language Model Agents | Linxin Song et.al. | 2405.19425 | link |
2024-05-29 | Normative Modules: A Generative Agent Architecture for Learning Norms that Supports Multi-Agent Cooperation | Atrisha Sarkar et.al. | 2405.19328 | null |
2024-05-29 | Conditional Latent ODEs for Motion Prediction in Autonomous Driving | Khang Truong Giang et.al. | 2405.19183 | link |
2024-05-29 | Cephalo: Multi-Modal Vision-Language Models for Bio-Inspired Materials Analysis and Design | Markus J. Buehler et.al. | 2405.19076 | link |
2024-05-29 | Resilient Average Consensus with Adversaries via Distributed Detection and Recovery | Liwei Yuan et.al. | 2405.18752 | null |
2024-05-29 | Efficient Learning in Chinese Checkers: Comparing Parameter Sharing in Multi-Agent Reinforcement Learning | Noah Adhikari et.al. | 2405.18733 | link |
2024-05-29 | Identifying the Most Influential Driver Nodes for Pinning Control of Multi-Agent Systems with Time-Varying Topology | Guangrui Zhang et.al. | 2405.18712 | null |
2024-05-29 | Bridging the Gap between Partially Observable Stochastic Games and Sparse POMDP Methods | Tyler Becker et.al. | 2405.18703 | null |
2024-05-28 | Synchronization on circles and spheres with nonlinear interactions | Christopher Criscitiello et.al. | 2405.18273 | null |
2024-05-28 | Safe Multi-Agent Reinforcement Learning with Bilevel Optimization in Autonomous Driving | Zhi Zheng et.al. | 2405.18209 | link |
2024-05-28 | Mutation-Bias Learning in Games | Johann Bauer et.al. | 2405.18190 | null |
2024-05-28 | PyTAG: Tabletop Games for Multi-Agent Reinforcement Learning | Martin Balla et.al. | 2405.18123 | link |
2024-05-28 | ATM: Adversarial Tuning Multi-agent System Makes a Robust Retrieval-Augmented Generator | Junda Zhu et.al. | 2405.18111 | link |
2024-05-28 | Individual Contributions as Intrinsic Exploration Scaffolds for Multi-agent Reinforcement Learning | Xinran Li et.al. | 2405.18110 | link |
2024-05-28 | LLM experiments with simulation: Large Language Model Multi-Agent System for Process Simulation Parametrization in Digital Twins | Yuchen Xia et.al. | 2405.18092 | link |
2024-05-28 | Cognitive Insights and Stable Coalition Matching for Fostering Multi-Agent Cooperation | Jiaqi Shao et.al. | 2405.18044 | null |
2024-05-28 | LNS2+RL: Combining Multi-agent Reinforcement Learning with Large Neighborhood Search in Multi-agent Path Finding | Yutong Wang et.al. | 2405.17794 | link |
2024-05-28 | ORLM: Training Large Language Models for Optimization Modeling | Zhengyang Tang et.al. | 2405.17743 | link |
2024-05-27 | BehaviorGPT: Smart Agent Simulation for Autonomous Driving with Next-Patch Prediction | Zikang Zhou et.al. | 2405.17372 | null |
2024-05-27 | Suppressing defection by increasing temptation: the impact of smart cooperators on a social dilemma situation | Hsuan-Wei Lee et.al. | 2405.17268 | null |
2024-05-27 | Flow control of three-dimensional cylinders transitioning to turbulence via multi-agent reinforcement learning | P. Suárez et.al. | 2405.17210 | null |
2024-05-27 | Distributed Riemannian Stochastic Gradient Tracking Algorithm on the Stiefel Manifold | Jishu Zhao et.al. | 2405.16900 | null |
2024-05-27 | A Large Language Model-based multi-agent manufacturing system for intelligent shopfloor | Zhen Zhao et.al. | 2405.16887 | null |
2024-05-27 | Knowing What Not to Do: Leverage Language Model Insights for Action Space Pruning in Multi-agent Reinforcement Learning | Zhihao Liu et.al. | 2405.16854 | link |
2024-05-27 | Advancing Behavior Generation in Mobile Robotics through High-Fidelity Procedural Simulations | Victor A. Kich et.al. | 2405.16818 | link |
2024-05-27 | LLM-Based Cooperative Agents using Information Relevance and Plan Validation | SeungWon Seo et.al. | 2405.16751 | null |
2024-05-26 | Mimicry and the Emergence of Cooperative Communication | Dylan Cope et.al. | 2405.16622 | null |
2024-05-28 | Meta-Task Planning for Language Agents | Cong Zhang et.al. | 2405.16510 | link |
2024-05-24 | SMART: Scalable Multi-agent Real-time Simulation via Next-token Prediction | Wei Wu et.al. | 2405.15677 | link |
2024-05-24 | Human-in-the-loop Reinforcement Learning for Data Quality Monitoring in Particle Physics Experiments | Olivia Jullian Parra et.al. | 2405.15508 | null |
2024-05-24 | Distributed Adaptive Control of Disturbed Interconnected Systems with High-Order Tuners | Moh. Kamalul Wafi et.al. | 2405.15178 | null |
2024-05-24 | CulturePark: Boosting Cross-cultural Understanding in Large Language Models | Cheng Li et.al. | 2405.15145 | link |
2024-05-23 | Controlling Behavioral Diversity in Multi-Agent Reinforcement Learning | Matteo Bettini et.al. | 2405.15054 | link |
2024-05-23 | CityGPT: Towards Urban IoT Learning, Analysis and Interaction with Multi-Agent System | Qinghua Guan et.al. | 2405.14691 | null |
2024-05-23 | AI-Olympics: Exploring the Generalization of Agents through Open Competitions | Chen Wang et.al. | 2405.14358 | null |
2024-05-26 | Towards Efficient LLM Grounding for Embodied Multi-Agent Collaboration | Yang Zhang et.al. | 2405.14314 | null |
2024-05-23 | A finite time analysis of distributed Q-learning | Han-Dong Lim et.al. | 2405.14078 | null |
2024-05-22 | Distributed and Decentralized Control and Task Allocation for Flexible Swarms | Yigal Koifman et.al. | 2405.13941 | null |
2024-05-22 | GameVLM: A Decision-making Framework for Robotic Task Planning Based on Visual Language Models and Zero-sum Games | Aoran Mei et.al. | 2405.13751 | null |
2024-05-22 | Towards a Distributed Platform for Normative Reasoning and Value Alignment in Multi-Agent Systems | Miguel Garcia-Bohigues et.al. | 2405.13543 | null |
2024-05-22 | Non-Deterministic Planning for Hyperproperty Verification | Raven Beutner et.al. | 2405.13488 | null |
2024-05-21 | Multi-Agent Reinforcement Learning with Hierarchical Coordination for Emergency Responder Stationing | Amutheezan Sivagnanam et.al. | 2405.13205 | null |
2024-05-21 | Reinforcement Learning Enabled Peer-to-Peer Energy Trading for Dairy Farms | Mian Ibad Ali Shah et.al. | 2405.12716 | null |
2024-05-21 | Fight Fire with Fire: How Much Can We Trust ChatGPT on Source Code-Related Tasks? | Xiao Yu et.al. | 2405.12641 | null |
2024-05-21 | Tiny Refinements Elicit Resilience: Toward Efficient Prefix-Model Against LLM Red-Teaming | Jiaxu Liu et.al. | 2405.12604 | null |
2024-05-21 | Optimizing Generative AI Networking: A Dual Perspective with Multi-Agent Systems and Mixture of Experts | Ruichen Zhang et.al. | 2405.12472 | null |
2024-05-20 | Continual Deep Reinforcement Learning for Decentralized Satellite Routing | Federico Lozano-Cuadra et.al. | 2405.12308 | null |
2024-05-20 | Multi-Agent Optimization and Learning: A Non-Expansive Operators Perspective | Nicola Bastianello et.al. | 2405.11999 | null |
2024-05-20 | PET: Multi-agent Independent PPO-based Automatic ECN Tuning for High-Speed Data Center Networks | Kai Cheng et.al. | 2405.11956 | null |
2024-05-20 | (Perhaps) Beyond Human Translation: Harnessing Multi-Agent Collaboration for Translating Ultra-Long Literary Texts | Minghao Wu et.al. | 2405.11804 | link |
2024-05-20 | Efficient Multi-agent Reinforcement Learning by Planning | Qihan Liu et.al. | 2405.11778 | link |
2024-05-20 | Configurable Mirror Descent: Towards a Unification of Decision Making | Pengdeng Li et.al. | 2405.11746 | link |
2024-05-19 | The Logical Art of Keeping a True Secret | Alessandro Aldini et.al. | 2405.11654 | null |
2024-05-18 | MapCoder: Multi-Agent Code Generation for Competitive Problem Solving | Md. Ashraful Islam et.al. | 2405.11403 | link |
2024-05-18 | Cooperative Multi-agent Approach for Automated Computer Game Testing | Samira Shirzadeh-hajimahmood et.al. | 2405.11347 | null |
2024-05-17 | LLM-based Multi-Agent Reinforcement Learning: Current and Future Directions | Chuanneng Sun et.al. | 2405.11106 | null |
2024-05-17 | Pragmatic Communication for Remote Control of Finite-State Markov Processes | Pietro Talli et.al. | 2405.10672 | null |
2024-05-17 | Guidelines for evaluation of complex multi agent test scenarios | Ana Isabel Garcia Guerra et.al. | 2405.10526 | null |
2024-05-17 | Rethinking ChatGPT’s Success: Usability and Cognitive Behaviors Enabled by Auto-regressive LLMs’ Prompting | Xinzhe Li et.al. | 2405.10474 | null |
2024-05-16 | DEBATE: Devil’s Advocate-Based Assessment and Text Evaluation | Alex Kim et.al. | 2405.09935 | link |
2024-05-14 | A Distributed Approach to Autonomous Intersection Management via Multi-Agent Reinforcement Learning | Matteo Cederle et.al. | 2405.08655 | link |
2024-05-14 | Learning Multi-Agent Communication from Graph Modeling Perspective | Shengchao Hu et.al. | 2405.08550 | link |
2024-05-14 | Safety Constrained Multi-Agent Reinforcement Learning for Active Voltage Control | Yang Qu et.al. | 2405.08443 | null |
2024-05-14 | Multi-Agent Combinatorial Contracts | Paul Duetting et.al. | 2405.08260 | null |
2024-05-15 | POWQMIX: Weighted Value Factorization with Potentially Optimal Joint Actions Recognition for Cooperative Multi-Agent Reinforcement Learning | Chang Huang et.al. | 2405.08036 | null |
2024-05-17 | MADRL-Based Rate Adaptation for 360° Video Streaming with Multi-Viewpoint Prediction | Haopeng Wang et.al. | 2405.07759 | null |
2024-05-13 | Non-Rigid Designators in Modal and Temporal Free Description Logics (Extended Version) | Alessandro Artale et.al. | 2405.07656 | null |
2024-05-14 | Towards Adaptive IMFs – Generalization of utility functions in Multi-Agent Frameworks | Kaushik Dey et.al. | 2405.07621 | null |
2024-05-12 | AdaptNet: Rethinking Sensing and Communication for a Seamless Internet of Drones Experience | Ananya Hazarika et.al. | 2405.07318 | null |
2024-05-12 | MAxPrototyper: A Multi-Agent Generation System for Interactive User Interface Prototyping | Mingyue Yuan et.al. | 2405.07131 | null |
2024-05-11 | Optimal Multilayered Motion Planning for Multiple Differential Drive Mobile Robots with Hierarchical Prioritization (OM-MP) | Zong Chen et.al. | 2405.07043 | null |
2024-05-11 | Multi-agent Traffic Prediction via Denoised Endpoint Distribution | Yao Liu et.al. | 2405.07041 | null |
2024-05-11 | Fairness in Reinforcement Learning: A Survey | Anka Reuel et.al. | 2405.06909 | null |
2024-05-11 | Event GDR: Event-Centric Generative Document Retrieval | Yong Guan et.al. | 2405.06886 | null |
2024-05-10 | Sensing-Assisted Adaptive Channel Contention for Mobile Delay-Sensitive Communications | Bojie Lv et.al. | 2405.06186 | null |
2024-05-10 | (A Partial Survey of) Decentralized, Cooperative Multi-Agent Reinforcement Learning | Christopher Amato et.al. | 2405.06161 | null |
2024-05-09 | Smurfs: Leveraging Multiple Proficiency Agents with Context-Efficiency for Tool Planning | Junzhi Chen et.al. | 2405.05955 | link |
2024-05-09 | Federated Combinatorial Multi-Agent Multi-Armed Bandits | Fares Fourati et.al. | 2405.05950 | null |
2024-05-09 | Approximate Dec-POMDP Solving Using Multi-Agent A* | Wietze Koops et.al. | 2405.05662 | null |
2024-05-09 | Dynamic Deep Factor Graph for Multi-Agent Reinforcement Learning | Yuchen Shi et.al. | 2405.05542 | link |
2024-05-08 | Learning to Play Pursuit-Evasion with Dynamic and Sensor Constraints | Burak M. Gonultas et.al. | 2405.05372 | null |
2024-05-07 | Mitigating Negative Side Effects in Multi-Agent Systems Using Blame Assignment | Pulkit Rustagi et.al. | 2405.04702 | null |
2024-05-07 | Visually Guided Swarm Motion Coordination via Insect-inspired Small Target Motion Reactions | Md Arif Billah et.al. | 2405.04591 | null |
2024-05-07 | Parallelized Multi-Agent Bayesian Optimization in Lava | Shay Snyder et.al. | 2405.04387 | link |
2024-05-07 | Enhancing the Efficiency and Accuracy of Underlying Asset Reviews in Structured Finance: The Application of Multi-agent Framework | Xiangpeng Wan et.al. | 2405.04294 | link |
2024-05-07 | Certified Policy Verification and Synthesis for MDPs under Distributional Reach-avoidance Properties | S. Akshay et.al. | 2405.04015 | null |
2024-05-07 | Latency and Energy Minimization in NOMA-Assisted MEC Network: A Federated Deep Reinforcement Learning Approach | Arian Ahmadi et.al. | 2405.04012 | null |
2024-05-06 | Conformity, Confabulation, and Impersonation: Persona Inconstancy in Multi-Agent LLM Collaboration | Razan Baltaji et.al. | 2405.03862 | link |
2024-05-06 | Select to Perfect: Imitating desired behavior from large multi-agent data | Tim Franzmeyer et.al. | 2405.03735 | null |
2024-05-06 | MARE: Multi-Agents Collaboration Framework for Requirements Engineering | Dongming Jin et.al. | 2405.03256 | null |
2024-05-06 | A Multi-Agent Rollout Approach for Highway Bottleneck Decongenston in Mixed Autonomy | Lu Liu et.al. | 2405.03132 | null |
2024-05-06 | Compression-based Privacy Preservation for Distributed Nash Equilibrium Seeking in Aggregative Games | Wei Huo et.al. | 2405.03106 | null |
2024-05-06 | FairMonitor: A Dual-framework for Detecting Stereotypes and Biases in Large Language Models | Yanhong Bai et.al. | 2405.03098 | null |
2024-05-05 | Traffic Performance GPT (TP-GPT): Real-Time Data Informed Intelligent ChatBot for Transportation Surveillance and Management | Bingzhang Wang et.al. | 2405.03076 | null |
2024-05-05 | A Long-Short-Term Mixed-Integer Formulation for Highway Lane Change Planning | Rudolf Reiter et.al. | 2405.02979 | null |
2024-05-05 | Multi-Agent RL-Based Industrial AIGC Service Offloading over Wireless Edge Networks | Siyuan Li et.al. | 2405.02972 | null |
2024-05-05 | Language Evolution for Evading Social Media Regulation via LLM-based Multi-agent Simulation | Jinyu Cai et.al. | 2405.02858 | link |
2024-05-05 | Modelling Opaque Bilateral Market Dynamics in Financial Trading: Insights from a Multi-Agent Simulation Study | Alicia Vidler et.al. | 2405.02849 | null |
2024-05-05 | Probabilistic tube-based control synthesis of stochastic multi-agent systems under signal temporal logic | Eleftherios E. Vlahakis et.al. | 2405.02827 | null |
2024-05-03 | The Cambridge RoboMaster: An Agile Multi-Robot Research Platform | Jan Blumenkamp et.al. | 2405.02198 | null |
2024-05-03 | Simulating the economic impact of rationality through reinforcement learning and agent-based modelling | Simone Brusatin et.al. | 2405.02161 | link |
2024-05-03 | Zero-Sum Positional Differential Games as a Framework for Robust Reinforcement Learning: Deep Q-Learning Approach | Anton Plaksin et.al. | 2405.02044 | null |
2024-05-03 | Multi-Agent Coverage Control on Surfaces Using Conformal Mapping | Chao Zhai et.al. | 2405.02034 | null |
2024-05-03 | A Model-based Multi-Agent Personalized Short-Video Recommender System | Peilun Zhou et.al. | 2405.01847 | null |
2024-05-03 | SocialGFs: Learning Social Gradient Fields for Multi-Agent Reinforcement Learning | Qian Long et.al. | 2405.01839 | null |
2024-05-02 | Unconstraining Multi-Robot Manipulation: Enabling Arbitrary Constraints in ECBS with Bounded Sub-Optimality | Yorai Shaoul et.al. | 2405.01772 | null |
2024-05-02 | A Survey on Semantic Communication Networks: Architecture, Security, and Privacy | Shaolong Guo et.al. | 2405.01221 | null |
2024-05-02 | LOQA: Learning with Opponent Q-Learning Awareness | Milad Aghajohari et.al. | 2405.01035 | null |
2024-05-02 | Rare Collision Risk Estimation of Autonomous Vehicles with Multi-Agent Situation Awareness | Mahdieh Zaker et.al. | 2405.01011 | null |
2024-05-01 | MESA: Cooperative Meta-Exploration in Multi-Agent Learning through Exploiting State-Action Space Structure | Zhicheng Zhang et.al. | 2405.00902 | null |
2024-05-01 | Communication-Efficient Training Workload Balancing for Decentralized Multi-Agent Learning | Seyed Mahmoud Sajjadi Mohammadabadi et.al. | 2405.00839 | link |
2024-05-01 | ADM: Accelerated Diffusion Model via Estimated Priors for Robust Motion Prediction under Uncertainties | Jiahui Li et.al. | 2405.00797 | link |
2024-05-01 | A Distributed Model Identification Algorithm for Multi-Agent Systems | Vivek Khatana et.al. | 2405.00637 | null |
2024-05-01 | MF-OML: Online Mean-Field Reinforcement Learning with Occupation Measures for Large Population Games | Anran Hu et.al. | 2405.00282 | null |
2024-04-30 | MGCBS: An Optimal and Efficient Algorithm for Solving Multi-Goal Multi-Agent Path Finding Problem | Mingkai Tang et.al. | 2404.19518 | link |
2024-04-30 | Quasi-determinant and right eigenvalues of dual quaternion matrices | Chen Ling et.al. | 2404.19348 | null |
2024-04-30 | Provably Efficient Information-Directed Sampling Algorithms for Multi-Agent Reinforcement Learning | Qiaosheng Zhang et.al. | 2404.19292 | null |
2024-04-30 | MAP-Former: Multi-Agent-Pair Gaussian Joint Prediction | Marlon Steiner et.al. | 2404.19283 | null |
2024-04-29 | Sample-Efficient Robust Multi-Agent Reinforcement Learning in the Face of Environmental Uncertainty | Laixi Shi et.al. | 2404.18909 | null |
2024-04-29 | Multi-Agent Synchronization Tasks | Rolando Fernandez et.al. | 2404.18798 | null |
2024-04-29 | A geometric approach for stability analysis of delay systems: Applications to network dynamics | Shijie Zhou et.al. | 2404.18704 | null |
2024-04-29 | Anywhere: A Multi-Agent Framework for Reliable and Diverse Foreground-Conditioned Image Inpainting | Tianyidan Xie et.al. | 2404.18598 | null |
2024-04-28 | Using Deep Q-Learning to Dynamically Toggle between Push/Pull Actions in Computational Trust Mechanisms | Zoi Lygizou et.al. | 2404.18296 | null |
2024-04-28 | ATR-Mapping: Asymmetric Topological Representation based Mapping Framework for Multi-Robot Environment Exploration | Hao Zhang et.al. | 2404.18089 | null |
2024-04-30 | ComposerX: Multi-Agent Symbolic Music Composition with LLMs | Qixin Deng et.al. | 2404.18081 | link |
2024-04-27 | Advancing Healthcare Automation: Multi-Agent Systems for Medical Necessity Justification | Himanshu Pandey et.al. | 2404.17977 | null |
2024-04-27 | Verco: Learning Coordinated Verbal Communication for Multi-agent Reinforcement Learning | Dapeng Li et.al. | 2404.17780 | null |
2024-04-27 | UMass-BioNLP at MEDIQA-M3G 2024: DermPrompt – A Systematic Exploration of Prompt Engineering with GPT-4V for Dermatological Diagnosis | Parth Vashisht et.al. | 2404.17749 | link |
2024-04-26 | Quantum Multi-Agent Reinforcement Learning for Aerial Ad-hoc Networks | Theodora-Augustina Drăgan et.al. | 2404.17499 | null |
2024-04-26 | A multi-agent model of hierarchical decision dynamics | Paul Kinsler et.al. | 2404.17477 | null |
2024-04-26 | A Unified Debugging Approach via LLM-Based Multi-Agent Synergy | Cheryl Lee et.al. | 2404.17153 | link |
2024-04-25 | Evaluating Collaborative Autonomy in Opposed Environments using Maritime Capture-the-Flag Competitions | Jordan Beason et.al. | 2404.17038 | null |
2024-04-25 | AutoGenesisAgent: Self-Generating Multi-Agent Systems for Complex Tasks | Jeremy Harper et.al. | 2404.17017 | null |
2024-04-25 | Neural Interaction Energy for Multi-Agent Trajectory Prediction | Kaixin Shen et.al. | 2404.16579 | null |
2024-04-25 | Distributed Matrix Pencil Formulations for Prescribed-Time Leader-Following Consensus of MASs with Unknown Sensor Sensitivity | Hefu Ye et.al. | 2404.16412 | null |
2024-04-25 | Optimal and Bounded Suboptimal Any-Angle Multi-agent Pathfinding | Konstantin Yakovlev et.al. | 2404.16379 | null |
2024-04-24 | Scaling Lifelong Multi-Agent Path Finding to More Realistic Settings: Research Challenges and Opportunities | He Jiang et.al. | 2404.16162 | link |
2024-04-24 | Delay-Aware Multi-Agent Reinforcement Learning for Cooperative Adaptive Cruise Control with Model-based Stability Enhancement | Jiaqi Liu et.al. | 2404.15696 | null |
2024-04-24 | Decentralized Multi-Agent Trajectory Planning in Dynamic Environments with Spatiotemporal Occupancy Grid Maps | Siyuan Wu et.al. | 2404.15602 | null |
2024-04-24 | GRSN: Gated Recurrent Spiking Neurons for POMDPs and MARL | Lang Qin et.al. | 2404.15597 | null |
2024-04-24 | Multi-Agent Reinforcement Learning for Energy Networks: Computational Challenges, Progress and Open Problems | Sarah Keren et.al. | 2404.15583 | null |
2024-04-23 | BattleAgent: Multi-modal Dynamic Emulation on Historical Battles to Complement Historical Analysis | Shuhang Lin et.al. | 2404.15532 | link |
2024-04-23 | Adaptive Mechanism Design using Multi-Agent Revealed Preferences | Luke Snow et.al. | 2404.15391 | null |
2024-04-23 | From Space-Time to Space-Order: Directly Planning a Temporal Planning Graph by Redefining CBS | Yu Wu et.al. | 2404.15137 | null |
2024-04-23 | CT-Agent: Clinical Trial Multi-Agent with Large Language Model-based Reasoning | Ling Yue et.al. | 2404.14777 | null |
2024-04-23 | Bi-CL: A Reinforcement Learning Framework for Robots Coordination Through Bi-level Optimization | Zechen Hu et.al. | 2404.14649 | null |
2024-04-22 | Multi-Agent Hybrid SAC for Joint SS-DSA in CRNs | David R. Nickel et.al. | 2404.14319 | null |
2024-04-22 | Multi-agent Reinforcement Learning-based Joint Precoding and Phase Shift Optimization for RIS-aided Cell-Free Massive MIMO Systems | Yiyang Zhu et.al. | 2404.14092 | null |
2024-04-22 | Liquid-Graph Time-Constant Network for Multi-Agent Systems Control | Antonio Marino et.al. | 2404.13982 | null |
2024-04-22 | A survey of air combat behavior modeling using machine learning | Patrick Ribu Gorton et.al. | 2404.13954 | null |
2024-04-22 | Distributional Black-Box Model Inversion Attack with Multi-Agent Reinforcement Learning | Huan Bao et.al. | 2404.13860 | null |
2024-04-23 | Multi-AUV Cooperative Underwater Multi-Target Tracking Based on Dynamic-Switching-enabled Multi-Agent Reinforcement Learning | Shengbo Wang et.al. | 2404.13654 | null |
2024-04-20 | Large Language Models as Test Case Generators: Performance Evaluation and Enhancement | Kefan Li et.al. | 2404.13340 | null |
2024-04-19 | Resource Slicing with Cross-Cell Coordination in Satellite-Terrestrial Integrated Networks | Mingcheng He et.al. | 2404.13158 | null |
2024-04-19 | Private Agent-Based Modeling | Ayush Chopra et.al. | 2404.12983 | null |
2024-04-19 | MAexp: A Generic Platform for RL-based Multi-Agent Exploration | Shaohao Zhu et.al. | 2404.12824 | link |
2024-04-19 | LayeredMAPF: a decomposition of MAPF instance without compromising solvability | Zhuo Yao et.al. | 2404.12773 | link |
2024-04-19 | Grasper: A Generalist Pursuer for Pursuit-Evasion Problems | Pengdeng Li et.al. | 2404.12626 | link |
2024-04-19 | Stackelberg Game-Theoretic Learning for Collaborative Assembly Task Planning | Yuhan Zhao et.al. | 2404.12570 | link |
2024-04-18 | HalluciBot: Is There No Such Thing as a Bad Question? | William Watson et.al. | 2404.12535 | null |
2024-04-18 | Centralized vs. Decentralized Multi-Agent Reinforcement Learning for Enhanced Control of Electric Vehicle Charging Networks | Amin Shojaeighadikolaei et.al. | 2404.12520 | null |
2024-04-18 | Stability Certificates for Receding Horizon Games | Sophie Hall et.al. | 2404.12165 | null |
2024-04-18 | mABC: multi-Agent Blockchain-Inspired Collaboration for root cause analysis in micro-services architecture | Wei Zhang et.al. | 2404.12135 | link |
2024-04-18 | X-Light: Cross-City Traffic Signal Control Using Transformer on Transformer as Meta Multi-Agent Reinforcement Learner | Haoyuan Jiang et.al. | 2404.12090 | link |
2024-04-18 | Multi-Agent Relative Investment Games in a Jump Diffusion Market with Deep Reinforcement Learning Algorithm | Liwei Lu et.al. | 2404.11967 | null |
2024-04-18 | AgentCoord: Visually Exploring Coordination Strategy for LLM-based Multi-Agent Collaboration | Bo Pan et.al. | 2404.11943 | link |
2024-04-18 | JointPPO: Diving Deeper into the Effectiveness of PPO in Multi-Agent Reinforcement Learning | Chenxing Liu et.al. | 2404.11831 | null |
2024-04-17 | The Landscape of Emerging AI Agent Architectures for Reasoning, Planning, and Tool Calling: A Survey | Tula Masterman et.al. | 2404.11584 | null |
2024-04-17 | Open-Ended Wargames with Large Language Models | Daniel P. Hogan et.al. | 2404.11446 | link |
2024-04-17 | Towards Multi-agent Reinforcement Learning based Traffic Signal Control through Spatio-temporal Hypergraphs | Kang Wang et.al. | 2404.11014 | link |
2024-04-17 | Function Approximation for Reinforcement Learning Controller for Energy from Spread Waves | Soumyendu Sarkar et.al. | 2404.10991 | null |
2024-04-17 | Group-Aware Coordination Graph for Multi-Agent Reinforcement Learning | Wei Duan et.al. | 2404.10976 | link |
2024-04-16 | Sustainability of Data Center Digital Twins with Reinforcement Learning | Soumyendu Sarkar et.al. | 2404.10786 | link |
2024-04-16 | COMBO: Compositional World Models for Embodied Multi-Agent Cooperation | Hongxin Zhang et.al. | 2404.10775 | null |
2024-04-16 | N-Agent Ad Hoc Teamwork | Caroline Wang et.al. | 2404.10740 | link |
2024-04-16 | Randomized Exploration in Cooperative Multi-Agent Reinforcement Learning | Hao-Lun Hsu et.al. | 2404.10728 | null |
2024-04-16 | On the external concurrency of current BDI frameworks for MAS | Martina Baiardi et.al. | 2404.10397 | null |
2024-04-16 | Demonstration of DB-GPT: Next Generation Data Interaction System Empowered by Large Language Models | Siqiao Xue et.al. | 2404.10209 | link |
2024-04-15 | EgoPet: Egomotion and Interaction Data from an Animal’s Perspective | Amir Bar et.al. | 2404.09991 | null |
2024-04-15 | Memory Sharing for Large Language Model based Agents | Hang Gao et.al. | 2404.09982 | link |
2024-04-15 | Quality of Experience Oriented Cross-layer Optimization for Real-time XR Video Transmission | Guangjin Pan et.al. | 2404.09905 | null |
2024-04-15 | Effective Reinforcement Learning Based on Structural Information Principles | Xianghua Zeng et.al. | 2404.09760 | link |
2024-04-15 | Higher Replay Ratio Empowers Sample-Efficient Multi-Agent Reinforcement Learning | Linjie Xu et.al. | 2404.09715 | null |
2024-04-15 | Kernel-based learning with guarantees for multi-agent applications | Krzysztof Kowalczyk et.al. | 2404.09708 | null |
2024-04-14 | Correlated Mean Field Imitation Learning | Zhiyu Zhao et.al. | 2404.09324 | null |
2024-04-16 | Confidence Calibration and Rationalization for LLMs via Multi-Agent Deliberation | Ruixin Yang et.al. | 2404.09127 | link |
2024-04-13 | Developing An Attention-Based Ensemble Learning Framework for Financial Portfolio Optimisation | Zhenglong Li et.al. | 2404.08935 | null |
2024-04-12 | Leveraging Multi-AI Agents for Cross-Domain Knowledge Discovery | Shiva Aryal et.al. | 2404.08511 | null |
2024-04-12 | Multi-Agent eXperimenter (MAX) | Önder Gürcan et.al. | 2404.08398 | null |
2024-04-11 | Multi-Robot Target Tracking with Sensing and Communication Danger Zones | Jiazhen Li et.al. | 2404.07880 | link |
2024-04-11 | The Role of Confidence for Trust-based Resilient Consensus (Extended Version) | Luca Ballotta et.al. | 2404.07838 | null |
2024-04-11 | Achieving violation-free distributed optimization under coupling constraints | Changxin Liu et.al. | 2404.07609 | null |
2024-04-11 | A continuous-time violation-free multi-agent optimization algorithm and its applications to safe distributed control | Xiao Tan et.al. | 2404.07571 | null |
2024-04-11 | Differentially Private Reinforcement Learning with Self-Play | Dan Qiao et.al. | 2404.07559 | null |
2024-04-11 | Security Modelling for Cyber-Physical Systems: A Systematic Literature Review | Shaofei Huang et.al. | 2404.07527 | null |
2024-04-11 | UAV-enabled Collaborative Beamforming via Multi-Agent Deep Reinforcement Learning | Saichao Liu et.al. | 2404.07453 | null |
2024-04-10 | Multi-Agent Soft Actor-Critic with Global Loss for Autonomous Mobility-on-Demand Fleet Control | Zeno Woywood et.al. | 2404.06975 | link |
2024-04-09 | Large Language Models to the Rescue: Deadlock Resolution in Multi-Robot Systems | Kunal Garg et.al. | 2404.06413 | null |
2024-04-09 | The Power in Communication: Power Regularization of Communication for Autonomy in Cooperative Multi-Agent Reinforcement Learning | Nancirose Piazza et.al. | 2404.06387 | null |
2024-04-11 | The turnpike property for high-dimensional interacting agent systems in discrete time | Martin Gugat et.al. | 2404.06134 | null |
2024-04-09 | Multi-Agent Coverage Control with Transient Behavior Consideration | Runyu Zhang et.al. | 2404.05995 | null |
2024-04-08 | Attention-Driven Multi-Agent Reinforcement Learning: Enhancing Decisions with Expertise-Informed Tasks | Andre R Kuroswiski et.al. | 2404.05840 | null |
2024-04-08 | 360°REA: Towards A Reusable Experience Accumulation with 360° Assessment for Multi-Agent System | Shen Gao et.al. | 2404.05569 | link |
2024-04-08 | Towards Objectively Benchmarking Social Intelligence for Language Agents at Action Level | Chenxu Wang et.al. | 2404.05337 | link |
2024-04-08 | ITA-ECBS: A Bounded-Suboptimal Algorithm for Combined Target-Assignment and Path-Finding Problem | Yimin Tang et.al. | 2404.05223 | link |
2024-04-08 | Multi-agent Long-term 3D Human Pose Forecasting via Interaction-aware Trajectory Conditioning | Jaewoo Jeong et.al. | 2404.05218 | link |
2024-04-07 | Graph Neural Network Meets Multi-Agent Reinforcement Learning: Fundamentals, Applications, and Future Directions | Ziheng Liu et.al. | 2404.04898 | null |
2024-04-07 | LLM-Based Multi-Agent Systems for Software Engineering: Vision and the Road Ahead | Junda He et.al. | 2404.04834 | null |
2024-04-06 | Challenges Faced by Large Language Models in Solving Multi-Agent Flocking | Peihan Li et.al. | 2404.04752 | null |
2024-04-06 | MACM: Utilizing a Multi-Agent System for Condition Mining in Solving Complex Mathematical Problems | Bin Lei et.al. | 2404.04735 | link |
2024-04-06 | The Case for Developing a Foundation Model for Planning-like Tasks from Scratch | Biplav Srivastava et.al. | 2404.04540 | null |
2024-04-05 | ROMA-iQSS: An Objective Alignment Approach via State-Based Value Learning and ROund-Robin Multi-Agent Scheduling | Chi-Hui Lin et.al. | 2404.03984 | null |
2024-04-05 | Heterogeneous Multi-Agent Reinforcement Learning for Zero-Shot Scalable Collaboration | Xudong Guo et.al. | 2404.03869 | null |
2024-04-04 | SLS-BRD: A system-level approach to seeking generalised feedback Nash equilibria | Otacilio B. L. Neto et.al. | 2404.03809 | null |
2024-04-04 | Legible and Proactive Robot Planning for Prosocial Human-Robot Interactions | Jasper Geldenbott et.al. | 2404.03734 | link |
2024-04-04 | Laser Learning Environment: A new environment for coordination-critical multi-agent tasks | Yannick Molinghen et.al. | 2404.03596 | link |
2024-04-04 | No Panacea in Planning: Algorithm Selection for Suboptimal Multi-Agent Path Finding | Weizhe Chen et.al. | 2404.03554 | null |
2024-04-04 | Design of Stickbug: a Six-Armed Precision Pollination Robot | Trevor Smith et.al. | 2404.03489 | link |
2024-04-04 | MEDIATE: Mutually Endorsed Distributed Incentive Acknowledgment Token Exchange | Philipp Altmann et.al. | 2404.03431 | null |
2024-04-04 | Decentralized Learning Strategies for Estimation Error Minimization with Graph Neural Networks | Xingran Chen et.al. | 2404.03227 | null |
2024-04-03 | MARL-LNS: Cooperative Multi-agent Reinforcement Learning via Large Neighborhoods Search | Weizhe Chen et.al. | 2404.03101 | null |
2024-04-03 | Learn to Disguise: Avoid Refusal Responses in LLM’s Defense via a Multi-agent Attacker-Disguiser Game | Qianqiao Xu et.al. | 2404.02532 | null |
2024-04-03 | Versatile Scene-Consistent Traffic Scenario Generation as Optimization with Diffusion | Zhiyu Huang et.al. | 2404.02524 | null |
2024-04-03 | Measuring Social Norms of Large Language Models | Ye Yuan et.al. | 2404.02491 | null |
2024-04-02 | Task-priority Intermediated Hierarchical Distributed Policies: Reinforcement Learning of Adaptive Multi-robot Cooperative Transport | Yusei Naito et.al. | 2404.02362 | null |
2024-04-02 | EnergAIze: Multi Agent Deep Deterministic Policy Gradient for Vehicle to Grid Energy Management | Tiago Fonseca et.al. | 2404.02361 | null |
2024-04-02 | Federated Multi-Agent Mapping for Planetary Exploration | Tiberiu-Ioan Szatmari et.al. | 2404.02289 | null |
2024-04-02 | Self-Organized Agents: A LLM Multi-Agent Framework toward Ultra Large-Scale Code Generation and Optimization | Yoichi Ishibashi et.al. | 2404.02183 | link |
2024-04-02 | Risk-Aware Real-Time Task Allocation for Stochastic Multi-Agent Systems under STL Specifications | Maico H. W. Engelaar et.al. | 2404.02111 | null |
2024-04-02 | Emergence of Chemotactic Strategies with Multi-Agent Reinforcement Learning | Samuel Tovey et.al. | 2404.01999 | null |
2024-04-02 | Learning-Based Joint Beamforming and Antenna Movement Design for Movable Antenna Systems | Caihao Weng et.al. | 2404.01784 | null |
2024-04-02 | CMAT: A Multi-Agent Collaboration Tuning Framework for Enhancing Small Language Models | Xuechen Liang et.al. | 2404.01663 | link |
2024-04-02 | InsightLens: Discovering and Exploring Insights from Conversational Contexts in Large-Language-Model-Powered Data Analysis | Luoxuan Weng et.al. | 2404.01644 | null |
2024-04-02 | Collaborative Optimization of Wireless Communication and Computing Resource Allocation based on Multi-Agent Federated Weighting Deep Reinforcement Learning | Junjie Wu et.al. | 2404.01638 | null |
2024-04-02 | Helmsman of the Masses? Evaluate the Opinion Leadership of Large Language Models in the Werewolf Game | Silin Du et.al. | 2404.01602 | link |
2024-04-02 | Distributed Autonomous Swarm Formation for Dynamic Network Bridging | Raffaele Galliera et.al. | 2404.01557 | null |
2024-04-02 | Multi-Agent Reinforcement Learning with Control-Theoretic Safety Guarantees for Dynamic Network Bridging | Raffaele Galliera et.al. | 2404.01551 | null |
2024-04-01 | LLM as a Mastermind: A Survey of Strategic Reasoning with Large Language Models | Yadong Zhang et.al. | 2404.01230 | null |
2024-03-29 | Improving Learnt Local MAPF Policies with Heuristic Search | Rishi Veerapaneni et.al. | 2403.20300 | null |
2024-03-29 | Decentralized Multimedia Data Sharing in IoV: A Learning-based Equilibrium of Supply and Demand | Jiani Fan et.al. | 2403.20218 | null |
2024-03-29 | A Learning-based Incentive Mechanism for Mobile AIGC Service in Decentralized Internet of Vehicles | Jiani Fan et.al. | 2403.20151 | null |
2024-03-29 | CtRL-Sim: Reactive and Controllable Driving Agents with Offline Reinforcement Learning | Luke Rowe et.al. | 2403.19918 | null |
2024-03-28 | Enhancing Anomaly Detection in Financial Markets with an LLM-based Multi-Agent Framework | Taejin Park et.al. | 2403.19735 | null |
2024-03-28 | Human-compatible driving partners through data-regularized self-play reinforcement learning | Daphne Cornelisse et.al. | 2403.19648 | link |
2024-03-28 | Energy-Optimal Multi-Agent Navigation as a Strategic-Form Game | Logan Beaver et.al. | 2403.19641 | null |
2024-03-28 | Base-extension Semantics for S5 Modal Logic | Timo Eckhardt et.al. | 2403.19431 | null |
2024-03-28 | Multi-Agent Team Access Monitoring: Environments that Benefit from Target Information Sharing | Andrew Dudash et.al. | 2403.19375 | null |
2024-03-28 | MATEval: A Multi-Agent Discussion Framework for Advancing Open-Ended Text Evaluation | Yu Li et.al. | 2403.19305 | link |
2024-03-28 | MineLand: Simulating Large-Scale Multi-Agent Interactions with Limited Multimodal Senses and Physical Needs | Xianhao Yu et.al. | 2403.19267 | link |
2024-03-28 | Inferring Latent Temporal Sparse Coordination Graph for Multi-Agent Reinforcement Learning | Wei Duan et.al. | 2403.19253 | link |
2024-03-27 | Distributed Maximum Consensus over Noisy Links | Ehsan Lari et.al. | 2403.18509 | null |
2024-03-27 | Distributed Feedback Optimization of Linear Multi-agent Systems | Amir Mehrnoosh et.al. | 2403.18386 | null |
2024-03-27 | Fault-tolerant properties of scale-free linear protocols for synchronization of homogeneous multi-agent systems | Anton A. Stoorvogel et.al. | 2403.18200 | null |
2024-03-26 | A Real-Time Rescheduling Algorithm for Multi-robot Plan Execution | Ying Feng et.al. | 2403.18145 | link |
2024-03-26 | Generalizing Better Response Paths and Weakly Acyclic Games | Bora Yongacoglu et.al. | 2403.18086 | null |
2024-03-26 | Paths to Equilibrium in Normal-Form Games | Bora Yongacoglu et.al. | 2403.18079 | null |
2024-03-26 | Self-Clustering Hierarchical Multi-Agent Reinforcement Learning with Extensible Cooperation Graph | Qingxu Fu et.al. | 2403.18056 | null |
2024-03-26 | MAGIS: LLM-Based Multi-Agent Framework for GitHub Issue Resolution | Wei Tao et.al. | 2403.17927 | null |
2024-03-26 | Multi-Agent Clarity-Aware Dynamic Coverage with Gaussian Processes | Devansh R. Agrawal et.al. | 2403.17917 | null |
2024-03-26 | CMP: Cooperative Motion Prediction with Multi-Agent Communication | Zhuoyuan Wu et.al. | 2403.17916 | null |
2024-03-26 | Multi-Agent Resilient Consensus under Intermittent Faulty and Malicious Transmissions (Extended Version) | Sarper Aydın et.al. | 2403.17907 | null |
2024-03-26 | Multi Agent Pathfinding for Noise Restricted Hybrid Fuel Unmanned Aerial Vehicles | Drew Scott et.al. | 2403.17849 | null |
2024-03-26 | Scenario-Based Curriculum Generation for Multi-Agent Autonomous Driving | Axel Brunnbauer et.al. | 2403.17805 | link |
2024-03-26 | Prioritize Team Actions: Multi-Agent Temporal Logic Task Planning with Ordering Constraints | Bowen Ye et.al. | 2403.17704 | null |
2024-03-26 | PeersimGym: An Environment for Solving the Task Offloading Problem with Reinforcement Learning | Frederico Metelo et.al. | 2403.17637 | link |
2024-03-26 | LASIL: Learner-Aware Supervised Imitation Learning For Long-term Microscopic Traffic Simulation | Ke Guo et.al. | 2403.17601 | link |
2024-03-26 | Sharing the Cost of Success: A Game for Evaluating and Learning Collaborative Multi-Agent Instruction Giving and Following Policies | Philipp Sadler et.al. | 2403.17497 | link |
2024-03-25 | Harnessing the power of LLMs for normative reasoning in MASs | Bastin Tony Roy Savarimuthu et.al. | 2403.16524 | null |
2024-03-25 | Norm Violation Detection in Multi-Agent Systems using Large Language Models: A Pilot Study | Shawn He et.al. | 2403.16517 | null |
2024-03-25 | Towards Automatic Evaluation for LLMs’ Clinical Capabilities: Metric, Data, and Algorithm | Lei Liu et.al. | 2403.16446 | null |
2024-03-25 | AgentFL: Scaling LLM-based Fault Localization to Project-Level Context | Yihao Qin et.al. | 2403.16362 | null |
2024-03-24 | Social Deliberation vs. Social Contracts in Self-Governing Voluntary Organisations | Matthew Scott et.al. | 2403.16329 | null |
2024-03-24 | Q-adaptive: A Multi-Agent Reinforcement Learning Based Routing on Dragonfly Network | Yao Kang et.al. | 2403.16301 | null |
2024-03-24 | Ultra Low-Cost Two-Stage Multimodal System for Non-Normative Behavior Detection | Albert Lu et.al. | 2403.16151 | null |
2024-03-24 | Specifying Agent Ethics (Blue Sky Ideas) | Louise A. Dennis et.al. | 2403.16100 | null |
2024-03-24 | V2X-Real: a Largs-Scale Dataset for Vehicle-to-Everything Cooperative Perception | Hao Xiang et.al. | 2403.16034 | null |
2024-03-24 | MQE: Unleashing the Power of Interaction with Multi-agent Quadruped Environment | Ziyan Xiong et.al. | 2403.16015 | link |
2024-03-22 | Collaborative AI Teaming in Unknown Environments via Active Goal Deduction | Zuyuan Zhang et.al. | 2403.15341 | null |
2024-03-22 | Blockchain-based Pseudonym Management for Vehicle Twin Migrations in Vehicular Edge Metaverse | Jiawen Kang et.al. | 2403.15285 | null |
2024-03-22 | An Agent-Centric Perspective on Norm Enforcement and Sanctions | Elena Yan et.al. | 2403.15128 | link |
2024-03-22 | A Picture Is Worth a Graph: Blueprint Debate on Graph for Multimodal Reasoning | Changmeng Zheng et.al. | 2403.14972 | link |
2024-03-21 | Multi-Agent VQA: Exploring Multi-Agent Foundation Models in Zero-Shot Visual Question Answering | Bowen Jiang et.al. | 2403.14783 | link |
2024-03-21 | Multi-agent Task-Driven Exploration via Intelligent Map Compression and Sharing | Evangelos Psomiadis et.al. | 2403.14780 | null |
2024-03-21 | Co-Optimization of Environment and Policies for Decentralized Multi-Agent Navigation | Zhan Gao et.al. | 2403.14583 | null |
2024-03-22 | Learning Hierarchical Control For Multi-Agent Capacity-Constrained Systems | Charlott Vallon et.al. | 2403.14545 | null |
2024-03-21 | Emergent communication and learning pressures in language models: a language evolution perspective | Lukas Galke et.al. | 2403.14427 | null |
2024-03-21 | Exploiting Over-The-Air Consensus for Collision Avoidance and Formation Control in Multi-Agent Systems | Michael Epp et.al. | 2403.14386 | null |
2024-03-21 | A Control Barrier Function Composition Approach for Multi-Agent Systems in Marine Applications | Yujia Yang et.al. | 2403.14369 | null |
2024-03-21 | Adversary-Augmented Simulation to evaluate client-fairness on HyperLedger Fabric | Erwan Mahe et.al. | 2403.14342 | null |
2024-03-21 | ERD: A Framework for Improving LLM Reasoning for Cognitive Distortion Classification | Sehee Lim et.al. | 2403.14255 | null |
2024-03-21 | Carbon Footprint Reduction for Sustainable Data Centers in Real-Time | Soumyendu Sarkar et.al. | 2403.14092 | null |
2024-03-20 | Performance-Guaranteed Solutions for Multi-Agent Optimal Coverage Problems using Submodularity, Curvature, and Greedy Algorithms | Shirantha Welikala et.al. | 2403.14028 | null |
2024-03-20 | Motion Prediction of Multi-agent systems with Multi-view clustering | Anegi James et.al. | 2403.13905 | null |
2024-03-20 | Hyper Strategy Logic | Raven Beutner et.al. | 2403.13741 | null |
2024-03-20 | Large Language Models meet Network Slicing Management and Orchestration | Abdulhalim Dandoush et.al. | 2403.13721 | null |
2024-03-20 | Multi-agent Reinforcement Traffic Signal Control based on Interpretable Influence Mechanism and Biased ReLU Approximation | Zhiyue Luo et.al. | 2403.13639 | null |
2024-03-20 | Distributed Cooperative Formation Control of Nonlinear Multi-Agent System (UGV) Using Neural Network | Si Kheang Moeurn et.al. | 2403.13473 | null |
2024-03-20 | Agent Group Chat: An Interactive Group Chat Simulacra For Better Eliciting Collective Emergent Behavior | Zhouhong Gu et.al. | 2403.13433 | link |
2024-03-20 | Caching-Augmented Lifelong Multi-Agent Path Finding | Yimin Tang et.al. | 2403.13421 | link |
2024-03-20 | Robotics meets Fluid Dynamics: A Characterization of the Induced Airflow around a Quadrotor | Leonard Bauersfeld et.al. | 2403.13321 | null |
2024-03-20 | Mora: Enabling Generalist Video Generation via A Multi-Agent Framework | Zhengqing Yuan et.al. | 2403.13248 | link |
2024-03-19 | Graph Neural Network-based Multi-agent Reinforcement Learning for Resilient Distributed Coordination of Multi-Robot Systems | Anthony Goeckner et.al. | 2403.13093 | link |
2024-03-19 | NN-ETM: Enabling safe neural network-based event-triggering mechanisms for consensus problems | Irene Perez-Salesa et.al. | 2403.12567 | link |
2024-03-19 | Embodied LLM Agents Learn to Cooperate in Organized Teams | Xudong Guo et.al. | 2403.12482 | link |
2024-03-19 | Online Multi-Agent Pickup and Delivery with Task Deadlines | Hiroya Makino et.al. | 2403.12377 | null |
2024-03-19 | MARPF: Multi-Agent and Multi-Rack Path Finding | Hiroya Makino et.al. | 2403.12376 | null |
2024-03-18 | Routing and Scheduling in Answer Set Programming applied to Multi-Agent Path Finding: Preliminary Report | Roland Kaminski et.al. | 2403.12153 | null |
2024-03-18 | How Far Are We on the Decision-Making of LLMs? Evaluating LLMs’ Gaming Ability in Multi-Agent Environments | Jen-tse Huang et.al. | 2403.11807 | link |
2024-03-18 | Diffusion-Based Environment-Aware Trajectory Prediction | Theodor Westny et.al. | 2403.11643 | null |
2024-03-18 | Can LLM-Augmented autonomous agents cooperate?, An evaluation of their cooperative capabilities through Melting Pot | Manuel Mosquera et.al. | 2403.11381 | link |
2024-03-19 | V2X-DGW: Domain Generalization for Multi-agent Perception under Adverse Weather Conditions | Baolu Li et.al. | 2403.11371 | null |
2024-03-17 | Independent RL for Cooperative-Competitive Agents: A Mean-Field Perspective | Muhammad Aneeq uz Zaman et.al. | 2403.11345 | null |
2024-03-17 | Bridging the Gap between Discrete Agent Strategies in Game Theory and Continuous Motion Planning in Dynamic Environments | Hongrui Zheng et.al. | 2403.11334 | null |
2024-03-16 | A Scalable and Parallelizable Digital Twin Framework for Sustainable Sim2Real Transition of Multi-Agent Reinforcement Learning Systems | Chinmay Vilas Samak et.al. | 2403.10996 | null |
2024-03-16 | Diffusion-Reinforcement Learning Hierarchical Motion Planning in Adversarial Multi-agent Games | Zixuan Wu et.al. | 2403.10794 | link |
2024-03-16 | Fully Distributed Cooperative Multi-agent Underwater Obstacle Avoidance Under Dog Walking Paradigm | Kanzhong Yao et.al. | 2403.10759 | null |
2024-03-15 | Virtual Elastic Tether: a New Approach for Multi-agent Navigation in Confined Aquatic Environments | Kanzhong Yao et.al. | 2403.10629 | null |
2024-03-15 | Single- and Multi-Agent Private Active Sensing: A Deep Neuroevolution Approach | George Stamatelis et.al. | 2403.10112 | null |
2024-03-15 | What Makes Good Collaborative Views? Contrastive Mutual Information Maximization for Multi-Agent Perception | Wanfang Su et.al. | 2403.10068 | link |
2024-03-14 | Uncertainty Estimation in Multi-Agent Distributed Learning for AI-Enabled Edge Devices | Gleb Radchenko et.al. | 2403.09141 | null |
2024-03-13 | Beyond Joint Demonstrations: Personalized Expert Guidance for Efficient Multi-Agent Reinforcement Learning | Peihong Yu et.al. | 2403.08936 | null |
2024-03-13 | Cultural evolution in populations of Large Language Models | Jérémy Perez et.al. | 2403.08882 | link |
2024-03-13 | Multi-Objective Optimization Using Adaptive Distributed Reinforcement Learning | Jing Tan et.al. | 2403.08879 | null |
2024-03-13 | Hierarchical Auto-Organizing System for Open-Ended Multi-Agent Navigation | Zhonghan Zhao et.al. | 2403.08282 | null |
2024-03-13 | Emergence of Social Norms in Large Language Model-based Agent Societies | Siyue Ren et.al. | 2403.08251 | link |
2024-03-13 | Object Permanence Filter for Robust Tracking with Interactive Robots | Shaoting Peng et.al. | 2403.08231 | null |
2024-03-13 | SpaceOctopus: An Octopus-inspired Motion Planning Framework for Multi-arm Space Robot | Wenbo Zhao et.al. | 2403.08219 | null |
2024-03-15 | Transforming Competition into Collaboration: The Revolutionary Role of Multi-Agent Systems and Language Models in Modern Organizations | Carlos Jose Xavier Cruz et.al. | 2403.07769 | link |
2024-03-12 | Asynchronous Approximate Byzantine Consensus: A Multi-hop Relay Method and Tight Graph Conditions | Liwei Yuan et.al. | 2403.07640 | null |
2024-03-12 | Ensembling Prioritized Hybrid Policies for Multi-agent Pathfinding | Huijie Tang et.al. | 2403.07559 | link |
2024-03-11 | RaceMOP: Mapless Online Path Planning for Multi-Agent Autonomous Racing using Residual Policy Learning | Raphael Trumpp et.al. | 2403.07129 | link |
2024-03-11 | Generalising Multi-Agent Cooperation through Task-Agnostic Communication | Dulhan Jayalath et.al. | 2403.06750 | link |
2024-03-11 | Decentralized and Lifelong-Adaptive Multi-Agent Collaborative Learning | Shuo Tang et.al. | 2403.06535 | null |
2024-03-12 | DeepSafeMPC: Deep Learning-Based Model Predictive Control for Safe Multi-Agent Reinforcement Learning | Xuefeng Wang et.al. | 2403.06397 | null |
2024-03-10 | ArgMed-Agents: Explainable Clinical Decision Reasoning with Large Language Models via Argumentation Schemes | Shengxin Hong et.al. | 2403.06294 | null |
2024-03-09 | MATRIX: Multi-Agent Trajectory Generation with Diverse Contexts | Zhuo Xu et.al. | 2403.06041 | null |
2024-03-09 | Deep Reinforcement Learning Enhanced Rate-Splitting Multiple Access for Interference Mitigation | Osman Nuri Irkicatal et.al. | 2403.05974 | null |
2024-03-09 | Scaling Team Coordination on Graphs with Reinforcement Learning | Manshi Limbu et.al. | 2403.05787 | null |
2024-03-08 | Tapilot-Crossing: Benchmarking and Evolving LLMs Towards Interactive Data Analysis Agents | Jinyang Li et.al. | 2403.05307 | link |
2024-03-08 | Engineering consensus in static networks with unknown disruptors | Agathe Bouis et.al. | 2403.05272 | null |
2024-03-08 | ActFormer: Scalable Collaborative Perception via Active Queries | Suozhi Huang et.al. | 2403.04968 | null |
2024-03-07 | iTRPL: An Intelligent and Trusted RPL Protocol based on Multi-Agent Reinforcement Learning | Debasmita Dey et.al. | 2403.04416 | null |
2024-03-07 | Cooperative Task Execution in Multi-Agent Systems | Karishma et.al. | 2403.04370 | null |
2024-03-07 | LitSim: Conflict-aware Policy for Long-term Interactive Traffic Simulation | Haojie Xin et.al. | 2403.04299 | null |
2024-03-08 | Dynamics of Moral Behavior in Heterogeneous Populations of Learning Agents | Elizaveta Tennant et.al. | 2403.04202 | link |
2024-03-06 | Population-aware Online Mirror Descent for Mean-Field Games by Deep Reinforcement Learning | Zida Wu et.al. | 2403.03552 | null |
2024-03-08 | Discrete Consensus-Based Optimization | Junhyeok Byeon et.al. | 2403.03430 | null |
2024-03-05 | Collision Avoidance Verification of Multiagent Systems with Learned Policies | Zihao Dong et.al. | 2403.03314 | link |
2024-03-05 | Reaching Consensus in Cooperative Multi-Agent Reinforcement Learning with Goal Imagination | Liangzhou Wang et.al. | 2403.03172 | null |
2024-03-05 | Equilibria in Two-Stage Facility Location with Atomic Clients | Simon Krogmann et.al. | 2403.03114 | null |
2024-03-05 | Distributed Policy Gradient for Linear Quadratic Networked Control with Limited Communication Range | Yuzi Yan et.al. | 2403.03055 | null |
2024-03-05 | OPEx: A Component-Wise Analysis of LLM-Centric Agents in Embodied Instruction Following | Haochen Shi et.al. | 2403.03017 | null |
2024-03-05 | SimuCourt: Building Judicial Decision-Making Agents with Real-world Judgement Documents | Zhitao He et.al. | 2403.02959 | link |
2024-03-05 | PPS-QMIX: Periodically Parameter Sharing for Accelerating Convergence of Multi-Agent Reinforcement Learning | Ke Zhang et.al. | 2403.02635 | link |
2024-03-05 | Privacy in Multi-agent Systems | Yongqiang Wang et.al. | 2403.02631 | null |
2024-03-04 | A Multi-agent Reinforcement Learning Study of Evolution of Communication and Teaching under Libertarian and Utilitarian Governing Systems | Aslan S. Dizaji et.al. | 2403.02369 | link |
2024-03-04 | VITAMIN: A Compositional Framework for Model Checking of Multi-Agent Systems | Angelo Ferrando et.al. | 2403.02170 | null |
2024-03-04 | SMAUG: A Sliding Multidimensional Task Window-Based MARL Framework for Adaptive Real-Time Subtask Recognition | Wenjing Zhang et.al. | 2403.01816 | null |
2024-03-02 | A Communication-Efficient Stochastic Gradient Descent Algorithm for Distributed Nonconvex Optimization | Antai Xie et.al. | 2403.01322 | null |
2024-03-02 | Efficient Episodic Memory Utilization of Cooperative Multi-Agent Reinforcement Learning | Hyungho Na et.al. | 2403.01112 | link |
2024-03-01 | Composite Distributed Learning and Synchronization of Nonlinear Multi-Agent Systems with Complete Uncertain Dynamics | Emadodin Jandaghi et.al. | 2403.00987 | null |
2024-02-29 | Offline Fictitious Self-Play for Competitive Games | Jingxiao Chen et.al. | 2403.00841 | null |
2024-03-01 | Event-Triggered Robust Cooperative Output Regulation for a Class of Linear Multi-Agent Systems with an Unknown Exosystem | Yangyang Qian et.al. | 2403.00645 | null |
2024-03-01 | Imitation Learning Datasets: A Toolkit For Creating Datasets, Training Agents and Benchmarking | Nathan Gavenski et.al. | 2403.00550 | link |
2024-03-01 | Robustifying a Policy in Multi-Agent RL with Diverse Cooperative Behavior and Adversarial Style Sampling for Assistive Tasks | Tayuki Osa et.al. | 2403.00344 | null |
2024-03-01 | Mode Consensus Algorithms With Finite Convergence Time | Chao Huang et.al. | 2403.00221 | null |
2024-02-29 | Causal Graph ODE: Continuous Treatment Effect Modeling in Multi-agent Dynamical Systems | Zijie Huang et.al. | 2403.00178 | null |
2024-02-29 | Understanding Iterative Combinatorial Auction Designs via Multi-Agent Reinforcement Learning | Greg d’Eon et.al. | 2402.19420 | link |
2024-02-29 | Energy-Efficient UAV Swarm Assisted MEC with Dynamic Clustering and Scheduling | Jialiuyuan Li et.al. | 2402.18936 | null |
2024-02-28 | Timer-Based Coverage Control for Mobile Sensors | Federico M. Zegers et.al. | 2402.18744 | null |
2024-02-28 | Beyond Natural Language: LLMs Leveraging Alternative Formats for Enhanced Reasoning and Communication | Weize Chen et.al. | 2402.18439 | link |
2024-02-28 | Solving Multi-Entity Robotic Problems Using Permutation Invariant Neural Networks | Tianxu An et.al. | 2402.18345 | null |
2024-02-28 | Rethinking the Bounds of LLM Reasoning: Are Multi-Agent Discussions the Key? | Qineng Wang et.al. | 2402.18272 | null |
2024-02-28 | Human Simulacra: A Step toward the Personification of Large Language Models | Qiuejie Xie et.al. | 2402.18180 | link |
2024-03-01 | Imagine, Initialize, and Explore: An Effective Exploration Method in Multi-Agent Reinforcement Learning | Zeyang Liu et.al. | 2402.17978 | null |
2024-02-27 | Independent Learning in Constrained Markov Potential Games | Philip Jordan et.al. | 2402.17885 | link |
2024-02-27 | Multi-Agent Deep Reinforcement Learning for Distributed Satellite Routing | Federico Lozano-Cuadra et.al. | 2402.17666 | null |
2024-02-27 | A Multi-Agent Model for Opinion Evolution under Cognitive Biases | Mário S. Alvim et.al. | 2402.17615 | null |
2024-02-27 | Corridor MPC for Multi-Agent Inspection of Orbiting Structures | Gregorio Marchesini et.al. | 2402.17596 | null |
2024-02-27 | Communication-Constrained STL Task Decomposition through Convex Optimization | Gregorio Marchesini et.al. | 2402.17585 | null |
2024-02-27 | Nissist: An Incident Mitigation Copilot based on Troubleshooting Guides | Kaikai An et.al. | 2402.17531 | null |
2024-02-27 | Multi-Agent, Human-Agent and Beyond: A Survey on Cooperation in Social Dilemmas | Hao Guo et.al. | 2402.17270 | null |
2024-02-27 | Reinforcement Learning Based Robust Volt/Var Control in Active Distribution Networks With Imprecisely Known Delay | Hong Cheng et.al. | 2402.17268 | null |
2024-02-27 | Inverse Optimal Control for Linear Quadratic Tracking with Unknown Target States | Yao Li et.al. | 2402.17247 | null |
2024-02-27 | Large Language Model for Participatory Urban Planning | Zhilun Zhou et.al. | 2402.17161 | null |
2024-02-26 | Navigating Complexity: Orchestrated Problem Solving with Multi-Agent LLMs | Sumedh Rasal et.al. | 2402.16713 | null |
2024-02-26 | LLMArena: Assessing Capabilities of Large Language Models in Dynamic Multi-Agent Environments | Junzhe Chen et.al. | 2402.16499 | link |
2024-02-26 | Distributed Finite-time Differentiator for Multi-agent Systems Under Directed Graph | Weile Chen et.al. | 2402.16260 | null |
2024-02-26 | Scaling Robust Optimization for Multi-Agent Robotic Systems: A Distributed Perspective | Arshiya Taj Abdul et.al. | 2402.16227 | null |
2024-02-24 | Optimality of weighted contracts for multi-agent contract design with a budget | Sumit Goel et.al. | 2402.15890 | null |
2024-02-24 | On the Redistribution of Maximal Extractable Value: A Dynamic Mechanism | Pedro Braga et.al. | 2402.15849 | null |
2024-02-24 | Reward Design for Justifiable Sequential Decision-Making | Aleksa Sukovic et.al. | 2402.15826 | link |
2024-02-24 | Cooperation and Control in Delegation Games | Oliver Sourbut et.al. | 2402.15821 | null |
2024-02-23 | HiMAP: Learning Heuristics-Informed Policies for Large-Scale Multi-Agent Pathfinding | Huijie Tang et.al. | 2402.15546 | link |
2024-02-23 | Shapley Value Based Multi-Agent Reinforcement Learning: Theory, Method and Its Application to Energy Network | Jianhong Wang et.al. | 2402.15324 | null |
2024-02-23 | DEEM: Dynamic Experienced Expert Modeling for Stance Detection | Xiaolong Wang et.al. | 2402.15264 | link |
2024-02-23 | Multi-Agent Collaboration Framework for Recommender Systems | Zhefan Wang et.al. | 2402.15235 | link |
2024-02-23 | Optimal mesh generation for a non-iterative grid-converged solution of flow through a blade passage using deep reinforcement learning | Innyoung Kim et.al. | 2402.15079 | null |
2024-02-23 | Analyzing Games in Maker Protocol Part One: A Multi-Agent Influence Diagram Approach Towards Coordination | Abhimanyu Nag et.al. | 2402.15037 | null |
2024-02-23 | A Decision-Language Model (DLM) for Dynamic Restless Multi-Armed Bandit Tasks in Public Health | Nikhil Behari et.al. | 2402.14807 | null |
2024-02-22 | We Choose to Go to Space: Agent-driven Human and Multi-Robot Collaboration in Microgravity | Miao Xin et.al. | 2402.14299 | null |
2024-02-22 | Parking of Connected Automated Vehicles: Vehicle Control, Parking Assignment, and Multi-agent Simulation | Xu Shen et.al. | 2402.14183 | null |
2024-02-21 | AgentScope: A Flexible yet Robust Multi-Agent Platform | Dawei Gao et.al. | 2402.14034 | link |
2024-02-21 | Multi-Agent Online Graph Exploration on Cycles and Tadpole Graphs | Erik van den Akker et.al. | 2402.13845 | null |
2024-02-21 | Multi-Agent Contract Design beyond Binary Actions | Federico Cacciamani et.al. | 2402.13824 | null |
2024-02-21 | Learning to Model Diverse Driving Behaviors in Highly Interactive Autonomous Driving Scenarios with Multi-Agent Reinforcement Learning | Liu Weiwei et.al. | 2402.13481 | null |
2024-02-21 | A Neuro-Symbolic Approach to Multi-Agent RL for Interpretability and Probabilistic Decision Making | Chitra Subramanian et.al. | 2402.13440 | null |
2024-02-22 | Learning and Sustaining Shared Normative Systems via Bayesian Rule Induction in Markov Games | Ninell Oldenburg et.al. | 2402.13399 | link |
2024-02-19 | A Conflict-Aware Optimal Goal Assignment Algorithm for Multi-Robot Systems | Aakash et.al. | 2402.13292 | null |
2024-02-20 | Formal Synthesis of Controllers for Safety-Critical Autonomous Systems: Developments and Challenges | Xiang Yin et.al. | 2402.13075 | null |
2024-02-19 | Optimal Rejection of Bounded Perturbations in Linear Leader-Following Consensus Protocol: Method Invariant Ellipsoid | Siyuan Wang et.al. | 2402.12468 | null |
2024-02-19 | Aligning Individual and Collective Objectives in Multi-Agent Cooperation | Yang Li et.al. | 2402.12416 | null |
2024-02-19 | Easy as ABCs: Unifying Boltzmann Q-Learning and Counterfactual Regret Minimization | Luca D’Amico-Wong et.al. | 2402.11835 | null |
2024-02-19 | Stochastic Approximation with Delayed Updates: Finite-Time Rates under Markovian Sampling | Arman Adibi et.al. | 2402.11800 | null |
2024-02-19 | Targeted Parallelization of Conflict-Based Search for Multi-Robot Path Planning | Teng Guo et.al. | 2402.11768 | null |
2024-02-18 | LongAgent: Scaling Language Models to 128k Context through Multi-Agent Collaboration | Jun Zhao et.al. | 2402.11550 | link |
2024-02-18 | Benchmark Self-Evolving: A Multi-Agent Framework for Dynamic LLM Evaluation | Siyuan Wang et.al. | 2402.11443 | link |
2024-02-16 | Modelling crypto markets by multi-agent reinforcement learning | Johann Lussange et.al. | 2402.10803 | link |
2024-02-14 | Middleware-based multi-agent development environment for building and testing distributed intelligent systems | Francisco José Aguayo-Canela et.al. | 2402.10385 | null |
2024-02-15 | TDAG: A Multi-Agent Framework based on Dynamic Task Decomposition and Agent Generation | Yaoxiang Wang et.al. | 2402.10178 | link |
2024-02-14 | Advancing Building Energy Modeling with Large Language Models: Exploration and Case Studies | Liang Zhang et.al. | 2402.09579 | null |
2024-02-14 | ABIDES-Economist: Agent-Based Simulation of Economic Systems with Learning Agents | Kshama Dwarakanath et.al. | 2402.09563 | null |
2024-02-14 | Enriched multi-agent middleware for building rule-based distributed security solutions for IoT environments | Francisco José Aguayo-Canela et.al. | 2402.09499 | null |
2024-02-14 | Who Plays First? Optimizing the Order of Play in Stackelberg Games with Many Robots | Haimin Hu et.al. | 2402.09246 | null |
2024-02-13 | Approximate Sequential Optimization for Informative Path Planning | Joshua Ott et.al. | 2402.08841 | link |
2024-02-13 | Optimal Task Assignment and Path Planning using Conflict-Based Search with Precedence and Temporal Constraints | Yu Quan Chong et.al. | 2402.08772 | null |
2024-02-13 | Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast | Xiangming Gu et.al. | 2402.08567 | link |
2024-02-13 | Fairness Auditing with Multi-Agent Collaboration | Martijn de Vos et.al. | 2402.08522 | link |
2024-02-13 | Conservative and Risk-Aware Offline Multi-Agent Reinforcement Learning for Digital Twins | Eslam Eldeeb et.al. | 2402.08421 | link |
2024-02-13 | Interacting Particle Systems on Networks: joint inference of the network and the interaction kernel | Quanjun Lang et.al. | 2402.08412 | null |
2024-02-13 | Simulating Human Strategic Behavior: Comparing Single and Multi-agent LLMs | Karthik Sreedhar et.al. | 2402.08189 | null |
2024-02-13 | Enabling Multi-Agent Transfer Reinforcement Learning via Scenario Independent Representation | Ayesha Siddika Nipu et.al. | 2402.08184 | null |
2024-02-12 | Large Language Models as Agents in Two-Player Games | Yang Liu et.al. | 2402.08078 | null |
2024-02-12 | MAIDCRL: Semi-centralized Multi-Agent Influence Dense-CNN Reinforcement Learning | Ayesha Siddika Nipu et.al. | 2402.07890 | null |
2024-02-12 | Continuous Assurance of Autonomous Vehicle Behavior Through Machine Learned Correctness Properties | Matthew Litton et.al. | 2402.07791 | null |
2024-02-12 | Mixed Q-Functionals: Advancing Value-Based Methods in Cooperative MARL with Continuous Action Domains | Yasin Findik et.al. | 2402.07752 | null |
2024-02-12 | Rethinking Scaling Laws for Learning in Strategic Environments | Tinashe Handina et.al. | 2402.07588 | null |
2024-02-12 | Ensuring trustworthy and ethical behaviour in intelligent logical agents | Stefania Costantini et.al. | 2402.07547 | link |
2024-02-12 | Can LLMs Produce Faithful Explanations For Fact-checking? Towards Faithful Explainable Fact-Checking via Multi-Agent Debate | Kyungha Kim et.al. | 2402.07401 | null |
2024-02-11 | Refined Sample Complexity for Markov Games with Independent Linear Function Approximation | Yan Dai et.al. | 2402.07082 | null |
2024-02-10 | A Factor Graph Model of Trust for a Collaborative Multi-Agent System | Behzad Akbari et.al. | 2402.07049 | null |
2024-02-09 | Distributed Quasi-Newton Method for Multi-Agent Optimization | Ola Shorinwa et.al. | 2402.06778 | null |
2024-02-09 | Evaluating the impact of items and cooperation in inventory models with exemptable ordering costs | M. Gloria Fiestras-Janeiro et.al. | 2402.06545 | null |
2024-02-09 | Distributed Safe Navigation of Multi-Agent Systems using Control Barrier Function-Based Optimal Controllers | Pol Mestres et.al. | 2402.06195 | null |
2024-02-08 | Risk-Sensitive Multi-Agent Reinforcement Learning in Network Aggregative Markov Games | Hafez Ghaemi et.al. | 2402.05906 | link |
2024-02-08 | When is Mean-Field Reinforcement Learning Tractable and Relevant? | Batuhan Yardim et.al. | 2402.05757 | null |
2024-02-08 | Adaptive Methods for Variational Inequalities under Relaxed Smoothness Assumption | Daniil Vankov et.al. | 2402.05691 | null |
2024-02-08 | Linking Vision and Multi-Agent Communication through Visible Light Communication using Event Cameras | Haruyuki Nakagawa et.al. | 2402.05619 | null |
2024-02-09 | Towards Generalizability of Multi-Agent Reinforcement Learning in Graphs with Recurrent Message Passing | Jannis Weil et.al. | 2402.05027 | link |
2024-02-08 | Multi-Sender Persuasion – A Computational Perspective | Safwan Hossain et.al. | 2402.04971 | null |
2024-02-09 | Multimodal Query Suggestion with Multi-Agent Reinforcement Learning from Human Feedback | Zheng Wang et.al. | 2402.04867 | null |
2024-02-07 | Learning Communication Policies for Different Follower Behaviors in a Collaborative Reference Game | Philipp Sadler et.al. | 2402.04824 | null |
2024-02-07 | Investigating Driving Interactions: A Robust Multi-Agent Simulation Framework for Autonomous Vehicles | Marc Kaufeld et.al. | 2402.04720 | link |
2024-02-06 | Decentralized Blockchain-based Robust Multi-agent Multi-armed Bandit | Mengfan Xu et.al. | 2402.04417 | null |
2024-02-06 | Breaking Data Silos: Cross-Domain Learning for Multi-Agent Perception from Independent Private Sources | Jinlong Li et.al. | 2402.04273 | link |
2024-02-06 | Joint Intrinsic Motivation for Coordinated Exploration in Multi-Agent Deep Reinforcement Learning | Maxime Toquebiau et.al. | 2402.03972 | link |
2024-02-06 | Interpersonal trust: Asymptotic analysis of a stochastic coordination game with multi-agent learning | Benedikt V. Meylahn et.al. | 2402.03894 | null |
2024-02-06 | The Emergence of Cooperation in the well-mixed Prisoner’s Dilemma: Memory Couples Individual and Group Strategies | Changyan Di et.al. | 2402.03890 | null |
2024-02-06 | Beyond Lines and Circles: Unveiling the Geometric Reasoning Gap in Large Language Models | Spyridon Mouselinos et.al. | 2402.03877 | null |
2024-02-06 | SUB-PLAY: Adversarial Policies against Partially Observed Multi-Agent Reinforcement Learning Systems | Oubo Ma et.al. | 2402.03741 | link |
2024-02-06 | Hierarchical Large Language Models in Cloud Edge End Architecture for Heterogeneous Robot Cluster Control | Zhirong Luan et.al. | 2402.03703 | null |
2024-02-05 | Assessing the Impact of Distribution Shift on Reinforcement Learning Performance | Ted Fujimoto et.al. | 2402.03590 | null |
2024-02-05 | A Reinforcement Learning Approach for Dynamic Rebalancing in Bike-Sharing System | Jiaqi Liang et.al. | 2402.03589 | null |
2024-02-05 | LLM Multi-Agent Systems: Challenges and Open Problems | Shanshan Han et.al. | 2402.03578 | null |
2024-02-05 | Toward Human-AI Alignment in Large-Scale Multi-Player Games | Sugandha Sharma et.al. | 2402.03575 | null |
2024-02-05 | Multi-agent Reinforcement Learning for Energy Saving in Multi-Cell Massive MIMO Systems | Tianzhang Cai et.al. | 2402.03204 | null |
2024-02-05 | Decentralized Event-Triggered Online Learning for Safe Consensus of Multi-Agent Systems with Gaussian Process Regression | Xiaobing Dai et.al. | 2402.03174 | null |
2024-02-05 | Trustworthiness of Optimality Condition Violation in Inverse Optimal Control Methods Based on the Minimum Principle | Philipp Karg et.al. | 2402.03157 | null |
2024-02-05 | Proof Theory and Decision Procedures for Deontic STIT Logics | Tim S. Lyon et.al. | 2402.03148 | null |
2024-02-05 | Cooperative Learning with Gaussian Processes for Euler-Lagrange Systems Tracking Control under Switching Topologies | Zewen Yang et.al. | 2402.03048 | null |
2024-02-05 | Whom to Trust? Elective Learning for Distributed Gaussian Process Regression | Zewen Yang et.al. | 2402.03014 | null |
2024-02-05 | DualBi: A dual bisection algorithm for non-convex problems with a scalar complicating constraint | Lucrezia Manieri et.al. | 2402.03013 | null |
2024-02-05 | Multi-Agent Reinforcement Learning for Offloading Cellular Communications with Cooperating UAVs | Abhishek Mondal et.al. | 2402.02957 | null |
2024-02-04 | SIMPL: A Simple and Efficient Multi-agent Motion Prediction Baseline for Autonomous Driving | Lu Zhang et.al. | 2402.02519 | link |
2024-02-04 | Fast Peer Adaptation with Context-aware Exploration | Long Ma et.al. | 2402.02468 | null |
2024-02-02 | MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models | Justin Chih-Yao Chen et.al. | 2402.01620 | link |
2024-02-02 | Guidance Graph Optimization for Lifelong Multi-Agent Path Finding | Yulun Zhang et.al. | 2402.01446 | link |
2024-02-02 | CodePori: Large Scale Model for Autonomous Software Development by Using Multi-Agents | Zeeshan Rasheed et.al. | 2402.01411 | link |
2024-02-02 | Can Large Language Models Serve as Data Analysts? A Multi-Agent Assisted Approach for Qualitative Data Analysis | Zeeshan Rasheed et.al. | 2402.01386 | null |
2024-02-02 | Neural Trajectory Model: Implicit Neural Trajectory Representation for Trajectories Generation | Zihan Yu et.al. | 2402.01254 | link |
2024-02-02 | A Multi-Agent Conversational Recommender System | Jiabao Fang et.al. | 2402.01135 | null |
2024-02-02 | Near-Optimal Reinforcement Learning with Self-Play under Adaptivity Constraints | Dan Qiao et.al. | 2402.01111 | null |
2024-02-02 | Reasoning Capacity in Multi-Agent Systems: Limitations, Challenges and Human-Centered Solutions | Pouya Pezeshkpour et.al. | 2402.01108 | null |
2024-02-02 | The Danger Of Arrogance: Welfare Equilibra As A Solution To Stackelberg Self-Play In Non-Coincidental Games | Jake Levi et.al. | 2402.01088 | link |
2024-02-01 | Closure Discovery for Coarse-Grained Partial Differential Equations using Multi-Agent Reinforcement Learning | Jan-Philipp von Bassewitz et.al. | 2402.00972 | null |
2024-02-01 | Learning and Calibrating Heterogeneous Bounded Rational Market Behaviour with Multi-Agent Reinforcement Learning | Benjamin Patrick Evans et.al. | 2402.00787 | null |
2024-02-01 | FM3Q: Factorized Multi-Agent MiniMax Q-Learning for Two-Team Zero-Sum Markov Game | Guangzheng Hu et.al. | 2402.00738 | null |
2024-02-01 | Developing A Multi-Agent and Self-Adaptive Framework with Deep Reinforcement Learning for Dynamic Portfolio Risk Management | Zhenglong Li et.al. | 2402.00515 | link |
2024-02-01 | Multi-agent Path Finding for Cooperative Autonomous Driving | Zhongxia Yan et.al. | 2402.00334 | link |
2024-02-01 | High-Level, Collaborative Task Planning Grammar and Execution for Heterogeneous Agents | Amy Fang et.al. | 2402.00296 | null |
2024-01-31 | Nash Soft Actor-Critic LEO Satellite Handover Management Algorithm for Flying Vehicles | Jinxuan Chen et.al. | 2402.00091 | null |
2024-01-31 | CARFF: Conditional Auto-encoded Radiance Field for 3D Scene Forecasting | Jiezhi Yang et.al. | 2401.18075 | null |
2024-01-31 | Attention Graph for Multi-Robot Social Navigation with Deep Reinforcement Learning | Erwan Escudie et.al. | 2401.17914 | null |
2024-01-31 | Graph Attention-based Reinforcement Learning for Trajectory Design and Resource Assignment in Multi-UAV Assisted Communication | Zikai Feng et.al. | 2401.17880 | null |
2024-01-31 | Multi-Agent Phase-Balancing around Polar Curves with Bounded Trajectories: An Experimental Study using Crazyflies and MoCap System | Gaurav Singh Bhati et.al. | 2401.17591 | null |
2024-01-30 | AdvGPS: Adversarial GPS for Multi-Agent Perception Attack | Jinlong Li et.al. | 2401.17499 | link |
2024-01-30 | Camouflage Adversarial Attacks on Multiple Agent Systems | Ziqing Lu et.al. | 2401.17405 | null |
2024-01-30 | Scalable Mechanism Design for Multi-Agent Path Finding | Paul Friedrich et.al. | 2401.17044 | link |
2024-01-29 | Collaborative Manipulation of Deformable Objects with Predictive Obstacle Avoidance | Burak Aksoy et.al. | 2401.16560 | link |
2024-01-29 | A mechanism for discovering semantic relationships among agent communication protocols | Idoia Berges et.al. | 2401.16216 | null |
2024-01-29 | FIMP: Future Interaction Modeling for Multi-Agent Motion Prediction | Sungmin Woo et.al. | 2401.16189 | null |
2024-01-28 | ARGOS: An Automaton Referencing Guided Overtake System for Head-to-Head Autonomous Racing | Varundev Sukhil et.al. | 2401.15783 | null |
2024-01-28 | Survey of Distributed Algorithms for Resource Allocation over Multi-Agent Systems | Mohammadreza Doostmohammadian et.al. | 2401.15607 | null |
2024-01-28 | Accelerated Distributed Allocation | Mohammadreza Doostmohammadian et.al. | 2401.15598 | null |
2024-01-27 | Distributed Resilient Interval Observer Synthesis for Nonlinear Discrete-Time Systems | Mohammad Khajenejad et.al. | 2401.15511 | null |
2024-01-26 | Fully Independent Communication in Multi-Agent Reinforcement Learning | Rafael Pina et.al. | 2401.15059 | link |
2024-01-26 | Multi-Agent Coordination for a Partially Observable and Dynamic Robot Soccer Environment with Limited Communication | Daniele Affinita et.al. | 2401.15026 | null |
2024-01-26 | Energy Flexibility Potential in the Brewery Sector: A Multi-agent Based Simulation of 239 Danish Breweries | Daniel Anthony Howard et.al. | 2401.14903 | null |
2024-01-26 | On Inhomogeneous Infinite Products of Stochastic Matrices and Applications | Zhaoyue Xia et.al. | 2401.14612 | null |
2024-01-26 | Enhancing Diagnostic Accuracy through Multi-Agent Conversations: Using Large Language Models to Mitigate Cognitive Bias | Yu He Ke et.al. | 2401.14589 | null |
2024-01-25 | GCBF+: A Neural Graph Control Barrier Function Framework for Distributed Safe Multi-Agent Control | Songyuan Zhang et.al. | 2401.14554 | link |
2024-01-25 | STEMFold: Stochastic Temporal Manifold for Multi-Agent Interactions in the Presence of Hidden Agents | Hemant Kumawat et.al. | 2401.14522 | null |
2024-01-27 | Networked Multiagent Reinforcement Learning for Peer-to-Peer Energy Trading | Chen Feng et.al. | 2401.13947 | null |
2024-01-24 | Event-triggered adaptive consensus of heterogeneous multi-agent system under communication and actuator faults | Leyi Zheng et.al. | 2401.13492 | null |
2024-01-24 | Multi-Agent Diagnostics for Robustness via Illuminated Diversity | Mikayel Samvelyan et.al. | 2401.13460 | null |
2024-01-24 | Dynamic Epistemic Logic of Resource Bounded Information Mining Agents | Vitaliy Dolgorukov et.al. | 2401.13369 | null |
2024-01-24 | Past, Present, Future: A Comprehensive Exploration of AI Use Cases in the UMBRELLA IoT Testbed | Peizheng Li et.al. | 2401.13346 | null |
2024-01-23 | Generalization of Heterogeneous Multi-Robot Policies via Awareness and Communication of Capabilities | Pierce Howell et.al. | 2401.13127 | null |
2024-01-23 | Viewport Prediction, Bitrate Selection, and Beamforming Design for THz-Enabled 360-Degree Video Streaming | Mehdi Setayesh et.al. | 2401.13114 | null |
2024-01-23 | Emergent Communication Protocol Learning for Task Offloading in Industrial Internet of Things | Salwa Mostafa et.al. | 2401.12914 | null |
2024-01-23 | Localized Data-driven Consensus Control | Zeze Chang et.al. | 2401.12707 | null |
2024-01-23 | Pragmatic Communication in Multi-Agent Collaborative Perception | Yue Hu et.al. | 2401.12694 | null |
2024-01-23 | Learning Mean Field Games on Sparse Graphs: A Hybrid Graphex Approach | Christian Fabian et.al. | 2401.12686 | null |
2024-01-23 | Knowledge Distillation from Language-Oriented to Emergent Communication for Multi-Agent Remote Control | Yongjun Kim et.al. | 2401.12624 | null |
2024-01-23 | Backpropagation Through Agents | Zhiyuan Li et.al. | 2401.12574 | null |
2024-01-23 | Multi-agent deep reinforcement learning with centralized training and decentralized execution for transportation infrastructure management | M. Saifullah et.al. | 2401.12455 | null |
2024-01-22 | Multi-Agent Dynamic Relational Reasoning for Social Robot Navigation | Jiachen Li et.al. | 2401.12275 | null |
2024-01-22 | Natural Strategic Ability in Stochastic Multi-Agent Systems | Raphaël Berthon et.al. | 2401.12170 | null |
2024-01-22 | PsySafe: A Comprehensive Framework for Psychological-based Attack, Defense, and Evaluation of Multi-agent System Safety | Zaibin Zhang et.al. | 2401.11880 | link |
2024-01-21 | Multi-Agent Generative Adversarial Interactive Self-Imitation Learning for AUV Formation Control and Obstacle Avoidance | Zheng Fang et.al. | 2401.11378 | null |
2024-01-20 | Measuring Policy Distance for Multi-Agent Reinforcement Learning | Tianyi Hu et.al. | 2401.11257 | link |
2024-01-19 | T2MAC: Targeted and Trusted Multi-Agent Communication through Selective Engagement and Evidence-Driven Integration | Chuxiong Sun et.al. | 2401.10973 | null |
2024-01-18 | The Synergy Between Optimal Transport Theory and Multi-Agent Reinforcement Learning | Ali Baheri et.al. | 2401.10949 | null |
2024-01-18 | Cooperative Multi-Agent Graph Bandits: UCB Algorithm and Regret Analysis | Phevos Paschalidis et.al. | 2401.10383 | null |
2024-01-18 | Model-Assisted Learning for Adaptive Cooperative Perception of Connected Autonomous Vehicles | Kaige Qu et.al. | 2401.10156 | null |
2024-01-18 | Multi-Agent Reinforcement Learning for Maritime Operational Technology Cyber Security | Alec Wilson et.al. | 2401.10149 | null |
2024-01-18 | Cooperative Edge Caching Based on Elastic Federated and Multi-Agent Deep Reinforcement Learning in Next-Generation Network | Qiong Wu et.al. | 2401.09886 | link |
2024-01-18 | Tiny Multi-Agent DRL for Twins Migration in UAV Metaverses: A Multi-Leader Multi-Follower Stackelberg Game Approach | Jiawen Kang et.al. | 2401.09680 | null |
2024-01-17 | A Multi-Agent Security Testbed for the Analysis of Attacks and Defenses in Collaborative Sensor Fusion | R. Spencer Hallyburton et.al. | 2401.09387 | null |
2024-01-17 | Self-navigation in crowds: An invariant set-based approach | Veejay Karthik J et.al. | 2401.09375 | null |
2024-01-16 | REValueD: Regularised Ensemble Value-Decomposition for Factorisable Markov Decision Processes | David Ireland et.al. | 2401.08850 | null |
2024-01-16 | Iterative Planning for Multi-agent Systems: An Application in Energy-Aware UAV-UGV Cooperative Task Site Assignments | Neelanga Thelasingha et.al. | 2401.08846 | null |
2024-01-16 | AgentMixer: Multi-Agent Correlated Policy Factorization | Zhiyuan Li et.al. | 2401.08728 | null |
2024-01-16 | Battery-Swapping Multi-Agent System for Sustained Operation of Large Planetary Fleets | Ethan Holand et.al. | 2401.08497 | null |
2024-01-16 | CycLight: learning traffic signal cooperation with a cycle-level strategy | Gengyue Han et.al. | 2401.08121 | null |
2024-01-15 | SSL-Interactions: Pretext Tasks for Interactive Trajectory Prediction | Prarthana Bhattacharyya et.al. | 2401.07729 | null |
2024-01-15 | Fully Decentralized Design of Initialization-free Distributed Network Size Estimation | Donggil Lee et.al. | 2401.07472 | null |
2024-01-14 | BET: Explaining Deep Reinforcement Learning through The Error-Prone Decisions | Xiao Liu et.al. | 2401.07263 | null |
2024-01-13 | One Agent Too Many: User Perspectives on Approaches to Multi-agent Conversational AI | Christopher Clarke et.al. | 2401.07123 | null |
2024-01-13 | Generative AI-enabled Quantum Computing Networks and Intelligent Resource Allocation | Minrui Xu et.al. | 2401.07120 | null |
2024-01-13 | Aquarium: A Comprehensive Framework for Exploring Predator-Prey Dynamics through Multi-Agent Reinforcement Learning Algorithms | Michael Kölle et.al. | 2401.07056 | link |
2024-01-12 | Mutual Enhancement of Large Language and Reinforcement Learning Models through Bi-Directional Feedback Mechanisms: A Case Study | Shangding Gu et.al. | 2401.06603 | null |
2024-01-12 | UNEX-RL: Reinforcing Long-Term Rewards in Multi-Stage Recommender Systems with UNidirectional EXecution | Gengrui Zhang et.al. | 2401.06470 | null |
2024-01-12 | A Logic for Repair and State Recovery in Byzantine Fault-tolerant Multi-agent Systems | Hans van Ditmarsch et.al. | 2401.06451 | null |
2024-01-12 | A Semantic-Aware Multiple Access Scheme for Distributed, Dynamic 6G-Based Applications | Hamidreza Mazandarani et.al. | 2401.06308 | null |
2024-01-11 | Distributed Optimal Output Consensus Control of Heterogeneous Multi-Agent Systems with Safety Constraints | Ji Ma et.al. | 2401.06245 | null |
2024-01-11 | Multi-Agent Based Simulation for Investigating Electric Vehicle Adoption and Its Impacts on Electricity Distribution Grids and CO2 Emissions | Kristoffer Christensen et.al. | 2401.06192 | null |
2024-01-11 | Combating Adversarial Attacks with Multi-Agent Debate | Steffi Chern et.al. | 2401.05998 | link |
2024-01-11 | Confidence-Based Curriculum Learning for Multi-Agent Path Finding | Thomy Phan et.al. | 2401.05860 | link |
2024-01-11 | Secure Dynamic Event-triggered Consensus Under Asynchronous Denial of Service | Ali Azarbahram et.al. | 2401.05857 | null |
2024-01-11 | Tracking Consensus of Networked Random Nonlinear Multi-agent Systems with Intermittent Communications | Ali Azarbahram et.al. | 2401.05808 | null |
2024-01-11 | Augmented Reality User Interface for Command, Control, and Supervision of Large Multi-Agent Teams | Frank Regal et.al. | 2401.05665 | null |
2024-01-11 | Full-State Prescribed Performance-Based Consensus of Double-Integrator Multi-Agent Systems with Jointly Connected Topologies | Yahui Hou et.al. | 2401.05639 | null |
2024-01-10 | Innate-Values-driven Reinforcement Learning for Cooperative Multi-Agent Systems | Qin Yang et.al. | 2401.05572 | null |
2024-01-10 | Transparency as Delayed Observability in Multi-Agent Systems | Kshama Dwarakanath et.al. | 2401.05563 | null |
2024-01-10 | Dual Quaternion Laplacian Matrix and Formation Control | Liqun Qi et.al. | 2401.05132 | null |
2024-01-10 | Discrete-Time Stress Matrix-Based Formation Control of General Linear Multi-Agent Systems | Okechi Onuoha et.al. | 2401.05083 | null |
2024-01-10 | Fully Decentralized Cooperative Multi-Agent Reinforcement Learning: A Survey | Jiechuan Jiang et.al. | 2401.04934 | null |
2024-01-09 | Deep Reinforcement Multi-agent Learning framework for Information Gathering with Local Gaussian Processes for Water Monitoring | Samuel Yanes Luis et.al. | 2401.04631 | null |
2024-01-09 | StarCraftImage: A Dataset For Prototyping Spatial Reasoning Methods For Multi-Agent Environments | Sean Kulinski et.al. | 2401.04290 | null |
2024-01-08 | MARG: Multi-Agent Review Generation for Scientific Papers | Mike D’Arcy et.al. | 2401.04259 | link |
2024-01-08 | Zeroth-Order Non-Convex Optimization for Cooperative Multi-Agent Systems with Diminishing Step Size and Smoothing Radius | Xinran Zheng et.al. | 2401.03998 | null |
2024-01-08 | SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems | Dong Zhang et.al. | 2401.03945 | link |
2024-01-08 | A Tensor Network Implementation of Multi Agent Reinforcement Learning | Sunny Howard et.al. | 2401.03896 | null |
2024-01-08 | Is Limited Information Enough? An Approximate Multi-agent Coverage Control in Non-Convex Discrete Environments | Tatsuya Iwase et.al. | 2401.03752 | null |
2024-01-08 | Why Solving Multi-agent Path Finding with Large Language Model has not Succeeded Yet | Weizhe Chen et.al. | 2401.03630 | null |
2024-01-07 | NovelGym: A Flexible Ecosystem for Hybrid Planning and Learning Agents Designed for Open Worlds | Shivam Goel et.al. | 2401.03546 | null |
2024-01-07 | ClusterComm: Discrete Communication in Decentralized MARL using Internal Representation Clustering | Robert Müller et.al. | 2401.03504 | null |
2024-01-07 | Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects | Yuheng Cheng et.al. | 2401.03428 | link |
2024-01-07 | Improving Dribbling, Passing, and Marking Actions in Soccer Simulation 2D Games Using Machine Learning | Nader Zare et.al. | 2401.03406 | link |
2024-01-06 | Distributed Identification of Stable Large-Scale Isomorphic Nonlinear Networks Using Partial Observations | Chunhui Li et.al. | 2401.03216 | null |
2024-01-05 | Une ontologie pour les syst{è}mes multi-agents ambiants dans les villes intelligentes | Nathan Aky et.al. | 2401.02726 | null |
2024-01-05 | XUAT-Copilot: Multi-Agent Collaborative System for Automated User Acceptance Testing with Large Language Model | Zhitao Wang et.al. | 2401.02705 | null |
2024-01-04 | On the Prospects of Incorporating Large Language Models (LLMs) in Automated Planning and Scheduling (APS) | Vishal Pallagani et.al. | 2401.02500 | null |
2024-01-04 | Multi-agent Modeling and Optimal Pumping Control of Magnetic Artificial Cilia | Shuangshuang Yu et.al. | 2401.02455 | null |
2024-01-04 | Multi-Agent Context Learning Strategy for Interference-Aware Beam Allocation in mmWave Vehicular Communications | Abdulkadir Kose et.al. | 2401.02323 | link |
2024-01-04 | A Decentralized Multiagent-Based Task Scheduling Framework for Handling Uncertain Events in Fog Computing | Yikun Yang et.al. | 2401.02219 | null |
2024-01-03 | Optimizing UAV-UGV Coalition Operations: A Hybrid Clustering and Multi-Agent Reinforcement Learning Approach for Path Planning in Obstructed Environment | Shamyo Brotee et.al. | 2401.01481 | null |
2024-01-02 | LLM Harmony: Multi-Agent Communication for Problem Solving | Sumedh Rasal et.al. | 2401.01312 | link |
2024-01-02 | Joint Offloading and Resource Allocation for Hybrid Cloud and Edge Computing in SAGINs: A Decision Assisted Hybrid Action Space Deep Reinforcement Learning Approach | Chong Huang et.al. | 2401.01140 | null |
2024-01-01 | Edge Computing based Human-Robot Cognitive Fusion: A Medical Case Study in the Autism Spectrum Disorder Therapy | Qin Yang et.al. | 2401.00776 | null |
2024-01-01 | Plug-and-Play regularized 3D seismic inversion with 2D pre-trained denoisers | Nick Luiken et.al. | 2401.00753 | null |
2024-01-01 | Polynomial-time Approximation Scheme for Equilibriums of Games | Hongbo Sun et.al. | 2401.00747 | link |
2023-12-30 | Bidirectional Temporal Plan Graph: Enabling Switchable Passing Orders for More Efficient Multi-Agent Path Finding Plan Execution | Yifan Su et.al. | 2401.00315 | null |
2023-12-30 | Physics-Informed Multi-Agent Reinforcement Learning for Distributed Multi-Robot Problems | Eduardo Sebastian et.al. | 2401.00212 | link |
2023-12-30 | Leveraging Partial Symmetry for Multi-Agent Reinforcement Learning | Xin Yu et.al. | 2401.00167 | null |
2023-12-30 | Contrastive learning-based agent modeling for deep reinforcement learning | Wenhao Ma et.al. | 2401.00132 | null |