Usage instructions: here
Table of Contents
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-09-18 | Residual Descent Differential Dynamic Game (RD3G) -- A Fast Newton Solver for Constrained General Sum Games | Zhiyuan Zhang et.al. | 2409.12152 | null |
2024-09-18 | MAgICoRe: Multi-Agent, Iterative, Coarse-to-Fine Refinement for Reasoning | Justin Chih-Yao Chen et.al. | 2409.12147 | link |
2024-09-19 | The Impact of Element Ordering on LM Agent Performance | Wayne Chi et.al. | 2409.12089 | link |
2024-09-19 | Using Large Language Models to Generate Clinical Trial Tables and Figures | Yumeng Yang et.al. | 2409.12046 | null |
2024-09-19 | Representing Positional Information in Generative World Models for Object Manipulation | Stefano Ferraro et.al. | 2409.12005 | null |
2024-09-18 | Putting Data at the Centre of Offline Multi-Agent Reinforcement Learning | Claude Formanek et.al. | 2409.12001 | null |
2024-09-18 | On the Stability of Consensus Control under Rotational Ambiguities | Zhonggang Li et.al. | 2409.11979 | null |
2024-09-18 | Anomalous behavior of Replicator dynamics for the Prisoner's Dilemma on diluted lattices | Fernanda R. Leivas et.al. | 2409.11955 | null |
2024-09-18 | Reinforcement Learning as an Improvement Heuristic for Real-World Production Scheduling | Arthur Müller et.al. | 2409.11933 | null |
2024-09-18 | Secure Control Systems for Autonomous Quadrotors against Cyber-Attacks | Samuel Belkadi et.al. | 2409.11897 | link |
2024-09-18 | Motivations, Challenges, Best Practices, and Benefits for Bots and Conversational Agents in Software Engineering: A Multivocal Literature Review | Stefano Lambiase et.al. | 2409.11864 | null |
2024-09-18 | XP-MARL: Auxiliary Prioritization in Multi-Agent Reinforcement Learning to Address Non-Stationarity | Jianye Xu et.al. | 2409.11852 | null |
2024-09-18 | Optimizing Job Shop Scheduling in the Furniture Industry: A Reinforcement Learning Approach Considering Machine Setup, Batch Variability, and Intralogistics | Malte Schneevogt et.al. | 2409.11820 | null |
2024-09-18 | Distributed Resilient Secondary Control for Microgrids with Attention-based Weights against High-density Misbehaving Agents | Yutong Li et.al. | 2409.11812 | null |
2024-09-18 | Synthesizing Evolving Symbolic Representations for Autonomous Systems | Gabriele Sartor et.al. | 2409.11756 | link |
2024-09-18 | HARP: Human-Assisted Regrouping with Permutation Invariant Critic for Multi-Agent Reinforcement Learning | Huawen Hu et.al. | 2409.11741 | null |
2024-09-18 | Revealing the Challenge of Detecting Character Knowledge Errors in LLM Role-Playing | Wenyuan Zhang et.al. | 2409.11726 | link |
2024-09-18 | Discovering Conceptual Knowledge with Analytic Ontology Templates for Articulated Objects | Jianhua Sun et.al. | 2409.11702 | null |
2024-09-18 | RMP-YOLO: A Robust Motion Predictor for Partially Observable Scenarios even if You Only Look Once | Jiawei Sun et.al. | 2409.11696 | null |
2024-09-18 | Towards Explainable Goal Recognition Using Weight of Evidence (WoE): A Human-Centered Approach | Abeer Alshehri et.al. | 2409.11675 | null |
2024-09-18 | Agent Aggregator with Mask Denoise Mechanism for Histopathology Whole Slide Image Analysis | Xitong Ling et.al. | 2409.11664 | null |
2024-09-18 | From Data Stories to Dialogues: A Randomised Controlled Trial of Generative AI Agents and Data Storytelling in Enhancing Data Visualisation Comprehension | Lixiang Yan et.al. | 2409.11645 | null |
2024-09-17 | Context-Generative Default Policy for Bounded Rational Agent | Durgakant Pushp et.al. | 2409.11604 | null |
2024-09-17 | React to This! How Humans Challenge Interactive Agents using Nonverbal Behaviors | Chuxuan Zhang et.al. | 2409.11602 | null |
2024-09-17 | Distributed Deep Koopman Learning for Nonlinear Dynamics | Wenjian Hao et.al. | 2409.11586 | null |
2024-09-17 | PLATO: Planning with LLMs and Affordances for Tool Manipulation | Arvind Car et.al. | 2409.11580 | null |
2024-09-17 | Optimal Investment with Costly Expert Opinions | Christoph Knochenhauer et.al. | 2409.11569 | null |
2024-09-17 | Hyper-SAMARL: Hypergraph-based Coordinated Task Allocation and Socially-aware Navigation for Multi-Robot Systems | Weizheng Wang et.al. | 2409.11561 | null |
2024-09-17 | Improving LLM Reasoning with Multi-Agent Tree-of-Thought Validator Agent | Fatemeh Haji et.al. | 2409.11527 | null |
2024-09-17 | Diffusion of knowledge and the lottery society | Henri Berestycki et.al. | 2409.11479 | null |
2024-09-17 | Consensus decision making on a complete graph: complex behaviour from simple assumptions | P. Sarkanych et.al. | 2409.11475 | null |
2024-09-12 | Towards Opinion Shaping: A Deep Reinforcement Learning Approach in Bot-User Interactions | Farbod Siahkali et.al. | 2409.11426 | null |
2024-09-17 | Ising model with varying spin strength on a scale-free network: scaling functions and critical amplitude ratios | M. Krasnytska et.al. | 2409.11396 | null |
2024-09-17 | Distributed Perception Aware Safe Leader Follower System via Control Barrier Methods | Richie R. Suganda et.al. | 2409.11394 | null |
2024-09-17 | LLM-Agent-UMF: LLM-based Agent Unified Modeling Framework for Seamless Integration of Multi Active/Passive Core-Agents | Amine B. Hassouna et.al. | 2409.11393 | null |
2024-09-17 | CORE-Bench: Fostering the Credibility of Published Research Through a Computational Reproducibility Agent Benchmark | Zachary S. Siegel et.al. | 2409.11363 | link |
2024-09-17 | A Scalable Game Theoretic Approach for Coordination of Multiple Dynamic Systems | Mostafa M. Shibl et.al. | 2409.11358 | null |
2024-09-17 | EIA: Environmental Injection Attack on Generalist Web Agents for Privacy Leakage | Zeyi Liao et.al. | 2409.11295 | null |
2024-09-17 | P-RAG: Progressive Retrieval Augmented Generation For Planning on Embodied Everyday Task | Weiye Xu et.al. | 2409.11279 | null |
2024-09-17 | Hackphyr: A Local Fine-Tuned LLM Agent for Network Security Environments | Maria Rigaki et.al. | 2409.11276 | null |
2024-09-19 | The Art of Storytelling: Multi-Agent Generative AI for Dynamic Multimodal Narratives | Samee Arif et.al. | 2409.11261 | link |
2024-09-17 | To What Extent do Open-loop and Feedback Nash Equilibria Diverge in General-Sum Linear Quadratic Dynamic Games? | Chih-Yuan Chiu et.al. | 2409.11257 | null |
2024-09-17 | A Continuous-time Tractable Model for Present-biased Agents | Yasunori Akagi et.al. | 2409.11225 | null |
2024-09-17 | Bearing-based Target Localisation in Search and Rescue Scenarios | Giulia Michieletto et.al. | 2409.11221 | null |
2024-09-17 | SuperCoder2.0: Technical Report on Exploring the feasibility of LLMs as Autonomous Programmer | Anmol Gautam et.al. | 2409.11190 | null |
2024-09-18 | Annealed Winner-Takes-All for Motion Forecasting | Yihong Xu et.al. | 2409.11172 | link |
2024-09-17 | Preventing Unconstrained CBF Safety Filters Caused by Invalid Relative Degree Assumptions | Lukas Brunke et.al. | 2409.11171 | null |
2024-09-17 | Reactive Environments for Active Inference Agents with RxEnvironments.jl | Wouter W. L. Nuijten et.al. | 2409.11087 | link |
2024-09-17 | Data-driven Dynamic Intervention Design in Network Games | Xiupeng Chen et.al. | 2409.11069 | null |
2024-09-17 | A logical alarm for misaligned binary classifiers | Andrés Corrada-Emmanuel et.al. | 2409.11052 | null |
2024-09-17 | Improving Speech Emotion Recognition in Under-Resourced Languages via Speech-to-Speech Translation with Bootstrapping Data Selection | Hsi-Che Lin et.al. | 2409.10985 | null |
2024-09-17 | Label-free correlative morpho-chemical tomography of 3D kidney mesangial cells | Ankit Butola et.al. | 2409.10971 | null |
2024-09-17 | Frontier Shepherding: A Bio-Mimetic Multi-robot Framework for Large-Scale Exploration | John Lewis et.al. | 2409.10931 | null |
2024-09-17 | Multi-Floor Zero-Shot Object Navigation Policy | Lingfeng Zhang et.al. | 2409.10906 | null |
2024-09-17 | Distributed Optimization for Traffic Light Control and Connected Automated Vehicle Coordination in Mixed-Traffic Intersections | Viet-Anh Le et.al. | 2409.10864 | null |
2024-09-17 | SIFToM: Robust Spoken Instruction Following through Theory of Mind | Lance Ying et.al. | 2409.10849 | null |
2024-09-17 | Improving Interface Design in Interactive Task Learning for Hierarchical Tasks based on a Qualitative Study | Jieyu Zhou et.al. | 2409.10826 | null |
2024-09-17 | Consensus in Models for Opinion Dynamics with Generalized-Bias | Juan Paz et.al. | 2409.10809 | null |
2024-09-16 | AutoSafeCoder: A Multi-Agent Framework for Securing LLM Code Generation through Static Analysis and Fuzz Testing | Ana Nunez et.al. | 2409.10737 | null |
2024-09-16 | CoMamba: Real-time Cooperative Perception Unlocked with State Space Models | Jinlong Li et.al. | 2409.10699 | null |
2024-09-16 | Mitigating Partial Observability in Adaptive Traffic Signal Control with Transformers | Xiaoyu Wang et.al. | 2409.10693 | null |
2024-09-16 | Multi-agent Path Finding in Continuous Environment | Kristýna Janovská et.al. | 2409.10680 | null |
2024-09-16 | Motion Forecasting via Model-Based Risk Minimization | Aron Distelzweig et.al. | 2409.10585 | null |
2024-09-16 | Reinforcement Learning with Quasi-Hyperbolic Discounting | S. R. Eshwar et.al. | 2409.10583 | null |
2024-09-14 | On the limits of agency in agent-based models | Ayush Chopra et.al. | 2409.10568 | link |
2024-09-13 | Applying Action Masking and Curriculum Learning Techniques to Improve Data Efficiency and Overall Performance in Operational Technology Cyber Security using Reinforcement Learning | Alec Wilson et.al. | 2409.10563 | null |
2024-09-16 | On interactive anisotropic walks in two dimensions generated from a three state opinion dynamics model | Surajit Saha et.al. | 2409.10413 | null |
2024-09-16 | Reducing Leximin Fairness to Utilitarian Optimization | Eden Hartman et.al. | 2409.10395 | null |
2024-09-16 | Decentralized and Asymmetric Multi-Agent Learning in Construction Sites | Yakov Miron et.al. | 2409.10375 | null |
2024-09-19 | Instigating Cooperation among LLM Agents Using Adaptive Information Modulation | Qiliang Chen et.al. | 2409.10372 | null |
2024-09-16 | 2D or not 2D: How Does the Dimensionality of Gesture Representation Affect 3D Co-Speech Gesture Generation? | Téo Guichoux et.al. | 2409.10357 | null |
2024-09-16 | Partial Ordering Bayesian Logistic Regression Model for Phase I Combination Trials and Computationally Efficient Approach to Operational Prior Specification | Weishi Chen et.al. | 2409.10352 | null |
2024-09-16 | Cognitive Kernel: An Open-source Agent System towards Generalist Autopilots | Hongming Zhang et.al. | 2409.10277 | link |
2024-09-16 | Synchronization-Based Cooperative Distributed Model Predictive Control | Julius Beerwerth et.al. | 2409.10215 | null |
2024-09-16 | Maneuver Decision-Making with Trajectory Streams Prediction for Autonomous Vehicles | Mais Jamal et.al. | 2409.10165 | null |
2024-09-16 | Multi-Agent Obstacle Avoidance using Velocity Obstacles and Control Barrier Functions | Alejandro Sánchez Roncero et.al. | 2409.10117 | null |
2024-09-16 | Robust Reinforcement Learning with Dynamic Distortion Risk Measures | Anthony Coache et.al. | 2409.10096 | link |
2024-09-16 | Cross-modality image synthesis from TOF-MRA to CTA using diffusion-based models | Alexander Koch et.al. | 2409.10089 | null |
2024-09-19 | Towards Physically-Realizable Adversarial Attacks in Embodied Vision Navigation | Meng Chen et.al. | 2409.10071 | link |
2024-09-16 | A Social Force Model for Multi-Agent Systems With Application to Robots Traversal in Cluttered Environments | Chenxi Li et.al. | 2409.10049 | null |
2024-09-16 | Audio-Driven Reinforcement Learning for Head-Orientation in Naturalistic Environments | Wessel Ledder et.al. | 2409.10048 | null |
2024-09-16 | Bearing-Distance Based Flocking with Zone-Based Interactions | Hossein B. Jond et.al. | 2409.10047 | null |
2024-09-16 | E2Map: Experience-and-Emotion Map for Self-Reflective Robot Navigation with Language Models | Chan Kim et.al. | 2409.10027 | null |
2024-09-16 | Reinforcement learning-based statistical search strategy for an axion model from flavor | Satsuki Nishimura et.al. | 2409.10023 | null |
2024-09-16 | SHIRE: Enhancing Sample Efficiency using Human Intuition in REinforcement Learning | Amogh Joshi et.al. | 2409.09990 | null |
2024-09-16 | Optimality Gap of Decentralized Submodular Maximization under Probabilistic Communication | Joan Vendrell et.al. | 2409.09979 | null |
2024-09-16 | Constrained Bandwidth Observation Sharing for Multi-Robot Navigation in Dynamic Environments via Intelligent Knapsack | Anirudh Chari et.al. | 2409.09975 | null |
2024-09-16 | Solving Monotone Variational Inequalities with Best Response Dynamics | Yu-Wen Chen et.al. | 2409.09961 | null |
2024-09-16 | Context-aware Advertisement Modeling and Applications in Rapid Transit Systems | Afzal Ahmed et.al. | 2409.09956 | null |
2024-09-15 | Critic as Lyapunov function (CALF): a model-free, stability-ensuring agent | Pavel Osinenko et.al. | 2409.09869 | null |
2024-09-15 | A Complete Algorithm for a Moving Target Traveling Salesman Problem with Obstacles | Anoop Bhat et.al. | 2409.09852 | null |
2024-09-15 | On the Effect of Robot Errors on Human Teaching Dynamics | Jindan Huang et.al. | 2409.09827 | null |
2024-09-15 | Revisiting the state-space model of unawareness | Alex A. T. Rathke et.al. | 2409.09818 | null |
2024-09-15 | Social Influence and Consensus Building: Introducing a q-Voter Model with Weighted Influence | Pratik Mullick et.al. | 2409.09817 | null |
2024-09-17 | Large Language Model Based Generative Error Correction: A Challenge and Baselines for Speech Recognition, Speaker Tagging, and Emotion Recognition | Chao-Han Huck Yang et.al. | 2409.09785 | null |
2024-09-15 | DiFSD: Ego-Centric Fully Sparse Paradigm with Uncertainty Denoising and Iterative Refinement for Efficient End-to-End Autonomous Driving | Haisheng Su et.al. | 2409.09777 | null |
2024-09-15 | Range-SLAM: Ultra-Wideband-Based Smoke-Resistant Real-Time Localization and Mapping | Yi Liu et.al. | 2409.09763 | null |
2024-09-15 | Automatic Control With Human-Like Reasoning: Exploring Language Model Embodied Air Traffic Agents | Justas Andriuškevičius et.al. | 2409.09717 | null |
2024-09-15 | Unveiling Gender Bias in Large Language Models: Using Teacher's Evaluation in Higher Education As an Example | Yuanning Huang et.al. | 2409.09652 | link |
2024-09-15 | RethinkMCTS: Refining Erroneous Thoughts in Monte Carlo Tree Search for Code Generation | Qingyao Li et.al. | 2409.09584 | null |
2024-09-15 | Traffic Scene Generation from Natural Language Description for Autonomous Vehicles with Large Language Model | Bo-Kai Ruan et.al. | 2409.09575 | null |
2024-09-15 | Decentralized Safe and Scalable Multi-Agent Control under Limited Actuation | Vrushabh Zinage et.al. | 2409.09573 | null |
2024-09-14 | Swarm Algorithms for Dynamic Task Allocation in Unknown Environments | Adithya Balachandran et.al. | 2409.09550 | null |
2024-09-14 | Autonomous Goal Detection and Cessation in Reinforcement Learning: A Case Study on Source Term Estimation | Yiwei Shi et.al. | 2409.09541 | null |
2024-09-14 | Ensuring System-Level Protection against Eavesdropping Adversaries in Distributed Dynamical Systems | Dipankar Maity et.al. | 2409.09539 | null |
2024-09-14 | Planning Transformer: Long-Horizon Offline Reinforcement Learning with Planning Tokens | Joseph Clinton et.al. | 2409.09513 | null |
2024-09-14 | Learning Nudges for Conditional Cooperation: A Multi-Agent Reinforcement Learning Model | Shatayu Kulkarni et.al. | 2409.09509 | null |
2024-09-14 | Learning Keypoints for Multi-Agent Behavior Analysis using Self-Supervision | Daniel Khalil et.al. | 2409.09455 | null |
2024-09-14 | Initial Error Affection and Error Correction in Linear Quadratic Mean Field Games under Erroneous Initial Information | Yuxin Jin et.al. | 2409.09375 | null |
2024-09-14 | The (n,k) game with heterogeneous agents | Hsin-Lun Li et.al. | 2409.09364 | null |
2024-09-14 | PeriGuru: A Peripheral Robotic Mobile App Operation Assistant based on GUI Image Understanding and Prompting with LLM | Kelin Fu et.al. | 2409.09354 | link |
2024-09-14 | Enhancing Decision-Making for LLM Agents via Step-Level Q-Value Models | Yuanzhao Zhai et.al. | 2409.09345 | null |
2024-09-14 | Capability Augmentation for Heterogeneous Dynamic Teaming with Temporal Logic Tasks | Carter Berlind et.al. | 2409.09285 | null |
2024-09-14 | Python Symbolic Execution with LLM-powered Code Generation | Wenhan Wang et.al. | 2409.09271 | null |
2024-09-14 | High-Fidelity Data-Driven Dynamics Model for Reinforcement Learning-based Magnetic Control in HL-3 Tokamak | Niannian Wu et.al. | 2409.09238 | null |
2024-09-19 | Curricula for Learning Robust Policies with Factored State Representations in Changing Environments | Panayiotis Panayiotou et.al. | 2409.09169 | null |
2024-09-13 | Measure Preserving Flows for Ergodic Search in Convoluted Environments | Albert Xu et.al. | 2409.09164 | null |
2024-09-08 | ELMS: Elasticized Large Language Models On Mobile Devices | Wangsong Yin et.al. | 2409.09071 | null |
2024-09-13 | The unknotting number, hard unknot diagrams, and reinforcement learning | Taylor Applebaum et.al. | 2409.09032 | null |
2024-09-13 | Optically-Validated Microvascular Phantom for Super-Resolution Ultrasound Imaging | Jaime Parra Raad et.al. | 2409.09031 | null |
2024-09-13 | Agents in Software Engineering: Survey, Landscape, and Vision | Yanxian Huang et.al. | 2409.09030 | link |
2024-09-13 | AI-LieDar: Examine the Trade-off Between Utility and Truthfulness in LLM Agents | Zhe Su et.al. | 2409.09013 | null |
2024-09-13 | Mechanism Design for Extending the Accessibility of Facilities | Hau Chan et.al. | 2409.08993 | null |
2024-09-13 | Safeguarding Decentralized Social Media: LLM Agents for Automating Community Rule Compliance | Lucio La Cava et.al. | 2409.08963 | null |
2024-09-13 | Emerging Reliance Behaviors in Human-AI Text Generation: Hallucinations, Data Quality Assessment, and Cognitive Forcing Functions | Zahra Ashktorab et.al. | 2409.08937 | null |
2024-09-13 | Farmer.Chat: Scaling AI-Powered Agricultural Services for Smallholder Farmers | Namita Singh et.al. | 2409.08916 | null |
2024-09-13 | Exploring Action-Centric Representations Through the Lens of Rate-Distortion Theory | Miguel de Llanza Varona et.al. | 2409.08892 | null |
2024-09-13 | Using The Concept Hierarchy for Household Action Recognition | Andrei Costinescu et.al. | 2409.08853 | null |
2024-09-13 | Deep reinforcement learning for tracking a moving target in jellyfish-like swimming | Yihao Chen et.al. | 2409.08815 | null |
2024-09-13 | Mutual Theory of Mind in Human-AI Collaboration: An Empirical Study with LLM-driven AI Agents in a Real-time Shared Workspace Task | Shao Zhang et.al. | 2409.08811 | null |
2024-09-13 | HOLA-Drone: Hypergraphic Open-ended Learning for Zero-Shot Multi-Drone Cooperative Pursuit | Yang Li et.al. | 2409.08767 | null |
2024-09-13 | Fusing Dynamics Equation: A Social Opinions Prediction Algorithm with LLM-based Agents | Junchi Yao et.al. | 2409.08717 | null |
2024-09-13 | Systematic analysis of requirements for socially acceptable service robots | Andrea Ruo et.al. | 2409.08677 | null |
2024-09-13 | Average Consensus over Directed Networks in Open Multi-Agent Systems with Acknowledgement Feedback | Evagoras Makridis et.al. | 2409.08634 | null |
2024-09-13 | Generalization of Gershgorin's theorem. Analysis and design of control laws | Igor Furtat et.al. | 2409.08576 | null |
2024-09-13 | Expediting and Elevating Large Language Model Reasoning via Hidden Chain-of-Thought Decoding | Tianqiao Liu et.al. | 2409.08561 | null |
2024-09-16 | Can AI Prompt Humans? Multimodal Agents Prompt Players' Game Actions and Show Consequences to Raise Sustainability Awareness | Qinshi Zhang et.al. | 2409.08486 | null |
2024-09-13 | A BERT-Based Summarization approach for depression detection | Hossein Salahshoor Gavalan et.al. | 2409.08483 | null |
2024-09-12 | A Surveillance Game between a Differential Drive Robot and an Omnidirectional Agent: The Case of a Faster Evader | Rodrigo Saavedra et.al. | 2409.08414 | null |
2024-09-12 | Sequential Discrete Action Selection via Blocking Conditions and Resolutions | Liam Merz Hoffmeister et.al. | 2409.08410 | null |
2024-09-12 | Knowledge Tagging with Large Language Model based Multi-Agent System | Hang Li et.al. | 2409.08406 | null |
2024-09-12 | Self-Supervised Inference of Agents in Trustless Environments | Vladyslav Larin et.al. | 2409.08386 | null |
2024-09-12 | An Experimental Study of Competitive Market Behavior Through LLMs | Jingru Jia et.al. | 2409.08357 | null |
2024-09-13 | Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale | Rogerio Bonatti et.al. | 2409.08264 | link |
2024-09-12 | How can the tragedy of the commons be prevented?: Introducing Linear Quadratic Mixed Mean Field Games | Gokce Dayanikli et.al. | 2409.08235 | null |
2024-09-12 | Linear Complementary Dual Codes Constructed from Reinforcement Learning | Yansheng Wu et.al. | 2409.08114 | null |
2024-09-12 | MosquitoMiner: A Light Weight Rover for Detecting and Eliminating Mosquito Breeding Sites | Md. Adnanul Islam et.al. | 2409.08078 | link |
2024-09-13 | Learning Communities from Equilibria of Nonlinear Opinion Dynamics | Yu Xing et.al. | 2409.08004 | null |
2024-09-12 | Autonomous Vehicle Controllers From End-to-End Differentiable Simulation | Asen Nachkov et.al. | 2409.07965 | null |
2024-09-12 | WirelessAgent: Large Language Model Agents for Intelligent Wireless Networks | Jingwen Tong et.al. | 2409.07964 | link |
2024-09-12 | Covariance Intersection-based Invariant Kalman Filtering(DInCIKF) for Distributed Pose Estimation | Haoying Li et.al. | 2409.07933 | null |
2024-09-12 | Reinforcement Learning Discovers Efficient Decentralized Graph Path Search Strategies | Alexei Pisacane et.al. | 2409.07932 | null |
2024-09-12 | Tidal MerzA: Combining affective modelling and autonomous code generation through Reinforcement Learning | Elizabeth Wilson et.al. | 2409.07918 | null |
2024-09-12 | Tera-SpaceCom: GNN-based Deep Reinforcement Learning for Joint Resource Allocation and Task Offloading in TeraHertz Band Space Networks | Zhifeng Hu et.al. | 2409.07911 | null |
2024-09-12 | UNIT: Unsupervised Online Instance Segmentation through Time | Corentin Sautier et.al. | 2409.07887 | null |
2024-09-12 | Mapping Technical Safety Research at AI Companies: A literature review and incentives analysis | Oscar Delaney et.al. | 2409.07878 | null |
2024-09-12 | ReGentS: Real-World Safety-Critical Driving Scenario Generation Made Stable | Yuan Yin et.al. | 2409.07830 | null |
2024-09-12 | GateAttentionPose: Enhancing Pose Estimation with Agent Attention and Improved Gated Convolutions | Liang Feng et.al. | 2409.07798 | null |
2024-09-12 | A Spatiotemporal Stealthy Backdoor Attack against Cooperative Multi-Agent Deep Reinforcement Learning | Yinbo Yu et.al. | 2409.07775 | null |
2024-09-12 | Accelerated Multi-Time-Scale Stochastic Approximation: Optimal Complexity and Applications in Reinforcement Learning and Multi-Agent Games | Sihan Zeng et.al. | 2409.07767 | null |
2024-09-12 | Distributed Learning Dynamics Converging to the Core of |
Aya Hamed et.al. | 2409.07754 | null |
2024-09-12 | Self-similarity of temporal interaction networks arises from hyperbolic geometry with time-varying curvature | Subhabrata Dutta et.al. | 2409.07733 | link |
2024-09-12 | A Conceptual Framework for Understanding Empathy in Physics Faculty | Alia Hamdan et.al. | 2409.07724 | null |
2024-09-12 | CollaMamba: Efficient Collaborative Perception with Cross-Agent Spatial-Temporal State Space Model | Yang Li et.al. | 2409.07714 | null |
2024-09-12 | DSBench: How Far Are Data Science Agents to Becoming Data Science Experts? | Liqiang Jing et.al. | 2409.07703 | link |
2024-09-11 | SimulBench: Evaluating Language Models with Creative Simulation Tasks | Qi Jia et.al. | 2409.07641 | null |
2024-09-11 | HERL: Tiered Federated Learning with Adaptive Homomorphic Encryption using Reinforcement Learning | Jiaxang Tang et.al. | 2409.07631 | null |
2024-09-11 | A Survey of Inverse Constrained Reinforcement Learning: Definitions, Progress and Challenges | Guiliang Liu et.al. | 2409.07569 | null |
2024-09-11 | Connecting extended Wigner's friend arguments and noncontextuality | Laurens Walleghem et.al. | 2409.07537 | null |
2024-09-13 | MoA is All You Need: Building LLM Research Team using Mixture of Agents | Sandy Chen et.al. | 2409.07487 | null |
2024-09-04 | MarS: a Financial Market Simulation Engine Powered by Generative Foundation Model | Junjie Li et.al. | 2409.07486 | null |
2024-09-11 | "My Grade is Wrong!": A Contestable AI Framework for Interactive Feedback in Evaluating Student Essays | Shengxin Hong et.al. | 2409.07453 | null |
2024-09-11 | SUPER: Evaluating Agents on Setting Up and Executing Tasks from Research Repositories | Ben Bogin et.al. | 2409.07440 | link |
2024-09-11 | Agent Workflow Memory | Zora Zhiruo Wang et.al. | 2409.07429 | link |
2024-09-11 | Hierarchical Reinforcement Learning for Temporal Abstraction of Listwise Recommendation | Luo Ji et.al. | 2409.07416 | null |
2024-09-11 | A Contrastive Symmetric Forward-Forward Algorithm (SFFA) for Continual Learning Tasks | Erik B. Terres-Escudero et.al. | 2409.07387 | null |
2024-09-11 | Policy consequences of the new neuroeconomic framework | A. David Redish et.al. | 2409.07373 | null |
2024-09-11 | Online Decision MetaMorphFormer: A Casual Transformer-Based Reinforcement Learning Framework of Universal Embodied Intelligence | Luo Ji et.al. | 2409.07341 | null |
2024-09-11 | Explanation, Debate, Align: A Weak-to-Strong Framework for Language Model Generalization | Mehrdad Zakershahrak et.al. | 2409.07335 | null |
2024-09-11 | Using Generative Agents to Create Tip Sheets for Investigative Data Reporting | Joris Veerbeek et.al. | 2409.07286 | null |
2024-09-11 | Multi-Type Preference Learning: Empowering Preference-Based Reinforcement Learning with Equal Preferences | Ziang Liu et.al. | 2409.07268 | null |
2024-09-11 | Cross-Dialect Text-To-Speech in Pitch-Accent Language Incorporating Multi-Dialect Phoneme-Level BERT | Kazuki Yamauchi et.al. | 2409.07265 | null |
2024-09-11 | Propaganda to Hate: A Multimodal Analysis of Arabic Memes with Multi-Agent LLMs | Firoj Alam et.al. | 2409.07246 | null |
2024-09-11 | A Perspective on AI-Guided Molecular Simulations in VR: Exploring Strategies for Imitation Learning in Hyperdimensional Molecular Systems | Mohamed Dhouioui et.al. | 2409.07189 | null |
2024-09-11 | Identify Design Problems Through Questioning: Exploring Role-playing Interactions with Large Language Models to Foster Design Questioning Skills | Hyunseung Lim et.al. | 2409.07178 | null |
2024-09-11 | Learning Efficient Recursive Numeral Systems via Reinforcement Learning | Jonathan D. Thomas et.al. | 2409.07170 | null |
2024-09-11 | Randomized Strategic Facility Location with Predictions | Eric Balkanski et.al. | 2409.07142 | null |
2024-09-11 | MVLLaVA: An Intelligent Agent for Unified and Flexible Novel View Synthesis | Hanyu Jiang et.al. | 2409.07129 | null |
2024-09-11 | DCMAC: Demand-aware Customized Multi-Agent Communication via Upper Bound Training | Dongkun Huo et.al. | 2409.07127 | null |
2024-09-17 | Inefficient Alliance Formation in Coalitional Blotto Games | Vade Shah et.al. | 2409.06899 | null |
2024-09-10 | A Quality Diversity Approach to Automatically Generate Multi-Agent Path Finding Benchmark Maps | Cheng Qian et.al. | 2409.06888 | null |
2024-09-10 | Can Agents Spontaneously Form a Society? Introducing a Novel Architecture for Generative Multi-Agents to Elicit Social Emergence | H. Zhang et.al. | 2409.06750 | null |
2024-09-19 | Decentralized Neural Networks for Robust and Scalable Eigenvalue Computation | Ronald Katende et.al. | 2409.06746 | null |
2024-09-10 | Memory and Personality in Ideological Polarization: The Politico-physics of Mnemomatter | Shengkai Li et.al. | 2409.06660 | null |
2024-09-10 | Fixed-budget and Multiple-issue Quadratic Voting | Laura Georgescu et.al. | 2409.06614 | null |
2024-09-10 | On Epistemic Properties in Discrete-Event Systems: A Uniform Framework and Its Applications | Bohan Cui et.al. | 2409.06588 | null |
2024-09-10 | Think-on-Process: Dynamic Process Generation for Collaborative Development of Multi-Agent System | Leilei Lin et.al. | 2409.06568 | link |
2024-09-10 | Indirect Dynamic Negotiation in the Nash Demand Game | Tatiana V. Guy et.al. | 2409.06566 | null |
2024-09-10 | Social Mediation through Robots -- A Scoping Review on Improving Group Interactions through Directed Robot Action using an Extended Group Process Model | Thomas H. Weisswange et.al. | 2409.06557 | null |
2024-09-10 | Coordinated Motion Planning: Multi-Agent Path Finding in a Densely Packed, Bounded Domain | Sándor P. Fekete et.al. | 2409.06486 | null |
2024-09-10 | Learning Generative Interactive Environments By Trained Agent Exploration | Naser Kazemi et.al. | 2409.06445 | link |
2024-09-10 | Position Fair Mechanisms Allocating Indivisible Goods | Ryoga Mahara et.al. | 2409.06423 | null |
2024-09-10 | Exploring the Integration of Large Language Models in Industrial Test Maintenance Processes | Ludvig Lemner et.al. | 2409.06416 | null |
2024-09-10 | MAGDA: Multi-agent guideline-driven diagnostic assistance | David Bani-Harouni et.al. | 2409.06351 | null |
2024-09-17 | Foragax: An Agent-Based Modelling Framework Based on JAX | Siddharth Chaturvedi et.al. | 2409.06345 | link |
2024-09-10 | Towards Agentic AI on Particle Accelerators | Antonin Sulc et.al. | 2409.06336 | null |
2024-09-11 | Modified Meta-Thompson Sampling for Linear Bandits and Its Bayes Regret Analysis | Hao Li et.al. | 2409.06329 | null |
2024-09-10 | Automate Strategy Finding with LLM in Quant investment | Zhizhuo Kou et.al. | 2409.06289 | null |
2024-09-10 | Evidence gathering under competitive and noncompetitive rewards | Philip Brookins et.al. | 2409.06248 | null |
2024-09-10 | INTRA: Interaction Relationship-aware Weakly Supervised Affordance Grounding | Ji Ha Jang et.al. | 2409.06210 | null |
2024-09-11 | A Policy Iteration Method for Inverse Mean Field Games | Kui Ren et.al. | 2409.06184 | null |
2024-09-10 | Contrastive Federated Learning with Tabular Data Silos | Achmad Ginanjar et.al. | 2409.06123 | null |
2024-09-14 | ClarQ-LLM: A Benchmark for Models Clarifying and Requesting Information in Task-Oriented Dialog | Yujian Gan et.al. | 2409.06097 | link |
2024-09-09 | Coarse Descriptions and Cautious Preferences | Evan Piermont et.al. | 2409.06054 | null |
2024-09-09 | When Learning Meets Dynamics: Distributed User Connectivity Maximization in UAV-Based Communication Networks | Bowei Li et.al. | 2409.06010 | null |
2024-09-09 | Promptable Closed-loop Traffic Simulation | Shuhan Tan et.al. | 2409.05863 | null |
2024-09-15 | MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct | Run Luo et.al. | 2409.05840 | null |
2024-09-09 | Cooperative Decision-Making for CAVs at Unsignalized Intersections: A MARL Approach with Attention and Hierarchical Game Priors | Jiaqi Liu et.al. | 2409.05712 | null |
2024-09-09 | StratXplore: Strategic Novelty-seeking and Instruction-aligned Exploration for Vision and Language Navigation | Muraleekrishna Gopinathan et.al. | 2409.05593 | null |
2024-09-09 | Interpretable Responsibility Sharing as a Heuristic for Task and Motion Planning | Arda Sarp Yenicesu et.al. | 2409.05586 | link |
2024-09-09 | SciAgents: Automating scientific discovery through multi-agent intelligent graph reasoning | Alireza Ghafarollahi et.al. | 2409.05556 | link |
2024-09-09 | Seeing is Believing? Enhancing Vision-Language Navigation using Visual Perturbations | Xuesong Zhang et.al. | 2409.05552 | null |
2024-09-09 | A refined Frauchiger--Renner paradox based on strong contextuality | Laurens Walleghem et.al. | 2409.05491 | null |
2024-09-09 | Adaptive Multi-Layer Deployment for A Digital Twin Empowered Satellite-Terrestrial Integrated Network | Yihong Tao et.al. | 2409.05480 | null |
2024-09-09 | Reinforcement Learning for Variational Quantum Circuits Design | Simone Foderà et.al. | 2409.05475 | null |
2024-09-09 | Semifactual Explanations for Reinforcement Learning | Jasmina Gajcin et.al. | 2409.05435 | link |
2024-09-09 | Leveraging Computation of Expectation Models for Commonsense Affordance Estimation on 3D Scene Graphs | Mario Alberto Valdes Saucedo et.al. | 2409.05392 | null |
2024-09-09 | BAMDP Shaping: a Unified Theoretical Framework for Intrinsic Motivation and Reward Shaping | Aly Lidayan et.al. | 2409.05358 | null |
2024-09-09 | Obvious Strategy-proofness with Respect to a Partition | R. Pablo Arribillaga et.al. | 2409.05315 | null |
2024-09-09 | Distributed Robust Continuous-Time Optimization Algorithms for Time-Varying Constrained Cost | Zeinab Ebrahimi et.al. | 2409.05293 | null |
2024-09-09 | Towards Fast Rates for Federated and Multi-Task Reinforcement Learning | Feng Zhu et.al. | 2409.05291 | null |
2024-09-08 | COVID19-CBABM: A City-Based Agent Based Disease Spread Modeling Framework | Raunak Sarbajna et.al. | 2409.05235 | null |
2024-09-08 | Banded phases in topological flocks | Charles R. Packard et.al. | 2409.05198 | null |
2024-09-08 | Difference Between Cyclic and Distributed Approach in Stochastic Optimization for Multi-agent System | Jiahao Shi et.al. | 2409.05155 | null |
2024-09-08 | Nonlinear Cooperative Output Regulation with Input Delay Compensation | Shiqi Zheng et.al. | 2409.05113 | null |
2024-09-11 | Decentralized Control of Multi-Agent Systems Under Acyclic Spatio-Temporal Task Dependencies | Gregorio Marchesini et.al. | 2409.05106 | null |
2024-09-08 | Pareto-Optimal Peer-to-Peer Risk Sharing with Robust Distortion Risk Measures | Mario Ghossoub et.al. | 2409.05103 | null |
2024-09-08 | On final opinions of the Friedkin-Johnsen model over random graphs with partially stubborn community | Lingfei Wang et.al. | 2409.05063 | null |
2024-09-08 | Towards Multi-agent Policy-based Directed Hypergraph Learning for Traffic Signal Control | Kang Wang et.al. | 2409.05037 | null |
2024-09-08 | Cooperative Learning-Based Framework for VNF Caching and Placement Optimization over Low Earth Orbit Satellite Networks | Khai Doan et.al. | 2409.05025 | null |
2024-09-08 | A Pair Programming Framework for Code Generation via Multi-Plan Exploration and Feedback-Driven Refinement | Huan Zhang et.al. | 2409.05001 | link |
2024-09-08 | Multi-V2X: A Large Scale Multi-modal Multi-penetration-rate Dataset for Cooperative Perception | Rongsong Li et.al. | 2409.04980 | null |
2024-09-07 | DEPLOYERS: An agent based modeling tool for multi country real world data | Martin Jaraiz et.al. | 2409.04876 | null |
2024-09-07 | Adaptation Procedure in Misinformation Games | Konstantinos Varsos et.al. | 2409.04854 | null |
2024-09-07 | Context-Aware Replanning with Pre-explored Semantic Map for Object Navigation | Hung-Ting Su et.al. | 2409.04837 | null |
2024-09-07 | LMGT: Optimizing Exploration-Exploitation Balance in Reinforcement Learning through Language Model Guided Trade-offs | Yongxin Deng et.al. | 2409.04744 | null |
2024-09-07 | Algorithmic Scenario Generation as Quality Diversity Optimization | Stefanos Nikolaidis et.al. | 2409.04711 | null |
2024-09-07 | Learning Optimal Stable Matches in Decentralized Markets with Unknown Preferences | Vade Shah et.al. | 2409.04669 | null |
2024-09-10 | QueryBuilder: Human-in-the-Loop Query Development for Information Retrieval | Hemanth Kandula et.al. | 2409.04667 | null |
2024-09-06 | Stacked Universal Successor Feature Approximators for Safety in Reinforcement Learning | Ian Cannon et.al. | 2409.04641 | null |
2024-09-06 | Sparse Rewards Can Self-Train Dialogue Agents | Barrett Martin Lattimer et.al. | 2409.04617 | link |
2024-09-15 | Decentralized Learning in General-sum Markov Games | Chinmay Maheshwari et.al. | 2409.04613 | null |
2024-09-06 | Impact of Transit on Mobility, Equity, and Economy in the Chicago Metropolitan Region | Omer Verbas et.al. | 2409.04568 | null |
2024-09-03 | State and Action Factorization in Power Grids | Gianvito Losapio et.al. | 2409.04467 | null |
2024-09-03 | Here's Charlie! Realising the Semantic Web vision of Agents in the age of LLMs | Jesse Wright et.al. | 2409.04465 | null |
2024-09-06 | A Survey on Knowledge Organization Systems of Research Fields: Resources and Challenges | Angelo Salatino et.al. | 2409.04432 | null |
2024-09-06 | RLPF: Reinforcement Learning from Prediction Feedback for User Summarization with LLMs | Jiaxing Wu et.al. | 2409.04421 | null |
2024-09-06 | MATWA: A Web Toolkit for Matching under Preferences | Frederik Glitzner et.al. | 2409.04402 | null |
2024-09-06 | Cs-O |
Maximilian Herbert et.al. | 2409.04319 | null |
2024-09-06 | Safe and Efficient Path Planning under Uncertainty via Deep Collision Probability Fields | Felix Herrmann et.al. | 2409.04306 | null |
2024-09-06 | Using Large Language Models to Generate Authentic Multi-agent Knowledge Work Datasets | Desiree Heim et.al. | 2409.04286 | null |
2024-09-06 | Collective chemotactic search strategies | Hugues Meyer et.al. | 2409.04262 | null |
2024-09-06 | SPACE: A Python-based Simulator for Evaluating Decentralized Multi-Robot Task Allocation Algorithms | Inmo Jang et.al. | 2409.04230 | link |
2024-09-06 | FPT Algorithms using Minimal Parameters for a Generalized Version of Maximin Shares | Klaus Jansen et.al. | 2409.04225 | null |
2024-09-06 | Advancing Multi-Organ Disease Care: A Hierarchical Multi-Agent Reinforcement Learning Framework | Daniel J. Tan et.al. | 2409.04224 | null |
2024-09-06 | Runtime analysis of a coevolutionary algorithm on impartial combinatorial games | Alistair Benford et.al. | 2409.04177 | null |
2024-09-06 | Towards a Socially Acceptable Competitive Equilibrium in Energy Markets | Koorosh Shomalzadeh et.al. | 2409.04157 | null |
2024-09-06 | Can LLMs Generate Novel Research Ideas? A Large-Scale Human Study with 100+ NLP Researchers | Chenglei Si et.al. | 2409.04109 | link |
2024-09-06 | Tighter Analysis for Decentralized Stochastic Gradient Method: Impact of Data Homogeneity | Qiang Li et.al. | 2409.04092 | null |
2024-09-06 | Surface Patterns Shaped by Additives in Crystals | M. A. Chabowska et.al. | 2409.04084 | null |
2024-09-05 | DRAL: Deep Reinforcement Adaptive Learning for Multi-UAVs Navigation in Unknown Indoor Environment | Kangtong Mo et.al. | 2409.03930 | null |
2024-09-05 | On the Convergence Rates of Federated Q-Learning across Heterogeneous Environments | Muxing Wang et.al. | 2409.03897 | null |
2024-09-05 | Multi-agent Path Finding for Mixed Autonomy Traffic Coordination | Han Zheng et.al. | 2409.03881 | null |
2024-09-05 | PARCO: Learning Parallel Autoregressive Policies for Efficient Multi-Agent Combinatorial Optimization | Federico Berto et.al. | 2409.03811 | link |
2024-09-04 | NESTFUL: A Benchmark for Evaluating LLMs on Nested Sequences of API Calls | Kinjal Basu et.al. | 2409.03797 | null |
2024-09-13 | Safeguarding AI Agents: Developing and Analyzing Safety Architectures | Ishaan Domkundwar et.al. | 2409.03793 | null |
2024-08-31 | BreachSeek: A Multi-Agent Automated Penetration Tester | Ibrahim Alshehri et.al. | 2409.03789 | link |
2024-09-06 | RAG based Question-Answering for Contextual Response Prediction System | Sriram Veturi et.al. | 2409.03708 | null |
2024-09-05 | TRACE-cs: Trustworthy Reasoning for Contrastive Explanations in Course Scheduling Problems | Stylianos Loukas Vasileiou et.al. | 2409.03671 | null |
2024-09-06 | LLM-based multi-agent poetry generation in non-cooperative environments | Ran Zhang et.al. | 2409.03659 | link |
2024-09-05 | A Complete Landscape of EFX Allocations of Mixed Manna on Graphs | Yu Zhou et.al. | 2409.03594 | null |
2024-09-05 | CHIRPs: Change-Induced Regret Proxy metrics for Lifelong Reinforcement Learning | John Birkbeck et.al. | 2409.03577 | null |
2024-09-05 | From MOOC to MAIC: Reshaping Online Teaching and Learning through LLM-driven Agents | Jifan Yu et.al. | 2409.03512 | null |
2024-09-05 | Rx Strategist: Prescription Verification using LLM Agents System | Phuc Phan Van et.al. | 2409.03440 | null |
2024-09-05 | Reinforcement Learning Approach to Optimizing Profilometric Sensor Trajectories for Surface Inspection | Sara Roos-Hoefgeest et.al. | 2409.03429 | null |
2024-09-05 | Game On: Towards Language Models as RL Experimenters | Jingwei Zhang et.al. | 2409.03402 | null |
2024-09-05 | ELO-Rated Sequence Rewards: Advancing Reinforcement Learning Models | Qi Ju et.al. | 2409.03301 | link |
2024-09-05 | Robust synchronization and policy adaptation for networked heterogeneous agents | Miguel F. Arevalo-Castiblanco et.al. | 2409.03273 | null |
2024-09-05 | GraphInsight: Unlocking Insights in Large Language Models for Graph Structure Understanding | Yukun Cao et.al. | 2409.03258 | null |
2024-09-05 | E2CL: Exploration-based Error Correction Learning for Embodied Agents | Hanlin Wang et.al. | 2409.03256 | null |
2024-09-05 | Improving agent performance in fluid environments by perceptual pretraining | Jin Zhang et.al. | 2409.03230 | null |
2024-09-05 | xLAM: A Family of Large Action Models to Empower AI Agent Systems | Jianguo Zhang et.al. | 2409.03215 | link |
2024-09-05 | Predefined-time distributed non-convex optimization via a time-base generator | Qinlong Lin et.al. | 2409.03188 | null |
2024-09-11 | Continual Skill and Task Learning via Dialogue | Weiwei Gu et.al. | 2409.03166 | null |
2024-09-04 | Subsidy design for better social outcomes | Maria-Florina Balcan et.al. | 2409.03129 | null |
2024-09-04 | RoboKoop: Efficient Control Conditioned Representations from Visual Input in Robotics using Koopman Operator | Hemant Kumawat et.al. | 2409.03107 | null |
2024-09-04 | Resilient Two-Time-Scale Local Stochastic Gradient Descent for Byzantine Federated Learning | Amit Dutta et.al. | 2409.03092 | null |
2024-09-04 | An Introduction to Centralized Training for Decentralized Execution in Cooperative Multi-Agent Reinforcement Learning | Christopher Amato et.al. | 2409.03052 | null |
2024-09-04 | Large Language Model-Based Agents for Software Engineering: A Survey | Junwei Liu et.al. | 2409.02977 | link |
2024-09-03 | Managing multiple agents by automatically adjusting incentives | Shunichi Akatsuka et.al. | 2409.02960 | null |
2024-09-04 | LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via Hybrid Architecture | Xidong Wang et.al. | 2409.02889 | link |
2024-09-04 | Bioinformatics Retrieval Augmentation Data (BRAD) Digital Assistant | Joshua Pickard et.al. | 2409.02864 | null |
2024-09-06 | Language Understanding as a Constraint on Consensus Size in LLM Societies | Giordano De Marzo et.al. | 2409.02822 | null |
2024-09-04 | Ion-specific Stability of Gold Nanoparticle Suspensions | Philipp Ritzert et.al. | 2409.02762 | null |
2024-09-04 | Adaptive Formation Learning Control for Cooperative AUVs under Complete Uncertainty | Emadodin Jandaghi et.al. | 2409.02745 | null |
2024-09-04 | Creating a Gen-AI based Track and Trace Assistant MVP (SuperTracy) for PostNL | Mohammad Reshadati et.al. | 2409.02711 | null |
2024-09-04 | Decision Transformer for Enhancing Neural Local Search on the Job Shop Scheduling Problem | Constantin Waubert de Puiseau et.al. | 2409.02697 | null |
2024-09-04 | Generalized Individual Q-learning for Polymatrix Games with Partial Observations | Ahmed Said Donmez et.al. | 2409.02663 | null |
2024-09-04 | A Survey on Emergent Language | Jannik Peters et.al. | 2409.02645 | null |
2024-09-04 | Evaluating Environments Using Exploratory Agents | Bobby Khaleque et.al. | 2409.02632 | null |
2024-09-04 | Advancing Cyber Incident Timeline Analysis Through Rule Based AI and Large Language Models | Fatma Yasmine Loumachi et.al. | 2409.02572 | null |
2024-09-04 | Vision-Language Navigation with Continual Learning | Zhiyuan Li et.al. | 2409.02561 | null |
2024-09-05 | A Sequential Decision-Making Model for Perimeter Identification | Ayal Taitler et.al. | 2409.02549 | null |
2024-09-04 | Astrochemistry on Galactic scales | L. Colzi et.al. | 2409.02537 | null |
2024-09-04 | Cog-GA: A Large Language Models-based Generative Agent for Vision-Language Navigation in Continuous Environments | Zhiyuan Li et.al. | 2409.02522 | null |
2024-09-04 | Continual Diffuser (CoD): Mastering Continual Offline Reinforcement Learning with Experience Rehearsal | Jifeng Hu et.al. | 2409.02512 | link |
2024-09-04 | Occlusion-Based Cooperative Transport for Concave Objects with a Swarm of Miniature Mobile Robots | Sanjuksha Nirgude et.al. | 2409.02436 | null |
2024-09-04 | Context-Aware Agent-based Model for Smart Long Distance Transport System | Muhammad Raees et.al. | 2409.02434 | null |
2024-09-04 | Building Math Agents with Multi-Turn Iterative Preference Learning | Wei Xiong et.al. | 2409.02392 | null |
2024-09-04 | Multi-modal Situated Reasoning in 3D Scenes | Xiongkun Linghu et.al. | 2409.02389 | null |
2024-09-04 | Neighbourhood conditions for network stability with link uncertainty | Simone Mariano et.al. | 2409.02350 | null |
2024-09-03 | Kinesthetic Teaching in Robotics: a Mixed Reality Approach | Simone Macci`o et.al. | 2409.02305 | null |
2024-09-03 | Multi-Agent Reinforcement Learning for Joint Police Patrol and Dispatch | Matthew Repasky et.al. | 2409.02246 | null |
2024-09-02 | AutoEncoder Convolutional Neural Network for Pneumonia Detection | Michael Nosa-Omoruyi et.al. | 2409.02142 | null |
2024-09-01 | TrajWeaver: Trajectory Recovery with State Propagation Diffusion Model | Jinming Wang et.al. | 2409.02124 | null |
2024-09-05 | Noise-free comparison of stochastic agent-based simulations using common random numbers | Daniel J. Klein et.al. | 2409.02086 | null |
2024-09-03 | A Modern Take on Visual Relationship Reasoning for Grasp Planning | Paolo Rabino et.al. | 2409.02035 | null |
2024-09-03 | Optimal allocations with capacity constrained verification | Albin Erlanson et.al. | 2409.02031 | null |
2024-09-03 | Planning to avoid ambiguous states through Gaussian approximations to non-linear sensors in active inference agents | Wouter M. Kouw et.al. | 2409.01974 | null |
2024-09-03 | Snapshot: Towards Application-centered Models for Pedestrian Trajectory Prediction in Urban Traffic Environments | Nico Uhlemann et.al. | 2409.01971 | null |
2024-09-03 | Achieving Maximin Share and EFX/EF1 Guarantees Simultaneously | Hannaneh Akrami et.al. | 2409.01963 | null |
2024-09-03 | Learning Resilient Formation Control of Drones with Graph Attention Network | Jiaping Xiao et.al. | 2409.01953 | null |
2024-09-03 | From Grounding to Planning: Benchmarking Bottlenecks in Web Agents | Segev Shlomov et.al. | 2409.01927 | null |
2024-09-03 | Focus Agent: LLM-Powered Virtual Focus Group | Taiyu Zhang et.al. | 2409.01907 | null |
2024-09-03 | What are the Essential Factors in Crafting Effective Long Context Multi-Hop Instruction Datasets? Insights and Best Practices | Zhi Chen et.al. | 2409.01893 | link |
2024-09-03 | AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction | Yuchen Shi et.al. | 2409.01854 | link |
2024-09-03 | Segmenting Object Affordances: Reproducibility and Sensitivity to Scale | Tommaso Apicella et.al. | 2409.01814 | link |
2024-09-03 | Empirical evidence of Large Language Model's influence on human spoken communication | Hiromu Yakura et.al. | 2409.01754 | null |
2024-09-03 | 4D-CAT: Synthesis of 4D Coronary Artery Trees from Systole and Diastole | Daosong Hu et.al. | 2409.01725 | null |
2024-09-03 | VProChart: Answering Chart Question through Visual Perception Alignment Agent and Programmatic Solution Reasoning | Muye Huang et.al. | 2409.01667 | null |
2024-09-03 | T1-contrast Enhanced MRI Generation from Multi-parametric MRI for Glioma Patients with Latent Tumor Conditioning | Zach Eidex et.al. | 2409.01622 | null |
2024-09-03 | A Time-Intensity Aware Pipeline for Generating Late-Stage Breast DCE-MRI using Generative Adversarial Models | Ruben D. Fonnegra et.al. | 2409.01596 | null |
2024-09-03 | Convergence of the Heterogeneous Deffuant-Weisbuch Model: A Complete Proof and Some Extensions | Ge Chen et.al. | 2409.01593 | null |
2024-09-03 | An Implementation of Werewolf Agent That does not Truly Trust LLMs | Takehiro Sato et.al. | 2409.01575 | null |
2024-09-03 | Purification-Agnostic Proxy Learning for Agentic Copyright Watermarking against Adversarial Evidence Forgery | Erjin Bao et.al. | 2409.01541 | null |
2024-09-03 | Bridging the Gap Between Central and Local Decision-Making: The Efficacy of Collaborative Equilibria in Altruistic Congestion Games | Bryce L Ferguson et.al. | 2409.01525 | null |
2024-09-02 | The Compressor-Retriever Architecture for Language Model OS | Yuan Yang et.al. | 2409.01495 | link |
2024-09-02 | Watermarking of Quantum Circuits | Rupshali Roy et.al. | 2409.01484 | null |
2024-09-02 | Irreversible investment under weighted discounting: effects of decreasing impatience | Pengyu Wei et.al. | 2409.01478 | null |
2024-09-02 | Real-Time Recurrent Learning using Trace Units in Reinforcement Learning | Esraa Elelimy et.al. | 2409.01449 | null |
2024-09-02 | Performance-Aware Self-Configurable Multi-Agent Networks: A Distributed Submodular Approach for Simultaneous Coordination and Network Design | Zirui Xu et.al. | 2409.01411 | link |
2024-09-02 | GenAgent: Build Collaborative AI Systems with Automated Workflow Generation -- Case Studies on ComfyUI | Xiangyuan Xue et.al. | 2409.01392 | null |
2024-09-02 | Modeling contagious disease spreading | Dipak Patra et.al. | 2409.01103 | null |
2024-09-02 | Two-Timescale Synchronization and Migration for Digital Twin Networks: A Multi-Agent Deep Reinforcement Learning Approach | Wenshuai Liu et.al. | 2409.01092 | null |
2024-09-02 | Learning in Hybrid Active Inference Models | Poppy Collis et.al. | 2409.01066 | null |
2024-09-02 | Multiagent Reinforcement Learning Enhanced Decision-making of Crew Agents During Floor Construction Process | Bin Yang et.al. | 2409.01060 | null |
2024-09-02 | Accelerated Multi-objective Task Learning using Modified Q-learning Algorithm | Varun Prakash Rajamohan et.al. | 2409.01046 | null |
2024-09-02 | Federated Deep Reinforcement Learning-Based Intelligent Channel Access in Dense Wi-Fi Deployments | Xinyang Du et.al. | 2409.01004 | null |
2024-09-02 | Evolution of Social Norms in LLM Agents using Natural Language | Ilya Horiguchi et.al. | 2409.00993 | null |
2024-09-02 | Co-Learning: Code Learning for Multi-Agent Reinforcement Collaborative Framework with Conversational Natural Language Interfaces | Jiapeng Yu et.al. | 2409.00985 | link |
2024-09-02 | Semantically Controllable Augmentations for Generalizable Robot Learning | Zoey Chen et.al. | 2409.00951 | null |
2024-09-02 | Distributed Optimization under Edge Agreement with Application in Battery Network Management | Zehui Lu et.al. | 2409.00936 | null |
2024-09-02 | ToolACE: Winning the Points of LLM Function Calling | Weiwen Liu et.al. | 2409.00920 | null |
2024-09-04 | MarsCode Agent: AI-native Automated Bug Fixing | Yizhou Liu et.al. | 2409.00899 | null |
2024-09-02 | Whole-Body Control Through Narrow Gaps From Pixels To Action | Tianyue Wu et.al. | 2409.00895 | null |
2024-09-01 | Self-evolving Agents with reflective and memory-augmented abilities | Xuechen Liang et.al. | 2409.00872 | null |
2024-09-01 | JaxLife: An Open-Ended Agentic Simulator | Chris Lu et.al. | 2409.00853 | link |
2024-09-01 | Satisficing Equilibrium | Bary S. R. Pradelski et.al. | 2409.00832 | null |
2024-09-01 | Digital Homunculi: Reimagining Democracy Research with Generative Agents | Petr Specian et.al. | 2409.00826 | null |
2024-09-01 | Accelerating Hybrid Agent-Based Models and Fuzzy Cognitive Maps: How to Combine Agents who Think Alike? | Philippe J. Giabbanelli et.al. | 2409.00824 | null |
2024-09-01 | Cooperative Path Planning with Asynchronous Multiagent Reinforcement Learning | Jiaming Yin et.al. | 2409.00754 | null |
2024-09-01 | Simulation of Social Media-Driven Bubble Formation in Financial Markets using an Agent-Based Model with Hierarchical Influence Network | Gonzalo Bohorquez et.al. | 2409.00742 | link |
2024-09-01 | Fair Reciprocal Recommendation in Matching Markets | Yoji Tomita et.al. | 2409.00720 | link |
2024-09-04 | Multi-Agent Reinforcement Learning from Human Feedback: Data Coverage and Algorithmic Techniques | Natalia Zhang et.al. | 2409.00717 | null |
2024-09-01 | Universal Finite-State and Self-Stabilizing Computation in Anonymous Dynamic Networks | Giuseppe A. Di Luna et.al. | 2409.00688 | null |
2024-09-01 | A Learnable Agent Collaboration Network Framework for Personalized Multimodal AI Search Engine | Yunxiao Shi et.al. | 2409.00636 | null |
2024-09-01 | Roundabout Dilemma Zone Data Mining and Forecasting with Trajectory Prediction and Graph Neural Networks | Manthan Chelenahalli Satish et.al. | 2409.00622 | null |
2024-09-01 | TinyAgent: Function Calling at the Edge | Lutfi Eren Erdogan et.al. | 2409.00608 | null |
2024-09-01 | Average-case optimization analysis for distributed consensus algorithms on regular graphs | Nhat Trung Nguyen et.al. | 2409.00605 | null |
2024-09-04 | GenAI-powered Multi-Agent Paradigm for Smart Urban Mobility: Opportunities and Challenges for Integrating Large Language Models (LLMs) and Retrieval-Augmented Generation (RAG) with Intelligent Transportation Systems | Haowen Xu et.al. | 2409.00494 | null |
2024-08-31 | Online Learning of Interaction Dynamics with Dual Model Predictive Control for Multi-Agent Systems Using Gaussian Processes | T. M. J. T. Baltussen et.al. | 2409.00432 | null |
2024-08-31 | Chatting Up Attachment: Using LLMs to Predict Adult Bonds | Paulo Soares et.al. | 2409.00347 | null |
2024-08-29 | PrivacyLens: Evaluating Privacy Norm Awareness of Language Models in Action | Yijia Shao et.al. | 2409.00138 | link |
2024-08-29 | HoneyComb: A Flexible LLM-Based Agent System for Materials Science | Huan Zhang et.al. | 2409.00135 | null |
2024-08-29 | MAPF-GPT: Imitation Learning for Multi-Agent Pathfinding at Scale | Anton Andreychuk et.al. | 2409.00134 | link |
2024-08-27 | Evaluating the Impact of Multiple DER Aggregators on Wholesale Energy Markets: A Hybrid Mean Field Approach | Jun He et.al. | 2409.00107 | null |
2024-08-27 | Collective Predictive Coding as Model of Science: Formalizing Scientific Activities Towards Generative Science | Tadahiro Taniguchi et.al. | 2409.00102 | null |
2024-08-27 | Modelisation a base d'Agent Augmentes par LLM pour les Simulations Sociales: Defis et Opportunites | Önder Gürcan et.al. | 2409.00100 | null |
2024-08-24 | Towards Human-Level Understanding of Complex Process Engineering Schematics: A Pedagogical, Introspective Multi-Agent Framework for Open-Domain Question Answering | Sagar Srinivas Sakhinana et.al. | 2409.00082 | null |
2024-08-30 | Robust Technology Regulation | Andrew Koh et.al. | 2408.17398 | null |
2024-08-30 | Traffic expertise meets residual RL: Knowledge-informed model-based residual reinforcement learning for CAV trajectory control | Zihao Sheng et.al. | 2408.17380 | link |
2024-08-30 | EMPOWER: Embodied Multi-role Open-vocabulary Planning with Online Grounding and Execution | Francesco Argenziano et.al. | 2408.17379 | null |
2024-08-30 | Non-reciprocal spin-glass transition and aging | Giulia Garcia Lorenzana et.al. | 2408.17360 | null |
2024-08-30 | Why do elites extend property rights: unlocking investment and the switch to public goods | Alastair Langtry et.al. | 2408.17335 | null |
2024-08-30 | All You Need is Group Actions: Advancing Robust Autonomous Planning | Vincenzo Basco et.al. | 2408.17295 | null |
2024-08-30 | Predicting the Impact of Generative AI Using an Agent-Based Model | Joao Tiago Aparicio et.al. | 2408.17268 | null |
2024-08-30 | Using Quantum Solved Deep Boltzmann Machines to Increase the Data Efficiency of RL Agents | Daniel Kent et.al. | 2408.17240 | null |
2024-08-30 | Asynchronous Distributed Learning with Quantized Finite-Time Coordination | Nicola Bastianello et.al. | 2408.17156 | null |
2024-08-30 | Efficient Camera Exposure Control for Visual Odometry via Deep Reinforcement Learning | Shuyang Zhang et.al. | 2408.17005 | link |
2024-08-30 | Characterizing User Platforms for Video Streaming in Broadband Networks | Yifan Wang et.al. | 2408.16995 | link |
2024-08-30 | Tool-Assisted Agent on SQL Inspection and Refinement in Real-World Scenarios | Zhongyuan Wang et.al. | 2408.16991 | null |
2024-08-30 | Beyond Preferences in AI Alignment | Tan Zhi-Xuan et.al. | 2408.16984 | null |
2024-08-30 | The Sample-Communication Complexity Trade-off in Federated Q-Learning | Sudeep Salgia et.al. | 2408.16981 | null |
2024-08-30 | Discovery of False Data Injection Schemes on Frequency Controllers with Reinforcement Learning | Romesh Prasad et.al. | 2408.16958 | null |
2024-08-29 | Robotic warehousing operations: a learn-then-optimize approach to large-scale neighborhood search | Cynthia Barnhart et.al. | 2408.16890 | null |
2024-08-29 | Learning Multi-agent Multi-machine Tending by Mobile Robots | Abdalwhab Abdalwhab et.al. | 2408.16875 | null |
2024-08-29 | A framework for training and benchmarking algorithms that schedule robot tasks | Wojciech Dudek et.al. | 2408.16844 | null |
2024-08-29 | AdapShare: An RL-Based Dynamic Spectrum Sharing Solution for O-RAN | Sneihil Gopal et.al. | 2408.16842 | null |
2024-08-29 | A Bibliometric Analysis of Trust in Conversational Agents over the Past Fifteen Years | Meltem Aksoy et.al. | 2408.16837 | null |
2024-08-29 | Maelstrom Networks | Matthew Evanusa et.al. | 2408.16632 | null |
2024-08-29 | On the data-sparsity of the solution of Riccati equations with applications to feedback control | Stefano Massei et.al. | 2408.16569 | null |
2024-08-29 | CooTest: An Automated Testing Approach for V2X Communication Systems | An Guo et.al. | 2408.16470 | null |
2024-08-29 | Consensus Planning with Primal, Dual, and Proximal Agents | Alvaro Maggiar et.al. | 2408.16462 | null |
2024-08-29 | 3D Topological Modeling and Multi-Agent Movement Simulation for Viral Infection Risk Analysis | Wassim Jabi et.al. | 2408.16417 | null |
2024-09-04 | Efficient Multi-agent Navigation with Lightweight DRL Policy | Xingrong Diao et.al. | 2408.16370 | null |
2024-08-29 | Guided Reasoning: A Non-Technical Introduction | Gregor Betz et.al. | 2408.16331 | link |
2024-08-29 | Autocorrelation properties of temporal networks governed by dynamic node variables | Harrison Hartle et.al. | 2408.16270 | null |
2024-08-29 | Action potential dynamics on heterogenous neural networks: from kinetic to macroscopic equations | Marzia Bisi et.al. | 2408.16214 | null |
2024-08-28 | DECAF: a Discrete-Event based Collaborative Human-Robot Framework for Furniture Assembly | Giulio Giacomuzzo et.al. | 2408.16125 | null |
2024-08-28 | EPO: Hierarchical LLM Agents with Environment Preference Optimization | Qi Zhao et.al. | 2408.16090 | null |
2024-08-28 | Logic-Enhanced Language Model Agents for Trustworthy Social Simulations | Agnieszka Mensfelt et.al. | 2408.16081 | link |
2024-08-28 | Hitting the Gym: Reinforcement Learning Control of Exercise-Strengthened Biohybrid Robots in Simulation | Saul Schaffer et.al. | 2408.16069 | null |
2024-08-28 | An Extremely Data-efficient and Generative LLM-based Reinforcement Learning Agent for Recommenders | Shuang Feng et.al. | 2408.16032 | null |
2024-08-28 | Thoughtseeds: Evolutionary Priors, Nested Markov Blankets, and the Emergence of Embodied Cognition | Prakash Chandra Kavi et.al. | 2408.15982 | null |
2024-08-28 | WebPilot: A Versatile and Autonomous Multi-Agent System for Web Task Execution with Strategic Exploration | Yao Zhang et.al. | 2408.15978 | null |
2024-08-28 | BattleAgentBench: A Benchmark for Evaluating Cooperation and Competition Capabilities of Language Models in Multi-Agent Systems | Wei Wang et.al. | 2408.15971 | null |
2024-08-28 | Atari-GPT: Investigating the Capabilities of Multimodal Large Language Models as Low-Level Policies for Atari Games | Nicholas R. Waytowich et.al. | 2408.15950 | null |
2024-08-28 | Auxiliary Input in Training: Incorporating Catheter Features into Deep Learning Models for ECG-Free Dynamic Coronary Roadmapping | Yikang Liu et.al. | 2408.15947 | null |
2024-09-02 | Persuasion Games using Large Language Models | Ganesh Prasath Ramani et.al. | 2408.15879 | null |
2024-08-28 | Retrieval-Augmented Instruction Tuning for Automated Process Engineering Calculations : A Tool-Chaining Problem-Solving Framework with Attributable Reflection | Sagar Srinivas Sakhinana et.al. | 2408.15866 | null |
2024-08-28 | FlowAct: A Proactive Multimodal Human-robot Interaction System with Continuous Flow of Perception and Modular Action Sub-systems | Timothée Dhaussy et.al. | 2408.15864 | null |
2024-08-28 | Interactive Agents: Simulating Counselor-Client Psychological Counseling via Role-Playing LLM-to-LLM Interactions | Huachuan Qiu et.al. | 2408.15787 | link |
2024-09-05 | LogicGame: Benchmarking Rule-Based Reasoning Abilities of Large Language Models | Jiayi Gui et.al. | 2408.15778 | null |
2024-08-28 | A Survey on Evaluation of Multimodal Large Language Models | Jiaxing Huang et.al. | 2408.15769 | null |
2024-08-28 | Evaluating and Comparing Crowd Simulations: Perspectives from a Crowd Authoring Tool | Gabriel Fonseca Silva et.al. | 2408.15762 | null |
2024-09-01 | Reinforcement Learning for Adaptive Traffic Signal Control: Turn-Based and Time-Based Approaches to Reduce Congestion | Muhammad Tahir Rafique et.al. | 2408.15751 | null |
2024-08-28 | Different Facets for Different Experts: A Framework for Streamlining The Integration of Qualitative Insights into ABM Development | Vivek Nallur et.al. | 2408.15725 | null |
2024-08-28 | Skills Regularized Task Decomposition for Multi-task Offline Reinforcement Learning | Minjong Yoo et.al. | 2408.15593 | null |
2024-08-28 | TrafficGamer: Reliable and Flexible Traffic Simulation for Safety-Critical Scenarios with Game-Theoretic Oracles | Guanren Qiao et.al. | 2408.15538 | link |
2024-08-28 | Towards Fully Autonomous Research Powered by LLMs: Case Study on Simulations | Zhihan Liu et.al. | 2408.15512 | link |
2024-08-28 | AeroVerse: UAV-Agent Benchmark Suite for Simulating, Pre-training, Finetuning, and Evaluating Aerospace Embodied World Models | Fanglong Yao et.al. | 2408.15511 | null |
2024-08-28 | Infinite-Horizon Optimal Wireless Control Over Shared State-Dependent Fading Channels for IIoT Systems | Shuling Wang et.al. | 2408.15492 | null |
2024-08-27 | Graph Attention Inference of Network Topology in Multi-Agent Systems | Akshay Kolli et.al. | 2408.15449 | null |
2024-08-27 | Fast and Modular Autonomy Software for Autonomous Racing Vehicles | Andrew Saba et.al. | 2408.15425 | null |
2024-09-04 | Simultaneous Training of First- and Second-Order Optimizers in Population-Based Reinforcement Learning | Felix Pfeiffer et.al. | 2408.15421 | null |
2024-08-27 | On Stateful Value Factorization in Multi-Agent Reinforcement Learning | Enrico Marchesini et.al. | 2408.15381 | null |
2024-08-27 | A Multi-Agent Reinforcement Learning Scheme for SFC Placement in Edge Computing Networks | Congzhou Li et.al. | 2408.15337 | null |
2024-08-27 | Artificially intelligent Maxwell's demon for optimal control of open quantum systems | Paolo Andrea Erdman et.al. | 2408.15328 | null |
2024-08-27 | TourSynbio: A Multi-Modal Large Model and Agent Framework to Bridge Text and Protein Sequences for Protein Engineering | Yiqing Shen et.al. | 2408.15299 | link |
2024-08-27 | Into the Unknown Unknowns: Engaged Human Learning through Participation in Language Model Agent Conversations | Yucheng Jiang et.al. | 2408.15232 | null |
2024-08-27 | Exploiting Approximate Symmetry for Efficient Multi-Agent Reinforcement Learning | Batuhan Yardim et.al. | 2408.15173 | null |
2024-08-27 | Applications in CityLearn Gym Environment for Multi-Objective Control Benchmarking in Grid-Interactive Buildings and Districts | Kingsley Nweye et.al. | 2408.15170 | null |
2024-08-27 | Delay as Payoff in MAB | Ofir Schlisselberg et.al. | 2408.15158 | null |
2024-08-27 | muPRL: A Mutation Testing Pipeline for Deep Reinforcement Learning based on Real Faults | Deepak-George Thomas et.al. | 2408.15150 | link |
2024-08-29 | No Regrets: Investigating and Improving Regret Approximations for Curriculum Discovery | Alexander Rutherford et.al. | 2408.15099 | link |
2024-08-23 | Flexible categorization using formal concept analysis and Dempster-Shafer theory | Marcel Boersma et.al. | 2408.15012 | null |
2024-08-27 | AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems | Chi-Min Chan et.al. | 2408.14972 | link |
2024-08-27 | The Asymptotic Cost of Complexity | Martin W Cripps et.al. | 2408.14949 | null |
2024-08-27 | Decentralized Unlabeled Multi-agent Pathfinding Via Target And Priority Swapping (With Supplementary) | Stepan Dergachev et.al. | 2408.14948 | link |
2024-08-27 | Learning Robust Reward Machines from Noisy Labels | Roko Parac et.al. | 2408.14871 | link |
2024-08-27 | Diffusion Models Are Real-Time Game Engines | Dani Valevski et.al. | 2408.14837 | null |
2024-08-27 | Sequential-Scanning Dual-Energy CT Imaging Using High Temporal Resolution Image Reconstruction and Error-Compensated Material Basis Image Generation | Qiaoxin Li et.al. | 2408.14754 | null |
2024-08-27 | Sub-Riemannian Geometry, Mixing, and the Holonomy of Optimal Mass Transport | Mahmoud Abdelgalil et.al. | 2408.14707 | null |
2024-08-26 | Model-Based Reinforcement Learning for Control of Strongly-Disturbed Unsteady Aerodynamic Flows | Zhecheng Liu et.al. | 2408.14685 | null |
2024-08-26 | Emergent Language in Open-Ended Environments | Cornelius Wolff et.al. | 2408.14649 | null |
2024-08-26 | Biased Dueling Bandits with Stochastic Delayed Feedback | Bongsoo Yi et.al. | 2408.14603 | null |
2024-08-26 | On Centralized Critics in Multi-Agent Reinforcement Learning | Xueguang Lyu et.al. | 2408.14597 | link |
2024-08-26 | Multi-Agent Path Finding with Real Robot Dynamics and Interdependent Tasks for Automated Warehouses | Vassilissa Lehoux-Lebacque et.al. | 2408.14527 | null |
2024-08-26 | A Survey on Reinforcement Learning Applications in SLAM | Mohammad Dehghani Tezerjani et.al. | 2408.14518 | null |
2024-08-24 | Artificial intelligence for science: The easy and hard problems | Ruairidh M. Battleday et.al. | 2408.14508 | null |
2024-08-23 | Knowledge Graph Modeling-Driven Large Language Model Operating System (LLM OS) for Task Automation in Process Engineering Problem-Solving | Sakhinana Sagar Srinivas et.al. | 2408.14494 | null |
2024-08-18 | Agentic Retrieval-Augmented Generation for Time Series Analysis | Chidaksh Ravuru et.al. | 2408.14484 | null |
2024-08-26 | Employing Artificial Intelligence to Steer Exascale Workflows with Colmena | Logan Ward et.al. | 2408.14434 | null |
2024-08-26 | SWE-bench-java: A GitHub Issue Resolving Benchmark for Java | Daoguang Zan et.al. | 2408.14354 | link |
2024-09-03 | Foundation Models for Music: A Survey | Yinghao Ma et.al. | 2408.14340 | link |
2024-08-26 | Equivariant Reinforcement Learning under Partial Observability | Hai Nguyen et.al. | 2408.14336 | null |
2024-08-26 | LLM-3D Print: Large Language Models To Monitor and Control 3D Printing | Yayati Jadhav et.al. | 2408.14307 | null |
2024-08-26 | Fact Probability Vector Based Goal Recognition | Nils Wilken et.al. | 2408.14224 | link |
2024-08-26 | Robot Navigation with Entity-Based Collision Avoidance using Deep Reinforcement Learning | Yury Kolomeytsev et.al. | 2408.14183 | null |
2024-08-26 | "Hi. I'm Molly, Your Virtual Interviewer!" -- Exploring the Impact of Race and Gender in AI-powered Virtual Interview Experiences | Shreyan Biswas et.al. | 2408.14159 | null |
2024-08-26 | Investigating the effect of Mental Models in User Interaction with an Adaptive Dialog Agent | Lindsey Vanderlyn et.al. | 2408.14154 | null |
2024-09-02 | MLR-Copilot: Autonomous Machine Learning Research based on Large Language Models Agents | Ruochen Li et.al. | 2408.14033 | link |
2024-08-26 | Optimizing TD3 for 7-DOF Robotic Arm Grasping: Overcoming Suboptimality with Exploration-Enhanced Contrastive Learning | Wen-Han Hsieh et.al. | 2408.14009 | null |
2024-08-26 | Decentralized Federated Learning with Model Caching on Mobile Agents | Xiaoyu Wang et.al. | 2408.14001 | null |
2024-08-26 | Quantitative Representation of Scenario Difficulty for Autonomous Driving Based on Adversarial Policy Search | Shuo Yang et.al. | 2408.14000 | null |
2024-08-26 | AgentMove: Predicting Human Mobility Anywhere Using Large Language Model based Agentic Framework | Jie Feng et.al. | 2408.13986 | link |
2024-08-25 | CoT Rerailer: Enhancing the Reliability of Large Language Models in Complex Reasoning Tasks through Error Detection and Correction | Guangya Wan et.al. | 2408.13940 | null |
2024-08-25 | Safe Policy Exploration Improvement via Subgoals | Brian Angulo et.al. | 2408.13881 | null |
2024-08-25 | Flexible game-playing AI with AlphaViT: adapting to multiple games and board sizes | Kazuhisa Fujita et.al. | 2408.13871 | null |
2024-08-25 | Informativeness and Trust in Bayesian Persuasion | Reema Deori et.al. | 2408.13822 | null |
2024-08-25 | Optical Inversion Using Plasmonic Contrast Agents | Xinlin Cao et.al. | 2408.13793 | null |
2024-08-25 | Demo: Generative Open xG Network Simulation with Multi-Agent LLM and ns-3 (GenOnet) | Farhad Rezazadeh et.al. | 2408.13781 | null |
2024-08-25 | MASQ: Multi-Agent Reinforcement Learning for Single Quadruped Robot Locomotion | Qi Liu et.al. | 2408.13759 | null |
2024-08-25 | Localization and Expansion: A Decoupled Framework for Point Cloud Few-shot Semantic Segmentation | Zhaoyang Li et.al. | 2408.13752 | null |
2024-08-25 | Multi-Agent Target Assignment and Path Finding for Intelligent Warehouse: A Cooperative Multi-Agent Deep Reinforcement Learning Perspective | Qi Liu et.al. | 2408.13750 | null |
2024-08-25 | Count-based Novelty Exploration in Classical Planning | Giacomo Rosa et.al. | 2408.13719 | null |
2024-08-24 | How to guide a present-biased agent through prescribed tasks? | Tatiana Belova et.al. | 2408.13675 | null |
2024-08-24 | Temporal Elections: Welfare, Strategyproofness, and Proportionality | Edith Elkind et.al. | 2408.13637 | null |
2024-08-24 | DeepVoting: Learning Voting Rules with Tailored Embeddings | Leonardo Matone et.al. | 2408.13630 | null |
2024-08-24 | Reaching New Heights in Multi-Agent Collective Construction | Martin Rameš et.al. | 2408.13615 | null |
2024-08-24 | Hybrid Training for Enhanced Multi-task Generalization in Multi-agent Reinforcement Learning | Mingliang Zhang et.al. | 2408.13567 | null |
2024-08-27 | Control-Informed Reinforcement Learning for Chemical Processes | Maximilian Bloor et.al. | 2408.13566 | link |
2024-08-24 | IQA-EVAL: Automatic Evaluation of Human-Model Interactive Question Answering | Ruosen Li et.al. | 2408.13545 | null |
2024-08-24 | Unleashing Collaborative Computing for Adaptive Video Streaming with Multi-objective Optimization in Satellite Terrestrial Networks | Zhishu Shen et.al. | 2408.13512 | null |
2024-08-23 | Optimizing Collaboration of LLM based Agents for Finite Element Analysis | Chuan Tian et.al. | 2408.13406 | null |
2024-08-23 | DrugAgent: Explainable Drug Repurposing Agent with Large Language Model-based Reasoning | Yoshitaka Inoue et.al. | 2408.13378 | null |
2024-08-23 | Generative Blockchain: Transforming Blockchain from Transaction Recording to Transaction Generation through Proof-of-Merit | Haozhao Zhang et.al. | 2408.13367 | null |
2024-08-23 | Reconciling Different Theories of Learning with an Agent-based Model of Procedural Learning | Sina Rismanchian et.al. | 2408.13364 | null |
2024-08-23 | Oscillatory and Excitable Dynamics in an Opinion Model with Group Opinions | Corbit R. Sampson et.al. | 2408.13336 | link |
2024-08-23 | Mastering the Digital Art of War: Developing Intelligent Combat Simulation Agents for Wargaming Using Hierarchical Reinforcement Learning | Scotty Black et.al. | 2408.13333 | null |
2024-08-23 | Localized Observation Abstraction Using Piecewise Linear Spatial Decay for Reinforcement Learning in Combat Simulations | Scotty Black et.al. | 2408.13328 | null |
2024-08-23 | Large Language Models for Zero Touch Network Configuration Management | Oscar G. Lira et.al. | 2408.13298 | null |
2024-08-23 | The Ultimate Guide to Fine-Tuning LLMs from Basics to Breakthroughs: An Exhaustive Review of Technologies, Research, Best Practices, Applied Research Challenges and Opportunities | Venkatesh Balavadhani Parthasarathy et.al. | 2408.13296 | null |
2024-08-23 | Optimally Solving Simultaneous-Move Dec-POMDPs: The Sequential Central Planning Approach | Johan Peralez et.al. | 2408.13139 | null |
2024-08-18 | An Introduction to Cognidynamics | Marco Gori et.al. | 2408.13112 | null |
2024-08-23 | Diffusion-based Episodes Augmentation for Offline Multi-Agent Reinforcement Learning | Jihwan Oh et.al. | 2408.13092 | null |
2024-09-01 | Controllable Financial Market Generation with Diffusion Guided Meta Agent | Yu-Hao Huang et.al. | 2408.12991 | null |
2024-08-26 | Zeoformer: Coarse-Grained Periodic Graph Transformer for OSDA-Zeolite Affinity Prediction | Xiangxiang Shen et.al. | 2408.12984 | null |
2024-08-23 | Informational Embodiment: Computational role of information structure in codes and robots | Alexandre Pitti et.al. | 2408.12950 | null |
2024-08-23 | Complete Graph Identification in Population Protocols | Haruki Kanaya et.al. | 2408.12862 | null |
2024-08-23 | Online Fair Division with Contextual Bandits | Arun Verma et.al. | 2408.12845 | null |
2024-08-23 | LIMP: Large Language Model Enhanced Intent-aware Mobility Prediction | Songwei Li et.al. | 2408.12832 | link |
2024-08-23 | From Mobilisation to Radicalisation: Probing the Persistence and Radicalisation of Social Movements Using an Agent-Based Model | Emma F. Thomas et.al. | 2408.12795 | null |
2024-08-23 | Environment-Centric Active Inference | Kanako Esaki et.al. | 2408.12777 | null |
2024-08-27 | Intelligent OPC Engineer Assistant for Semiconductor Manufacturing | Guojin Chen et.al. | 2408.12775 | null |
2024-08-22 | Free-breathing 3D cardiac extracellular volume (ECV) mapping using a linear tangent space alignment (LTSA) model | Wonil Lee et.al. | 2408.12706 | null |
2024-09-01 | Can LLMs Understand Social Norms in Autonomous Driving Games? | Boxuan Wang et.al. | 2408.12680 | null |
2024-08-22 | Integrating an agent-based behavioral model in microtransit forecasting and revenue management | Xiyuan Ren et.al. | 2408.12577 | null |
2024-08-25 | MuMA-ToM: Multi-modal Multi-Agent Theory of Mind | Haojun Shi et.al. | 2408.12574 | link |
2024-08-22 | PCGRL+: Scaling, Control and Generalization in Reinforcement Learning Level Generators | Sam Earle et.al. | 2408.12525 | null |
2024-08-22 | Stochastic Online Correlated Selection | Ziyun Chen et.al. | 2408.12524 | null |
2024-08-22 | Weighted Envy-Freeness in House Allocation | Sijia Dai et.al. | 2408.12523 | null |
2024-08-22 | MEDCO: Medical Education Copilots Based on A Multi-Agent Framework | Hao Wei et.al. | 2408.12496 | null |
2024-08-22 | Multi Agent Framework for Collective Intelligence Research | Alexandru Dochian et.al. | 2408.12391 | link |
2024-08-22 | Recursive Distributed Collaborative Aided Inertial Navigation | Roland Jung et.al. | 2408.12360 | link |
2024-09-04 | Graph Retrieval Augmented Trustworthiness Reasoning | Ying Zhu et.al. | 2408.12333 | link |
2024-08-22 | Can Artificial Intelligence Embody Moral Values? | Torben Swoboda et.al. | 2408.12250 | null |
2024-08-22 | Time Optimal Distance- |
Brati Mondal et.al. | 2408.12220 | null |
2024-08-22 | MDD-5k: A New Diagnostic Conversation Dataset for Mental Disorders Synthesized via Neuro-Symbolic LLM Agents | Congchi Yin et.al. | 2408.12142 | link |
2024-08-22 | An evidence-accumulating drift-diffusion model of competing information spread on networks | Julien Corsin et.al. | 2408.12127 | null |
2024-08-22 | Emotion-Agent: Unsupervised Deep Reinforcement Learning with Distribution-Prototype Reward for Continuous Emotional EEG Analysis | Zhihao Zhou et.al. | 2408.12121 | null |
2024-08-22 | Balancing Act: Prioritization Strategies for LLM-Designed Restless Bandit Rewards | Shresth Verma et.al. | 2408.12112 | null |
2024-08-22 | Distributed Noncoherent Joint Transmission Based on Multi-Agent Reinforcement Learning for Dense Small Cell MISO Systems | Shaozhuang Bai et.al. | 2408.12067 | null |
2024-08-21 | Empirical Equilibria in Agent-based Economic systems with Learning agents | Kshama Dwarakanath et.al. | 2408.12038 | null |
2024-08-21 | Reasoning and Tools for Human-Level Forecasting | Elvis Hsieh et.al. | 2408.12036 | null |
2024-08-21 | Understanding Epistemic Language with a Bayesian Theory of Mind | Lance Ying et.al. | 2408.12022 | null |
2024-08-21 | Controlling nonergodicity in quantum many-body systems by reinforcement learning | Li-Li Ye et.al. | 2408.11989 | link |
2024-08-21 | Advances in Preference-based Reinforcement Learning: A Review | Youssef Abdelkareem et.al. | 2408.11943 | null |
2024-08-21 | Distributed alternating gradient descent for convex semi-infinite programs over a network | Ashwin Aravind et.al. | 2408.11937 | null |
2024-08-21 | Spline tie-decay temporal networks | Chanon Thongprayoon et.al. | 2408.11913 | null |
2024-08-21 | Efficient Exploration and Discriminative World Model Learning with an Object-Centric Abstraction | Anthony GX-Chen et.al. | 2408.11816 | null |
2024-08-21 | EmbodiedSAM: Online Segment Any 3D Thing in Real Time | Xiuwei Xu et.al. | 2408.11811 | null |
2024-08-21 | Story3D-Agent: Exploring 3D Storytelling Visualization with Large Language Models | Yuzhou Huang et.al. | 2408.11801 | null |
2024-08-21 | Leveraging Chemistry Foundation Models to Facilitate Structure Focused Retrieval Augmented Generation in Multi-Agent Workflows for Catalyst and Materials Design | Nathaniel H. Park et.al. | 2408.11793 | null |
2024-08-21 | DreamFactory: Pioneering Multi-Scene Long Video Generation with a Multi-Agent Framework | Zhifei Xie et.al. | 2408.11788 | null |
2024-08-21 | Leveraging Fine-Tuned Retrieval-Augmented Generation with Long-Context Support: For 3GPP Standards | Omar Erak et.al. | 2408.11775 | link |
2024-08-21 | Deviations from the Nash equilibrium and emergence of tacit collusion in a two-player optimal execution game with reinforcement learning | Fabrizio Lillo et.al. | 2408.11773 | null |
2024-08-21 | VIRIS: Simulating indoor airborne transmission combining architectural design and people movement | Yidan Xue et.al. | 2408.11772 | link |
2024-08-23 | Consensus over Clustered Networks Using Intermittent and Asynchronous Output Feedback | Federico M. Zegers et.al. | 2408.11752 | null |
2024-08-21 | Bayesian Optimization Framework for Efficient Fleet Design in Autonomous Multi-Robot Exploration | David Molina Concha et.al. | 2408.11751 | null |
2024-08-21 | Open-Ended 3D Point Cloud Instance Segmentation | Phuc D. A. Nguyen et.al. | 2408.11747 | null |
2024-08-21 | Less is more: AI Decision-Making using Dynamic Deep Neural Networks for Short-Term Stock Index Prediction | CJ Finnegan et.al. | 2408.11740 | null |
2024-08-22 | LLM4VV: Exploring LLM-as-a-Judge for Validation and Verification Testsuites | Zachariah Sollenberger et.al. | 2408.11729 | null |
2024-08-21 | Networked Communication for Mean-Field Games with Function Approximation and Empirical Mean-Field Estimation | Patrick Benjamin et.al. | 2408.11607 | null |
2024-08-21 | Optimizing QoS in HD Map Updates: Cross-Layer Multi-Agent with Hierarchical and Independent Learning | Jeffrey Redondo et.al. | 2408.11605 | null |
2024-08-21 | Cause-Aware Empathetic Response Generation via Chain-of-Thought Fine-Tuning | Xinhao Chen et.al. | 2408.11599 | null |
2024-08-21 | Drama Engine: A Framework for Narrative Agents | Martin Pichlmair et.al. | 2408.11574 | null |
2024-08-21 | AutoDirector: Online Auto-scheduling Agents for Multi-sensory Composition | Minheng Ni et.al. | 2408.11564 | null |
2024-08-21 | Semi-supervised 3D Semantic Scene Completion with 2D Vision Foundation Model Guidance | Duc-Hai Pham et.al. | 2408.11559 | null |
2024-08-21 | Fixation of leadership in non-Markovian growth processes | Tejas Iyer et.al. | 2408.11516 | null |
2024-08-21 | Verifying Approximate Equilibrium in Auctions | Fabian R. Pieroth et.al. | 2408.11445 | null |
2024-08-21 | Subgoal-based Hierarchical Reinforcement Learning for Multi-Agent Collaboration | Cheng Xu et.al. | 2408.11416 | link |
2024-08-21 | Towards "Differential AI Psychology" and in-context Value-driven Statement Alignment with Moral Foundations Theory | Simon Münker et.al. | 2408.11415 | null |
2024-08-21 | Swarm Intelligence in Geo-Localization: A Multi-Agent Large Vision-Language Model Collaborative Framework | Xiao Han et.al. | 2408.11312 | null |
2024-08-20 | CooPre: Cooperative Pretraining for V2X Cooperative Perception | Seth Z. Zhao et.al. | 2408.11241 | null |
2024-08-20 | Optimization of Multi-Agent Flying Sidekick Traveling Salesman Problem over Road Networks | Ruixiao Yang et.al. | 2408.11187 | null |
2024-08-20 | Autonomous Negotiation Using Comparison-Based Gradient Estimation | Surya Murthy et.al. | 2408.11186 | link |
2024-08-20 | Range-based Multi-Robot Integrity Monitoring Against Cyberattacks and Faults: An Anchor-Free Approach | Vishnu Vijay et.al. | 2408.11155 | null |
2024-08-20 | Accelerating Goal-Conditioned RL Algorithms and Research | Michał Bortkiewicz et.al. | 2408.11052 | link |
2024-08-20 | FLAME: Learning to Navigate with Multimodal LLM in Urban Environments | Yunzhe Xu et.al. | 2408.11051 | link |
2024-08-23 | MagicDec: Breaking the Latency-Throughput Tradeoff for Long Context Generation with Speculative Decoding | Jian Chen et.al. | 2408.11049 | link |
2024-08-20 | Athena: Safe Autonomous Agents with Verbal Contrastive Learning | Tanmana Sadhu et.al. | 2408.11021 | null |
2024-08-20 | The Evolution of Reinforcement Learning in Quantitative Finance | Nikolaos Pippas et.al. | 2408.10932 | null |
2024-08-20 | All Robots in One: A New Standard and Unified Dataset for Versatile, General-Purpose Embodied Agents | Zhiqiang Wang et.al. | 2408.10899 | null |
2024-08-23 | DBHP: Trajectory Imputation in Multi-Agent Sports Using Derivative-Based Hybrid Prediction | Hanjun Choi et.al. | 2408.10878 | null |
2024-08-20 | More Options for Prelabor Rupture of Membranes, A Bayesian Analysis | Ashley Klein et.al. | 2408.10876 | null |
2024-08-20 | Multi-agent Multi-armed Bandits with Stochastic Sharable Arm Capacities | Hong Xie et.al. | 2408.10865 | null |
2024-08-20 | Knowledge Sharing and Transfer via Centralized Reward Agent for Multi-Task Reinforcement Learning | Haozhe Ma et.al. | 2408.10858 | link |
2024-08-20 | Learning Randomized Algorithms with Transformers | Johannes von Oswald et.al. | 2408.10818 | null |
2024-08-20 | Multi-Agent Based Simulation for Decentralized Electric Vehicle Charging Strategies and their Impacts | Kristoffer Christensen et.al. | 2408.10790 | null |
2024-08-20 | Multi-agent based modeling for investigating excess heat utilization from electrolyzer production to district heating network | Kristoffer Christensen et.al. | 2408.10783 | null |
2024-08-20 | Multi-Agent Based Simulation for Investigating Centralized Charging Strategies and their Impact on Electric Vehicle Home Charging Ecosystem | Kristoffer Christensen et.al. | 2408.10773 | null |
2024-08-20 | PhishAgent: A Robust Multimodal Agent for Phishing Webpage Detection | Tri Cao et.al. | 2408.10738 | null |
2024-08-20 | Investigating Context Effects in Similarity Judgements in Large Language Models | Sagar Uprety et.al. | 2408.10711 | null |
2024-08-20 | Genesis: Towards the Automation of Systems Biology Research | Ievgeniia A. Tiukova et.al. | 2408.10689 | null |
2024-08-20 | Neural Exploratory Landscape Analysis | Zeyuan Ma et.al. | 2408.10672 | null |
2024-08-20 | Incorporating a 'ladder of trust' into dynamic Allocation of Function in Human-Autonomous Agent Collectives | Chris Baber et.al. | 2408.10654 | null |
2024-08-20 | Variations on distributed belief | John Lindqvist et.al. | 2408.10637 | null |
2024-08-20 | Strategist: Learning Strategic Skills by LLMs via Bi-Level Tree Search | Jonathan Light et.al. | 2408.10635 | null |
2024-08-21 | MUSES: 3D-Controllable Image Generation via Multi-Modal Agent Collaboration | Yanbo Ding et.al. | 2408.10605 | null |
2024-08-20 | Fast Collective Evasion in Self-Localized Swarms of Unmanned Aerial Vehicles | Filip Novák et.al. | 2408.10596 | null |
2024-08-20 | Synchronization behind Learning in Periodic Zero-Sum Games Triggers Divergence from Nash equilibrium | Yuma Fujimoto et.al. | 2408.10595 | null |
2024-08-20 | Bidirectional Intent Communication: A Role for Large Foundation Models | Tim Schreiter et.al. | 2408.10589 | null |
2024-08-20 | DEGAS: Detailed Expressions on Full-Body Gaussian Avatars | Zhijing Shao et.al. | 2408.10588 | null |
2024-08-20 | Hokoff: Real Game Dataset from Honor of Kings and its Offline Reinforcement Learning Benchmarks | Yun Qu et.al. | 2408.10556 | link |
2024-08-20 | Semi-on-Demand Off-Peak Transit Services with Shared Autonomous Vehicles -- Service Planning, Simulation, and Analysis in Munich, Germany | Max T. M. Ng et.al. | 2408.10547 | null |
2024-08-20 | Subspace Prototype Guidance for Mitigating Class Imbalance in Point Cloud Semantic Segmentation | Jiawei Han et.al. | 2408.10537 | link |
2024-08-20 | Approximate Estimation of High-dimension Execution Skill for Dynamic Agents in Continuous Domains | Delma Nieves-Rivera et.al. | 2408.10512 | null |
2024-08-20 | Evaluation Framework for AI-driven Molecular Design of Multi-target Drugs: Brain Diseases as a Case Study | Arthur Cerveira et.al. | 2408.10482 | link |
2024-08-24 | IDEA:Enhancing the Rule Learning Ability of Language Agents through Induction, Deduction, and Abduction | Kaiyu He et.al. | 2408.10455 | null |
2024-08-19 | Kubrick: Multimodal Agent Collaborations for Synthetic Video Generation | Liu He et.al. | 2408.10453 | null |
2024-08-19 | Tax Credits and Household Behavior: The Roles of Myopic Decision-Making and Liquidity in a Simulated Economy | Jialin Dong et.al. | 2408.10391 | null |
2024-08-19 | Narrowing the Gap between Vision and Action in Navigation | Yue Zhang et.al. | 2408.10388 | link |
2024-08-19 | Competing Social Contagions with Opinion Dependent Infectivity | Corbit R. Sampson et.al. | 2408.10373 | link |
2024-08-19 | Toward Fair and Strategyproof Tournament Rules for Tournaments with Partially Transferable Utilities | David Pennock et.al. | 2408.10346 | null |
2024-08-17 | Why and How do Complex Systems Self-Organize at All? Average Action Efficiency as a Predictor, Measure, Driver, and Mechanism of Self-Organization | Matthew J Brouillet et.al. | 2408.10278 | null |
2024-08-19 | Don't Get Stuck: A Deadlock Recovery Approach | Francesca Baldini et.al. | 2408.10167 | null |
2024-08-19 | Learning Precise Affordances from Egocentric Videos for Robotic Manipulation | Gen Li et.al. | 2408.10123 | null |
2024-08-19 | Enhancing Reinforcement Learning Through Guided Search | Jérôme Arjonilla et.al. | 2408.10113 | null |
2024-08-19 | No Screening is More Efficient with Multiple Objects | Shunya Noda et.al. | 2408.10077 | null |
2024-08-19 | Synthesis of Reward Machines for Multi-Agent Equilibrium Design (Full Version) | Muhammad Najib et.al. | 2408.10074 | null |
2024-08-19 | Near-Optimal Mechanisms for Resource Allocation Without Monetary Transfers | Moise Blanchard et.al. | 2408.10066 | null |
2024-08-19 | The Practimum-Optimum Algorithm for Manufacturing Scheduling: A Paradigm Shift Leading to Breakthroughs in Scale and Performance | Moshe BenBassat et.al. | 2408.10040 | null |
2024-08-19 | The Expressive Power of Uniform Population Protocols with Logarithmic Space | Philipp Czerner et.al. | 2408.10027 | null |
2024-08-19 | Adaptive BESS and Grid Setpoints Optimization: A Model-Free Framework for Efficient Battery Management under Dynamic Tariff Pricing | Alaa Selim et.al. | 2408.09989 | null |
2024-08-19 | The Exploration-Exploitation Dilemma Revisited: An Entropy Perspective | Renye Yan et.al. | 2408.09974 | null |
2024-08-20 | MegaAgent: A Practical Framework for Autonomous Cooperation in Large-Scale LLM Agent Systems | Qian Wang et.al. | 2408.09955 | null |
2024-08-19 | Boltzmann approach to collective motion via non-local visual interaction | Susumu Ito et.al. | 2408.09917 | null |
2024-08-19 | Multi-layer diffusion model of photovoltaic installations | Tomasz Weron et.al. | 2408.09904 | null |
2024-08-19 | Demystifying Reinforcement Learning in Production Scheduling via Explainable AI | Daniel Fischer et.al. | 2408.09841 | null |
2024-08-19 | Mitigating the Stability-Plasticity Dilemma in Adaptive Train Scheduling with Curriculum-Driven Continual DQN Expansion | Achref Jaziri et.al. | 2408.09838 | null |
2024-08-20 | World Models Increase Autonomy in Reinforcement Learning | Zhao Yang et.al. | 2408.09807 | null |
2024-08-19 | Anim-Director: A Large Multimodal Model Powered Agent for Controllable Animation Video Generation | Yunxin Li et.al. | 2408.09787 | link |
2024-08-19 | GoNoGo: An Efficient LLM-based Multi-Agent System for Streamlining Automotive Software Release Decision-Making | Arsham Gholamzadeh Khoee et.al. | 2408.09785 | null |
2024-08-19 | Targeted Drug Delivery: Algorithmic Methods for Collecting a Swarm of Particles with Uniform External Forces | Aaron T. Becker et.al. | 2408.09729 | null |
2024-08-19 | Algorithmic Contract Design with Reinforcement Learning Agents | David Molina Concha et.al. | 2408.09686 | null |
2024-08-19 | Multi-Agent Reinforcement Learning for Autonomous Driving: A Survey | Ruiqi Zhang et.al. | 2408.09675 | link |
2024-08-20 | BLADE: Benchmarking Language Model Agents for Data-Driven Science | Ken Gu et.al. | 2408.09667 | link |
2024-08-19 | Linear-Quadratic Mean-Field Game for Stochastic Systems with Partial Observation | Min Li et.al. | 2408.09652 | null |
2024-08-18 | Prescribed-time Convergent Distributed Multiobjective Optimization with Dynamic Event-triggered Communication | Tengyang Gong et.al. | 2408.09602 | null |
2024-08-21 | Löb-Safe Logics for Reflective Agents | Seth Ahrenbach et.al. | 2408.09590 | null |
2024-08-18 | HiAgent: Hierarchical Working Memory Management for Solving Long-Horizon Agent Tasks with Large Language Model | Mengkang Hu et.al. | 2408.09559 | link |
2024-08-18 | Enhancing Population-based Search with Active Inference | Nassim Dehouche et.al. | 2408.09548 | null |
2024-08-18 | A Logic for Policy Based Resource Exchanges in Multiagent Systems | Lorenzo Ceragioli et.al. | 2408.09516 | null |
2024-08-18 | Beyond Local Views: Global State Inference with Diffusion Models for Cooperative Multi-Agent Reinforcement Learning | Zhiwei Xu et.al. | 2408.09501 | null |
2024-08-18 | Ancestral Reinforcement Learning: Unifying Zeroth-Order Optimization and Genetic Algorithms for Reinforcement Learning | So Nakashima et.al. | 2408.09493 | null |
2024-08-18 | HySem: A context length optimized LLM pipeline for unstructured tabular extraction | Narayanan PP et.al. | 2408.09434 | null |
2024-08-18 | Value-Enriched Population Synthesis: Integrating a Motivational Layer | Alba Aguilera et.al. | 2408.09407 | null |
2024-08-18 | Optimal stopping and divestment timing under scenario ambiguity and learning | Andrea Mazzon et.al. | 2408.09349 | null |
2024-08-17 | How to Make an Action Better | Marilyn Pease et.al. | 2408.09294 | null |
2024-08-17 | GSLAMOT: A Tracklet and Query Graph-based Simultaneous Locating, Mapping, and Multiple Object Tracking System | Shuo Wang et.al. | 2408.09191 | null |
2024-08-17 | Generative Agent-Based Models for Complex Systems Research: a review | Yikang Lu et.al. | 2408.09175 | null |
2024-08-17 | Worst- and Average-Case Robustness of Stable Matchings: (Counting) Complexity and Experiments | Kimon Boehmer et.al. | 2408.09160 | null |
2024-08-17 | Training Verifiably Robust Agents Using Set-Based Reinforcement Learning | Manuel Wendl et.al. | 2408.09112 | null |
2024-08-17 | Me want cookie! Towards automated and transparent data governance on the Web | Jesse Wright et.al. | 2408.09071 | null |
2024-08-16 | On the Completeness of Conflict-Based Search: Temporally-Relative Duplicate Pruning | Thayne T Walker et.al. | 2408.09028 | null |
2024-08-16 | Visual Agents as Fast and Slow Thinkers | Guangyan Sun et.al. | 2408.08862 | link |
2024-08-16 | CPS-TaskForge: Generating Collaborative Problem Solving Environments for Diverse Communication Tasks | Nikita Haduong et.al. | 2408.08853 | null |
2024-08-16 | A Novel Quantum Algorithm for Efficient Attractor Search in Gene Regulatory Networks | Mirko Rossini et.al. | 2408.08814 | link |
2024-08-16 | CAT: Caution Aware Transfer in Reinforcement Learning via Distributional Risk | Mohamad Fares El Hajj Chehade et.al. | 2408.08812 | null |
2024-08-16 | EmoDynamiX: Emotional Support Dialogue Strategy Prediction by Modelling MiXed Emotions and Discourse Dynamics | Chenwei Wan et.al. | 2408.08782 | link |
2024-08-16 | Beyond Proportional Individual Guarantees for Binary Perpetual Voting | Yotam Gafni et.al. | 2408.08767 | null |
2024-08-16 | Rethinking Generative Semantic Communication for Multi-User Systems with Multi-Modal LLM | Wanting Yang et.al. | 2408.08765 | null |
2024-08-16 | SYMPOL: Symbolic Tree-Based On-Policy Reinforcement Learning | Sascha Marton et.al. | 2408.08761 | link |
2024-08-16 | Weighted Envy-free Allocation with Subsidy | Haris Aziz et.al. | 2408.08711 | null |
2024-08-16 | Explore-then-Commit Algorithms for Decentralized Two-Sided Matching Markets | Tejas Pagare et.al. | 2408.08690 | null |
2024-08-24 | The Fellowship of the LLMs: Multi-Agent Workflows for Synthetic Preference Optimization Dataset Generation | Samee Arif et.al. | 2408.08688 | link |
2024-08-16 | Neural Reward Machines | Elena Umili et.al. | 2408.08677 | link |
2024-08-16 | Fine-tuning LLMs for Autonomous Spacecraft Control: A Case Study Using Kerbal Space Program | Alejandro Carrasco et.al. | 2408.08676 | link |
2024-08-16 | An End-to-End Model for Photo-Sharing Multi-modal Dialogue Generation | Peiming Guo et.al. | 2408.08650 | null |
2024-08-16 | A survey on secure decentralized optimization and learning | Changxin Liu et.al. | 2408.08628 | null |
2024-08-16 | DeepREST: Automated Test Case Generation for REST APIs Exploiting Deep Reinforcement Learning | Davide Corradini et.al. | 2408.08594 | null |