Automatically update LLM-Agent papers daily using GitHub Actions (updates every 12 hours)

Lyz103/LLM-Agent-Paper-daily

Updated on 2024.09.20

Usage instructions: here

Table of Contents
  1. Agents

Agents

Publish Date Title Authors PDF Code
2024-09-18 Residual Descent Differential Dynamic Game (RD3G) -- A Fast Newton Solver for Constrained General Sum Games Zhiyuan Zhang et.al. 2409.12152 null
2024-09-18 MAgICoRe: Multi-Agent, Iterative, Coarse-to-Fine Refinement for Reasoning Justin Chih-Yao Chen et.al. 2409.12147 link
2024-09-19 The Impact of Element Ordering on LM Agent Performance Wayne Chi et.al. 2409.12089 link
2024-09-19 Using Large Language Models to Generate Clinical Trial Tables and Figures Yumeng Yang et.al. 2409.12046 null
2024-09-19 Representing Positional Information in Generative World Models for Object Manipulation Stefano Ferraro et.al. 2409.12005 null
2024-09-18 Putting Data at the Centre of Offline Multi-Agent Reinforcement Learning Claude Formanek et.al. 2409.12001 null
2024-09-18 On the Stability of Consensus Control under Rotational Ambiguities Zhonggang Li et.al. 2409.11979 null
2024-09-18 Anomalous behavior of Replicator dynamics for the Prisoner's Dilemma on diluted lattices Fernanda R. Leivas et.al. 2409.11955 null
2024-09-18 Reinforcement Learning as an Improvement Heuristic for Real-World Production Scheduling Arthur Müller et.al. 2409.11933 null
2024-09-18 Secure Control Systems for Autonomous Quadrotors against Cyber-Attacks Samuel Belkadi et.al. 2409.11897 link
2024-09-18 Motivations, Challenges, Best Practices, and Benefits for Bots and Conversational Agents in Software Engineering: A Multivocal Literature Review Stefano Lambiase et.al. 2409.11864 null
2024-09-18 XP-MARL: Auxiliary Prioritization in Multi-Agent Reinforcement Learning to Address Non-Stationarity Jianye Xu et.al. 2409.11852 null
2024-09-18 Optimizing Job Shop Scheduling in the Furniture Industry: A Reinforcement Learning Approach Considering Machine Setup, Batch Variability, and Intralogistics Malte Schneevogt et.al. 2409.11820 null
2024-09-18 Distributed Resilient Secondary Control for Microgrids with Attention-based Weights against High-density Misbehaving Agents Yutong Li et.al. 2409.11812 null
2024-09-18 Synthesizing Evolving Symbolic Representations for Autonomous Systems Gabriele Sartor et.al. 2409.11756 link
2024-09-18 HARP: Human-Assisted Regrouping with Permutation Invariant Critic for Multi-Agent Reinforcement Learning Huawen Hu et.al. 2409.11741 null
2024-09-18 Revealing the Challenge of Detecting Character Knowledge Errors in LLM Role-Playing Wenyuan Zhang et.al. 2409.11726 link
2024-09-18 Discovering Conceptual Knowledge with Analytic Ontology Templates for Articulated Objects Jianhua Sun et.al. 2409.11702 null
2024-09-18 RMP-YOLO: A Robust Motion Predictor for Partially Observable Scenarios even if You Only Look Once Jiawei Sun et.al. 2409.11696 null
2024-09-18 Towards Explainable Goal Recognition Using Weight of Evidence (WoE): A Human-Centered Approach Abeer Alshehri et.al. 2409.11675 null
2024-09-18 Agent Aggregator with Mask Denoise Mechanism for Histopathology Whole Slide Image Analysis Xitong Ling et.al. 2409.11664 null
2024-09-18 From Data Stories to Dialogues: A Randomised Controlled Trial of Generative AI Agents and Data Storytelling in Enhancing Data Visualisation Comprehension Lixiang Yan et.al. 2409.11645 null
2024-09-17 Context-Generative Default Policy for Bounded Rational Agent Durgakant Pushp et.al. 2409.11604 null
2024-09-17 React to This! How Humans Challenge Interactive Agents using Nonverbal Behaviors Chuxuan Zhang et.al. 2409.11602 null
2024-09-17 Distributed Deep Koopman Learning for Nonlinear Dynamics Wenjian Hao et.al. 2409.11586 null
2024-09-17 PLATO: Planning with LLMs and Affordances for Tool Manipulation Arvind Car et.al. 2409.11580 null
2024-09-17 Optimal Investment with Costly Expert Opinions Christoph Knochenhauer et.al. 2409.11569 null
2024-09-17 Hyper-SAMARL: Hypergraph-based Coordinated Task Allocation and Socially-aware Navigation for Multi-Robot Systems Weizheng Wang et.al. 2409.11561 null
2024-09-17 Improving LLM Reasoning with Multi-Agent Tree-of-Thought Validator Agent Fatemeh Haji et.al. 2409.11527 null
2024-09-17 Diffusion of knowledge and the lottery society Henri Berestycki et.al. 2409.11479 null
2024-09-17 Consensus decision making on a complete graph: complex behaviour from simple assumptions P. Sarkanych et.al. 2409.11475 null
2024-09-12 Towards Opinion Shaping: A Deep Reinforcement Learning Approach in Bot-User Interactions Farbod Siahkali et.al. 2409.11426 null
2024-09-17 Ising model with varying spin strength on a scale-free network: scaling functions and critical amplitude ratios M. Krasnytska et.al. 2409.11396 null
2024-09-17 Distributed Perception Aware Safe Leader Follower System via Control Barrier Methods Richie R. Suganda et.al. 2409.11394 null
2024-09-17 LLM-Agent-UMF: LLM-based Agent Unified Modeling Framework for Seamless Integration of Multi Active/Passive Core-Agents Amine B. Hassouna et.al. 2409.11393 null
2024-09-17 CORE-Bench: Fostering the Credibility of Published Research Through a Computational Reproducibility Agent Benchmark Zachary S. Siegel et.al. 2409.11363 link
2024-09-17 A Scalable Game Theoretic Approach for Coordination of Multiple Dynamic Systems Mostafa M. Shibl et.al. 2409.11358 null
2024-09-17 EIA: Environmental Injection Attack on Generalist Web Agents for Privacy Leakage Zeyi Liao et.al. 2409.11295 null
2024-09-17 P-RAG: Progressive Retrieval Augmented Generation For Planning on Embodied Everyday Task Weiye Xu et.al. 2409.11279 null
2024-09-17 Hackphyr: A Local Fine-Tuned LLM Agent for Network Security Environments Maria Rigaki et.al. 2409.11276 null
2024-09-19 The Art of Storytelling: Multi-Agent Generative AI for Dynamic Multimodal Narratives Samee Arif et.al. 2409.11261 link
2024-09-17 To What Extent do Open-loop and Feedback Nash Equilibria Diverge in General-Sum Linear Quadratic Dynamic Games? Chih-Yuan Chiu et.al. 2409.11257 null
2024-09-17 A Continuous-time Tractable Model for Present-biased Agents Yasunori Akagi et.al. 2409.11225 null
2024-09-17 Bearing-based Target Localisation in Search and Rescue Scenarios Giulia Michieletto et.al. 2409.11221 null
2024-09-17 SuperCoder2.0: Technical Report on Exploring the feasibility of LLMs as Autonomous Programmer Anmol Gautam et.al. 2409.11190 null
2024-09-18 Annealed Winner-Takes-All for Motion Forecasting Yihong Xu et.al. 2409.11172 link
2024-09-17 Preventing Unconstrained CBF Safety Filters Caused by Invalid Relative Degree Assumptions Lukas Brunke et.al. 2409.11171 null
2024-09-17 Reactive Environments for Active Inference Agents with RxEnvironments.jl Wouter W. L. Nuijten et.al. 2409.11087 link
2024-09-17 Data-driven Dynamic Intervention Design in Network Games Xiupeng Chen et.al. 2409.11069 null
2024-09-17 A logical alarm for misaligned binary classifiers Andrés Corrada-Emmanuel et.al. 2409.11052 null
2024-09-17 Improving Speech Emotion Recognition in Under-Resourced Languages via Speech-to-Speech Translation with Bootstrapping Data Selection Hsi-Che Lin et.al. 2409.10985 null
2024-09-17 Label-free correlative morpho-chemical tomography of 3D kidney mesangial cells Ankit Butola et.al. 2409.10971 null
2024-09-17 Frontier Shepherding: A Bio-Mimetic Multi-robot Framework for Large-Scale Exploration John Lewis et.al. 2409.10931 null
2024-09-17 Multi-Floor Zero-Shot Object Navigation Policy Lingfeng Zhang et.al. 2409.10906 null
2024-09-17 Distributed Optimization for Traffic Light Control and Connected Automated Vehicle Coordination in Mixed-Traffic Intersections Viet-Anh Le et.al. 2409.10864 null
2024-09-17 SIFToM: Robust Spoken Instruction Following through Theory of Mind Lance Ying et.al. 2409.10849 null
2024-09-17 Improving Interface Design in Interactive Task Learning for Hierarchical Tasks based on a Qualitative Study Jieyu Zhou et.al. 2409.10826 null
2024-09-17 Consensus in Models for Opinion Dynamics with Generalized-Bias Juan Paz et.al. 2409.10809 null
2024-09-16 AutoSafeCoder: A Multi-Agent Framework for Securing LLM Code Generation through Static Analysis and Fuzz Testing Ana Nunez et.al. 2409.10737 null
2024-09-16 CoMamba: Real-time Cooperative Perception Unlocked with State Space Models Jinlong Li et.al. 2409.10699 null
2024-09-16 Mitigating Partial Observability in Adaptive Traffic Signal Control with Transformers Xiaoyu Wang et.al. 2409.10693 null
2024-09-16 Multi-agent Path Finding in Continuous Environment Kristýna Janovská et.al. 2409.10680 null
2024-09-16 Motion Forecasting via Model-Based Risk Minimization Aron Distelzweig et.al. 2409.10585 null
2024-09-16 Reinforcement Learning with Quasi-Hyperbolic Discounting S. R. Eshwar et.al. 2409.10583 null
2024-09-14 On the limits of agency in agent-based models Ayush Chopra et.al. 2409.10568 link
2024-09-13 Applying Action Masking and Curriculum Learning Techniques to Improve Data Efficiency and Overall Performance in Operational Technology Cyber Security using Reinforcement Learning Alec Wilson et.al. 2409.10563 null
2024-09-16 On interactive anisotropic walks in two dimensions generated from a three state opinion dynamics model Surajit Saha et.al. 2409.10413 null
2024-09-16 Reducing Leximin Fairness to Utilitarian Optimization Eden Hartman et.al. 2409.10395 null
2024-09-16 Decentralized and Asymmetric Multi-Agent Learning in Construction Sites Yakov Miron et.al. 2409.10375 null
2024-09-19 Instigating Cooperation among LLM Agents Using Adaptive Information Modulation Qiliang Chen et.al. 2409.10372 null
2024-09-16 2D or not 2D: How Does the Dimensionality of Gesture Representation Affect 3D Co-Speech Gesture Generation? Téo Guichoux et.al. 2409.10357 null
2024-09-16 Partial Ordering Bayesian Logistic Regression Model for Phase I Combination Trials and Computationally Efficient Approach to Operational Prior Specification Weishi Chen et.al. 2409.10352 null
2024-09-16 Cognitive Kernel: An Open-source Agent System towards Generalist Autopilots Hongming Zhang et.al. 2409.10277 link
2024-09-16 Synchronization-Based Cooperative Distributed Model Predictive Control Julius Beerwerth et.al. 2409.10215 null
2024-09-16 Maneuver Decision-Making with Trajectory Streams Prediction for Autonomous Vehicles Mais Jamal et.al. 2409.10165 null
2024-09-16 Multi-Agent Obstacle Avoidance using Velocity Obstacles and Control Barrier Functions Alejandro Sánchez Roncero et.al. 2409.10117 null
2024-09-16 Robust Reinforcement Learning with Dynamic Distortion Risk Measures Anthony Coache et.al. 2409.10096 link
2024-09-16 Cross-modality image synthesis from TOF-MRA to CTA using diffusion-based models Alexander Koch et.al. 2409.10089 null
2024-09-19 Towards Physically-Realizable Adversarial Attacks in Embodied Vision Navigation Meng Chen et.al. 2409.10071 link
2024-09-16 A Social Force Model for Multi-Agent Systems With Application to Robots Traversal in Cluttered Environments Chenxi Li et.al. 2409.10049 null
2024-09-16 Audio-Driven Reinforcement Learning for Head-Orientation in Naturalistic Environments Wessel Ledder et.al. 2409.10048 null
2024-09-16 Bearing-Distance Based Flocking with Zone-Based Interactions Hossein B. Jond et.al. 2409.10047 null
2024-09-16 E2Map: Experience-and-Emotion Map for Self-Reflective Robot Navigation with Language Models Chan Kim et.al. 2409.10027 null
2024-09-16 Reinforcement learning-based statistical search strategy for an axion model from flavor Satsuki Nishimura et.al. 2409.10023 null
2024-09-16 SHIRE: Enhancing Sample Efficiency using Human Intuition in REinforcement Learning Amogh Joshi et.al. 2409.09990 null
2024-09-16 Optimality Gap of Decentralized Submodular Maximization under Probabilistic Communication Joan Vendrell et.al. 2409.09979 null
2024-09-16 Constrained Bandwidth Observation Sharing for Multi-Robot Navigation in Dynamic Environments via Intelligent Knapsack Anirudh Chari et.al. 2409.09975 null
2024-09-16 Solving Monotone Variational Inequalities with Best Response Dynamics Yu-Wen Chen et.al. 2409.09961 null
2024-09-16 Context-aware Advertisement Modeling and Applications in Rapid Transit Systems Afzal Ahmed et.al. 2409.09956 null
2024-09-15 Critic as Lyapunov function (CALF): a model-free, stability-ensuring agent Pavel Osinenko et.al. 2409.09869 null
2024-09-15 A Complete Algorithm for a Moving Target Traveling Salesman Problem with Obstacles Anoop Bhat et.al. 2409.09852 null
2024-09-15 On the Effect of Robot Errors on Human Teaching Dynamics Jindan Huang et.al. 2409.09827 null
2024-09-15 Revisiting the state-space model of unawareness Alex A. T. Rathke et.al. 2409.09818 null
2024-09-15 Social Influence and Consensus Building: Introducing a q-Voter Model with Weighted Influence Pratik Mullick et.al. 2409.09817 null
2024-09-17 Large Language Model Based Generative Error Correction: A Challenge and Baselines for Speech Recognition, Speaker Tagging, and Emotion Recognition Chao-Han Huck Yang et.al. 2409.09785 null
2024-09-15 DiFSD: Ego-Centric Fully Sparse Paradigm with Uncertainty Denoising and Iterative Refinement for Efficient End-to-End Autonomous Driving Haisheng Su et.al. 2409.09777 null
2024-09-15 Range-SLAM: Ultra-Wideband-Based Smoke-Resistant Real-Time Localization and Mapping Yi Liu et.al. 2409.09763 null
2024-09-15 Automatic Control With Human-Like Reasoning: Exploring Language Model Embodied Air Traffic Agents Justas Andriuškevičius et.al. 2409.09717 null
2024-09-15 Unveiling Gender Bias in Large Language Models: Using Teacher's Evaluation in Higher Education As an Example Yuanning Huang et.al. 2409.09652 link
2024-09-15 RethinkMCTS: Refining Erroneous Thoughts in Monte Carlo Tree Search for Code Generation Qingyao Li et.al. 2409.09584 null
2024-09-15 Traffic Scene Generation from Natural Language Description for Autonomous Vehicles with Large Language Model Bo-Kai Ruan et.al. 2409.09575 null
2024-09-15 Decentralized Safe and Scalable Multi-Agent Control under Limited Actuation Vrushabh Zinage et.al. 2409.09573 null
2024-09-14 Swarm Algorithms for Dynamic Task Allocation in Unknown Environments Adithya Balachandran et.al. 2409.09550 null
2024-09-14 Autonomous Goal Detection and Cessation in Reinforcement Learning: A Case Study on Source Term Estimation Yiwei Shi et.al. 2409.09541 null
2024-09-14 Ensuring System-Level Protection against Eavesdropping Adversaries in Distributed Dynamical Systems Dipankar Maity et.al. 2409.09539 null
2024-09-14 Planning Transformer: Long-Horizon Offline Reinforcement Learning with Planning Tokens Joseph Clinton et.al. 2409.09513 null
2024-09-14 Learning Nudges for Conditional Cooperation: A Multi-Agent Reinforcement Learning Model Shatayu Kulkarni et.al. 2409.09509 null
2024-09-14 Learning Keypoints for Multi-Agent Behavior Analysis using Self-Supervision Daniel Khalil et.al. 2409.09455 null
2024-09-14 Initial Error Affection and Error Correction in Linear Quadratic Mean Field Games under Erroneous Initial Information Yuxin Jin et.al. 2409.09375 null
2024-09-14 The (n,k) game with heterogeneous agents Hsin-Lun Li et.al. 2409.09364 null
2024-09-14 PeriGuru: A Peripheral Robotic Mobile App Operation Assistant based on GUI Image Understanding and Prompting with LLM Kelin Fu et.al. 2409.09354 link
2024-09-14 Enhancing Decision-Making for LLM Agents via Step-Level Q-Value Models Yuanzhao Zhai et.al. 2409.09345 null
2024-09-14 Capability Augmentation for Heterogeneous Dynamic Teaming with Temporal Logic Tasks Carter Berlind et.al. 2409.09285 null
2024-09-14 Python Symbolic Execution with LLM-powered Code Generation Wenhan Wang et.al. 2409.09271 null
2024-09-14 High-Fidelity Data-Driven Dynamics Model for Reinforcement Learning-based Magnetic Control in HL-3 Tokamak Niannian Wu et.al. 2409.09238 null
2024-09-19 Curricula for Learning Robust Policies with Factored State Representations in Changing Environments Panayiotis Panayiotou et.al. 2409.09169 null
2024-09-13 Measure Preserving Flows for Ergodic Search in Convoluted Environments Albert Xu et.al. 2409.09164 null
2024-09-08 ELMS: Elasticized Large Language Models On Mobile Devices Wangsong Yin et.al. 2409.09071 null
2024-09-13 The unknotting number, hard unknot diagrams, and reinforcement learning Taylor Applebaum et.al. 2409.09032 null
2024-09-13 Optically-Validated Microvascular Phantom for Super-Resolution Ultrasound Imaging Jaime Parra Raad et.al. 2409.09031 null
2024-09-13 Agents in Software Engineering: Survey, Landscape, and Vision Yanxian Huang et.al. 2409.09030 link
2024-09-13 AI-LieDar: Examine the Trade-off Between Utility and Truthfulness in LLM Agents Zhe Su et.al. 2409.09013 null
2024-09-13 Mechanism Design for Extending the Accessibility of Facilities Hau Chan et.al. 2409.08993 null
2024-09-13 Safeguarding Decentralized Social Media: LLM Agents for Automating Community Rule Compliance Lucio La Cava et.al. 2409.08963 null
2024-09-13 Emerging Reliance Behaviors in Human-AI Text Generation: Hallucinations, Data Quality Assessment, and Cognitive Forcing Functions Zahra Ashktorab et.al. 2409.08937 null
2024-09-13 Farmer.Chat: Scaling AI-Powered Agricultural Services for Smallholder Farmers Namita Singh et.al. 2409.08916 null
2024-09-13 Exploring Action-Centric Representations Through the Lens of Rate-Distortion Theory Miguel de Llanza Varona et.al. 2409.08892 null
2024-09-13 Using The Concept Hierarchy for Household Action Recognition Andrei Costinescu et.al. 2409.08853 null
2024-09-13 Deep reinforcement learning for tracking a moving target in jellyfish-like swimming Yihao Chen et.al. 2409.08815 null
2024-09-13 Mutual Theory of Mind in Human-AI Collaboration: An Empirical Study with LLM-driven AI Agents in a Real-time Shared Workspace Task Shao Zhang et.al. 2409.08811 null
2024-09-13 HOLA-Drone: Hypergraphic Open-ended Learning for Zero-Shot Multi-Drone Cooperative Pursuit Yang Li et.al. 2409.08767 null
2024-09-13 Fusing Dynamics Equation: A Social Opinions Prediction Algorithm with LLM-based Agents Junchi Yao et.al. 2409.08717 null
2024-09-13 Systematic analysis of requirements for socially acceptable service robots Andrea Ruo et.al. 2409.08677 null
2024-09-13 Average Consensus over Directed Networks in Open Multi-Agent Systems with Acknowledgement Feedback Evagoras Makridis et.al. 2409.08634 null
2024-09-13 Generalization of Gershgorin's theorem. Analysis and design of control laws Igor Furtat et.al. 2409.08576 null
2024-09-13 Expediting and Elevating Large Language Model Reasoning via Hidden Chain-of-Thought Decoding Tianqiao Liu et.al. 2409.08561 null
2024-09-16 Can AI Prompt Humans? Multimodal Agents Prompt Players' Game Actions and Show Consequences to Raise Sustainability Awareness Qinshi Zhang et.al. 2409.08486 null
2024-09-13 A BERT-Based Summarization approach for depression detection Hossein Salahshoor Gavalan et.al. 2409.08483 null
2024-09-12 A Surveillance Game between a Differential Drive Robot and an Omnidirectional Agent: The Case of a Faster Evader Rodrigo Saavedra et.al. 2409.08414 null
2024-09-12 Sequential Discrete Action Selection via Blocking Conditions and Resolutions Liam Merz Hoffmeister et.al. 2409.08410 null
2024-09-12 Knowledge Tagging with Large Language Model based Multi-Agent System Hang Li et.al. 2409.08406 null
2024-09-12 Self-Supervised Inference of Agents in Trustless Environments Vladyslav Larin et.al. 2409.08386 null
2024-09-12 An Experimental Study of Competitive Market Behavior Through LLMs Jingru Jia et.al. 2409.08357 null
2024-09-13 Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale Rogerio Bonatti et.al. 2409.08264 link
2024-09-12 How can the tragedy of the commons be prevented?: Introducing Linear Quadratic Mixed Mean Field Games Gokce Dayanikli et.al. 2409.08235 null
2024-09-12 Linear Complementary Dual Codes Constructed from Reinforcement Learning Yansheng Wu et.al. 2409.08114 null
2024-09-12 MosquitoMiner: A Light Weight Rover for Detecting and Eliminating Mosquito Breeding Sites Md. Adnanul Islam et.al. 2409.08078 link
2024-09-13 Learning Communities from Equilibria of Nonlinear Opinion Dynamics Yu Xing et.al. 2409.08004 null
2024-09-12 Autonomous Vehicle Controllers From End-to-End Differentiable Simulation Asen Nachkov et.al. 2409.07965 null
2024-09-12 WirelessAgent: Large Language Model Agents for Intelligent Wireless Networks Jingwen Tong et.al. 2409.07964 link
2024-09-12 Covariance Intersection-based Invariant Kalman Filtering (DInCIKF) for Distributed Pose Estimation Haoying Li et.al. 2409.07933 null
2024-09-12 Reinforcement Learning Discovers Efficient Decentralized Graph Path Search Strategies Alexei Pisacane et.al. 2409.07932 null
2024-09-12 Tidal MerzA: Combining affective modelling and autonomous code generation through Reinforcement Learning Elizabeth Wilson et.al. 2409.07918 null
2024-09-12 Tera-SpaceCom: GNN-based Deep Reinforcement Learning for Joint Resource Allocation and Task Offloading in TeraHertz Band Space Networks Zhifeng Hu et.al. 2409.07911 null
2024-09-12 UNIT: Unsupervised Online Instance Segmentation through Time Corentin Sautier et.al. 2409.07887 null
2024-09-12 Mapping Technical Safety Research at AI Companies: A literature review and incentives analysis Oscar Delaney et.al. 2409.07878 null
2024-09-12 ReGentS: Real-World Safety-Critical Driving Scenario Generation Made Stable Yuan Yin et.al. 2409.07830 null
2024-09-12 GateAttentionPose: Enhancing Pose Estimation with Agent Attention and Improved Gated Convolutions Liang Feng et.al. 2409.07798 null
2024-09-12 A Spatiotemporal Stealthy Backdoor Attack against Cooperative Multi-Agent Deep Reinforcement Learning Yinbo Yu et.al. 2409.07775 null
2024-09-12 Accelerated Multi-Time-Scale Stochastic Approximation: Optimal Complexity and Applications in Reinforcement Learning and Multi-Agent Games Sihan Zeng et.al. 2409.07767 null
2024-09-12 Distributed Learning Dynamics Converging to the Core of $B$-Matchings Aya Hamed et.al. 2409.07754 null
2024-09-12 Self-similarity of temporal interaction networks arises from hyperbolic geometry with time-varying curvature Subhabrata Dutta et.al. 2409.07733 link
2024-09-12 A Conceptual Framework for Understanding Empathy in Physics Faculty Alia Hamdan et.al. 2409.07724 null
2024-09-12 CollaMamba: Efficient Collaborative Perception with Cross-Agent Spatial-Temporal State Space Model Yang Li et.al. 2409.07714 null
2024-09-12 DSBench: How Far Are Data Science Agents to Becoming Data Science Experts? Liqiang Jing et.al. 2409.07703 link
2024-09-11 SimulBench: Evaluating Language Models with Creative Simulation Tasks Qi Jia et.al. 2409.07641 null
2024-09-11 HERL: Tiered Federated Learning with Adaptive Homomorphic Encryption using Reinforcement Learning Jiaxang Tang et.al. 2409.07631 null
2024-09-11 A Survey of Inverse Constrained Reinforcement Learning: Definitions, Progress and Challenges Guiliang Liu et.al. 2409.07569 null
2024-09-11 Connecting extended Wigner's friend arguments and noncontextuality Laurens Walleghem et.al. 2409.07537 null
2024-09-13 MoA is All You Need: Building LLM Research Team using Mixture of Agents Sandy Chen et.al. 2409.07487 null
2024-09-04 MarS: a Financial Market Simulation Engine Powered by Generative Foundation Model Junjie Li et.al. 2409.07486 null
2024-09-11 "My Grade is Wrong!": A Contestable AI Framework for Interactive Feedback in Evaluating Student Essays Shengxin Hong et.al. 2409.07453 null
2024-09-11 SUPER: Evaluating Agents on Setting Up and Executing Tasks from Research Repositories Ben Bogin et.al. 2409.07440 link
2024-09-11 Agent Workflow Memory Zora Zhiruo Wang et.al. 2409.07429 link
2024-09-11 Hierarchical Reinforcement Learning for Temporal Abstraction of Listwise Recommendation Luo Ji et.al. 2409.07416 null
2024-09-11 A Contrastive Symmetric Forward-Forward Algorithm (SFFA) for Continual Learning Tasks Erik B. Terres-Escudero et.al. 2409.07387 null
2024-09-11 Policy consequences of the new neuroeconomic framework A. David Redish et.al. 2409.07373 null
2024-09-11 Online Decision MetaMorphFormer: A Casual Transformer-Based Reinforcement Learning Framework of Universal Embodied Intelligence Luo Ji et.al. 2409.07341 null
2024-09-11 Explanation, Debate, Align: A Weak-to-Strong Framework for Language Model Generalization Mehrdad Zakershahrak et.al. 2409.07335 null
2024-09-11 Using Generative Agents to Create Tip Sheets for Investigative Data Reporting Joris Veerbeek et.al. 2409.07286 null
2024-09-11 Multi-Type Preference Learning: Empowering Preference-Based Reinforcement Learning with Equal Preferences Ziang Liu et.al. 2409.07268 null
2024-09-11 Cross-Dialect Text-To-Speech in Pitch-Accent Language Incorporating Multi-Dialect Phoneme-Level BERT Kazuki Yamauchi et.al. 2409.07265 null
2024-09-11 Propaganda to Hate: A Multimodal Analysis of Arabic Memes with Multi-Agent LLMs Firoj Alam et.al. 2409.07246 null
2024-09-11 A Perspective on AI-Guided Molecular Simulations in VR: Exploring Strategies for Imitation Learning in Hyperdimensional Molecular Systems Mohamed Dhouioui et.al. 2409.07189 null
2024-09-11 Identify Design Problems Through Questioning: Exploring Role-playing Interactions with Large Language Models to Foster Design Questioning Skills Hyunseung Lim et.al. 2409.07178 null
2024-09-11 Learning Efficient Recursive Numeral Systems via Reinforcement Learning Jonathan D. Thomas et.al. 2409.07170 null
2024-09-11 Randomized Strategic Facility Location with Predictions Eric Balkanski et.al. 2409.07142 null
2024-09-11 MVLLaVA: An Intelligent Agent for Unified and Flexible Novel View Synthesis Hanyu Jiang et.al. 2409.07129 null
2024-09-11 DCMAC: Demand-aware Customized Multi-Agent Communication via Upper Bound Training Dongkun Huo et.al. 2409.07127 null
2024-09-17 Inefficient Alliance Formation in Coalitional Blotto Games Vade Shah et.al. 2409.06899 null
2024-09-10 A Quality Diversity Approach to Automatically Generate Multi-Agent Path Finding Benchmark Maps Cheng Qian et.al. 2409.06888 null
2024-09-10 Can Agents Spontaneously Form a Society? Introducing a Novel Architecture for Generative Multi-Agents to Elicit Social Emergence H. Zhang et.al. 2409.06750 null
2024-09-19 Decentralized Neural Networks for Robust and Scalable Eigenvalue Computation Ronald Katende et.al. 2409.06746 null
2024-09-10 Memory and Personality in Ideological Polarization: The Politico-physics of Mnemomatter Shengkai Li et.al. 2409.06660 null
2024-09-10 Fixed-budget and Multiple-issue Quadratic Voting Laura Georgescu et.al. 2409.06614 null
2024-09-10 On Epistemic Properties in Discrete-Event Systems: A Uniform Framework and Its Applications Bohan Cui et.al. 2409.06588 null
2024-09-10 Think-on-Process: Dynamic Process Generation for Collaborative Development of Multi-Agent System Leilei Lin et.al. 2409.06568 link
2024-09-10 Indirect Dynamic Negotiation in the Nash Demand Game Tatiana V. Guy et.al. 2409.06566 null
2024-09-10 Social Mediation through Robots -- A Scoping Review on Improving Group Interactions through Directed Robot Action using an Extended Group Process Model Thomas H. Weisswange et.al. 2409.06557 null
2024-09-10 Coordinated Motion Planning: Multi-Agent Path Finding in a Densely Packed, Bounded Domain Sándor P. Fekete et.al. 2409.06486 null
2024-09-10 Learning Generative Interactive Environments By Trained Agent Exploration Naser Kazemi et.al. 2409.06445 link
2024-09-10 Position Fair Mechanisms Allocating Indivisible Goods Ryoga Mahara et.al. 2409.06423 null
2024-09-10 Exploring the Integration of Large Language Models in Industrial Test Maintenance Processes Ludvig Lemner et.al. 2409.06416 null
2024-09-10 MAGDA: Multi-agent guideline-driven diagnostic assistance David Bani-Harouni et.al. 2409.06351 null
2024-09-17 Foragax: An Agent-Based Modelling Framework Based on JAX Siddharth Chaturvedi et.al. 2409.06345 link
2024-09-10 Towards Agentic AI on Particle Accelerators Antonin Sulc et.al. 2409.06336 null
2024-09-11 Modified Meta-Thompson Sampling for Linear Bandits and Its Bayes Regret Analysis Hao Li et.al. 2409.06329 null
2024-09-10 Automate Strategy Finding with LLM in Quant investment Zhizhuo Kou et.al. 2409.06289 null
2024-09-10 Evidence gathering under competitive and noncompetitive rewards Philip Brookins et.al. 2409.06248 null
2024-09-10 INTRA: Interaction Relationship-aware Weakly Supervised Affordance Grounding Ji Ha Jang et.al. 2409.06210 null
2024-09-11 A Policy Iteration Method for Inverse Mean Field Games Kui Ren et.al. 2409.06184 null
2024-09-10 Contrastive Federated Learning with Tabular Data Silos Achmad Ginanjar et.al. 2409.06123 null
2024-09-14 ClarQ-LLM: A Benchmark for Models Clarifying and Requesting Information in Task-Oriented Dialog Yujian Gan et.al. 2409.06097 link
2024-09-09 Coarse Descriptions and Cautious Preferences Evan Piermont et.al. 2409.06054 null
2024-09-09 When Learning Meets Dynamics: Distributed User Connectivity Maximization in UAV-Based Communication Networks Bowei Li et.al. 2409.06010 null
2024-09-09 Promptable Closed-loop Traffic Simulation Shuhan Tan et.al. 2409.05863 null
2024-09-15 MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct Run Luo et.al. 2409.05840 null
2024-09-09 Cooperative Decision-Making for CAVs at Unsignalized Intersections: A MARL Approach with Attention and Hierarchical Game Priors Jiaqi Liu et.al. 2409.05712 null
2024-09-09 StratXplore: Strategic Novelty-seeking and Instruction-aligned Exploration for Vision and Language Navigation Muraleekrishna Gopinathan et.al. 2409.05593 null
2024-09-09 Interpretable Responsibility Sharing as a Heuristic for Task and Motion Planning Arda Sarp Yenicesu et.al. 2409.05586 link
2024-09-09 SciAgents: Automating scientific discovery through multi-agent intelligent graph reasoning Alireza Ghafarollahi et.al. 2409.05556 link
2024-09-09 Seeing is Believing? Enhancing Vision-Language Navigation using Visual Perturbations Xuesong Zhang et.al. 2409.05552 null
2024-09-09 A refined Frauchiger--Renner paradox based on strong contextuality Laurens Walleghem et.al. 2409.05491 null
2024-09-09 Adaptive Multi-Layer Deployment for A Digital Twin Empowered Satellite-Terrestrial Integrated Network Yihong Tao et.al. 2409.05480 null
2024-09-09 Reinforcement Learning for Variational Quantum Circuits Design Simone Foderà et.al. 2409.05475 null
2024-09-09 Semifactual Explanations for Reinforcement Learning Jasmina Gajcin et.al. 2409.05435 link
2024-09-09 Leveraging Computation of Expectation Models for Commonsense Affordance Estimation on 3D Scene Graphs Mario Alberto Valdes Saucedo et.al. 2409.05392 null
2024-09-09 BAMDP Shaping: a Unified Theoretical Framework for Intrinsic Motivation and Reward Shaping Aly Lidayan et.al. 2409.05358 null
2024-09-09 Obvious Strategy-proofness with Respect to a Partition R. Pablo Arribillaga et.al. 2409.05315 null
2024-09-09 Distributed Robust Continuous-Time Optimization Algorithms for Time-Varying Constrained Cost Zeinab Ebrahimi et.al. 2409.05293 null
2024-09-09 Towards Fast Rates for Federated and Multi-Task Reinforcement Learning Feng Zhu et.al. 2409.05291 null
2024-09-08 COVID19-CBABM: A City-Based Agent Based Disease Spread Modeling Framework Raunak Sarbajna et.al. 2409.05235 null
2024-09-08 Banded phases in topological flocks Charles R. Packard et.al. 2409.05198 null
2024-09-08 Difference Between Cyclic and Distributed Approach in Stochastic Optimization for Multi-agent System Jiahao Shi et.al. 2409.05155 null
2024-09-08 Nonlinear Cooperative Output Regulation with Input Delay Compensation Shiqi Zheng et.al. 2409.05113 null
2024-09-11 Decentralized Control of Multi-Agent Systems Under Acyclic Spatio-Temporal Task Dependencies Gregorio Marchesini et.al. 2409.05106 null
2024-09-08 Pareto-Optimal Peer-to-Peer Risk Sharing with Robust Distortion Risk Measures Mario Ghossoub et.al. 2409.05103 null
2024-09-08 On final opinions of the Friedkin-Johnsen model over random graphs with partially stubborn community Lingfei Wang et.al. 2409.05063 null
2024-09-08 Towards Multi-agent Policy-based Directed Hypergraph Learning for Traffic Signal Control Kang Wang et.al. 2409.05037 null
2024-09-08 Cooperative Learning-Based Framework for VNF Caching and Placement Optimization over Low Earth Orbit Satellite Networks Khai Doan et.al. 2409.05025 null
2024-09-08 A Pair Programming Framework for Code Generation via Multi-Plan Exploration and Feedback-Driven Refinement Huan Zhang et.al. 2409.05001 link
2024-09-08 Multi-V2X: A Large Scale Multi-modal Multi-penetration-rate Dataset for Cooperative Perception Rongsong Li et.al. 2409.04980 null
2024-09-07 DEPLOYERS: An agent based modeling tool for multi country real world data Martin Jaraiz et.al. 2409.04876 null
2024-09-07 Adaptation Procedure in Misinformation Games Konstantinos Varsos et.al. 2409.04854 null
2024-09-07 Context-Aware Replanning with Pre-explored Semantic Map for Object Navigation Hung-Ting Su et.al. 2409.04837 null
2024-09-07 LMGT: Optimizing Exploration-Exploitation Balance in Reinforcement Learning through Language Model Guided Trade-offs Yongxin Deng et.al. 2409.04744 null
2024-09-07 Algorithmic Scenario Generation as Quality Diversity Optimization Stefanos Nikolaidis et.al. 2409.04711 null
2024-09-07 Learning Optimal Stable Matches in Decentralized Markets with Unknown Preferences Vade Shah et.al. 2409.04669 null
2024-09-10 QueryBuilder: Human-in-the-Loop Query Development for Information Retrieval Hemanth Kandula et.al. 2409.04667 null
2024-09-06 Stacked Universal Successor Feature Approximators for Safety in Reinforcement Learning Ian Cannon et.al. 2409.04641 null
2024-09-06 Sparse Rewards Can Self-Train Dialogue Agents Barrett Martin Lattimer et.al. 2409.04617 link
2024-09-15 Decentralized Learning in General-sum Markov Games Chinmay Maheshwari et.al. 2409.04613 null
2024-09-06 Impact of Transit on Mobility, Equity, and Economy in the Chicago Metropolitan Region Omer Verbas et.al. 2409.04568 null
2024-09-03 State and Action Factorization in Power Grids Gianvito Losapio et.al. 2409.04467 null
2024-09-03 Here's Charlie! Realising the Semantic Web vision of Agents in the age of LLMs Jesse Wright et.al. 2409.04465 null
2024-09-06 A Survey on Knowledge Organization Systems of Research Fields: Resources and Challenges Angelo Salatino et.al. 2409.04432 null
2024-09-06 RLPF: Reinforcement Learning from Prediction Feedback for User Summarization with LLMs Jiaxing Wu et.al. 2409.04421 null
2024-09-06 MATWA: A Web Toolkit for Matching under Preferences Frederik Glitzner et.al. 2409.04402 null
2024-09-06 Cs-O$_2$-Li as enhanced NEA surface layer with increased lifetime for GaAs photocathodes Maximilian Herbert et.al. 2409.04319 null
2024-09-06 Safe and Efficient Path Planning under Uncertainty via Deep Collision Probability Fields Felix Herrmann et.al. 2409.04306 null
2024-09-06 Using Large Language Models to Generate Authentic Multi-agent Knowledge Work Datasets Desiree Heim et.al. 2409.04286 null
2024-09-06 Collective chemotactic search strategies Hugues Meyer et.al. 2409.04262 null
2024-09-06 SPACE: A Python-based Simulator for Evaluating Decentralized Multi-Robot Task Allocation Algorithms Inmo Jang et.al. 2409.04230 link
2024-09-06 FPT Algorithms using Minimal Parameters for a Generalized Version of Maximin Shares Klaus Jansen et.al. 2409.04225 null
2024-09-06 Advancing Multi-Organ Disease Care: A Hierarchical Multi-Agent Reinforcement Learning Framework Daniel J. Tan et.al. 2409.04224 null
2024-09-06 Runtime analysis of a coevolutionary algorithm on impartial combinatorial games Alistair Benford et.al. 2409.04177 null
2024-09-06 Towards a Socially Acceptable Competitive Equilibrium in Energy Markets Koorosh Shomalzadeh et.al. 2409.04157 null
2024-09-06 Can LLMs Generate Novel Research Ideas? A Large-Scale Human Study with 100+ NLP Researchers Chenglei Si et.al. 2409.04109 link
2024-09-06 Tighter Analysis for Decentralized Stochastic Gradient Method: Impact of Data Homogeneity Qiang Li et.al. 2409.04092 null
2024-09-06 Surface Patterns Shaped by Additives in Crystals M. A. Chabowska et.al. 2409.04084 null
2024-09-05 DRAL: Deep Reinforcement Adaptive Learning for Multi-UAVs Navigation in Unknown Indoor Environment Kangtong Mo et.al. 2409.03930 null
2024-09-05 On the Convergence Rates of Federated Q-Learning across Heterogeneous Environments Muxing Wang et.al. 2409.03897 null
2024-09-05 Multi-agent Path Finding for Mixed Autonomy Traffic Coordination Han Zheng et.al. 2409.03881 null
2024-09-05 PARCO: Learning Parallel Autoregressive Policies for Efficient Multi-Agent Combinatorial Optimization Federico Berto et.al. 2409.03811 link
2024-09-04 NESTFUL: A Benchmark for Evaluating LLMs on Nested Sequences of API Calls Kinjal Basu et.al. 2409.03797 null
2024-09-13 Safeguarding AI Agents: Developing and Analyzing Safety Architectures Ishaan Domkundwar et.al. 2409.03793 null
2024-08-31 BreachSeek: A Multi-Agent Automated Penetration Tester Ibrahim Alshehri et.al. 2409.03789 link
2024-09-06 RAG based Question-Answering for Contextual Response Prediction System Sriram Veturi et.al. 2409.03708 null
2024-09-05 TRACE-cs: Trustworthy Reasoning for Contrastive Explanations in Course Scheduling Problems Stylianos Loukas Vasileiou et.al. 2409.03671 null
2024-09-06 LLM-based multi-agent poetry generation in non-cooperative environments Ran Zhang et.al. 2409.03659 link
2024-09-05 A Complete Landscape of EFX Allocations of Mixed Manna on Graphs Yu Zhou et.al. 2409.03594 null
2024-09-05 CHIRPs: Change-Induced Regret Proxy metrics for Lifelong Reinforcement Learning John Birkbeck et.al. 2409.03577 null
2024-09-05 From MOOC to MAIC: Reshaping Online Teaching and Learning through LLM-driven Agents Jifan Yu et.al. 2409.03512 null
2024-09-05 Rx Strategist: Prescription Verification using LLM Agents System Phuc Phan Van et.al. 2409.03440 null
2024-09-05 Reinforcement Learning Approach to Optimizing Profilometric Sensor Trajectories for Surface Inspection Sara Roos-Hoefgeest et.al. 2409.03429 null
2024-09-05 Game On: Towards Language Models as RL Experimenters Jingwei Zhang et.al. 2409.03402 null
2024-09-05 ELO-Rated Sequence Rewards: Advancing Reinforcement Learning Models Qi Ju et.al. 2409.03301 link
2024-09-05 Robust synchronization and policy adaptation for networked heterogeneous agents Miguel F. Arevalo-Castiblanco et.al. 2409.03273 null
2024-09-05 GraphInsight: Unlocking Insights in Large Language Models for Graph Structure Understanding Yukun Cao et.al. 2409.03258 null
2024-09-05 E2CL: Exploration-based Error Correction Learning for Embodied Agents Hanlin Wang et.al. 2409.03256 null
2024-09-05 Improving agent performance in fluid environments by perceptual pretraining Jin Zhang et.al. 2409.03230 null
2024-09-05 xLAM: A Family of Large Action Models to Empower AI Agent Systems Jianguo Zhang et.al. 2409.03215 link
2024-09-05 Predefined-time distributed non-convex optimization via a time-base generator Qinlong Lin et.al. 2409.03188 null
2024-09-11 Continual Skill and Task Learning via Dialogue Weiwei Gu et.al. 2409.03166 null
2024-09-04 Subsidy design for better social outcomes Maria-Florina Balcan et.al. 2409.03129 null
2024-09-04 RoboKoop: Efficient Control Conditioned Representations from Visual Input in Robotics using Koopman Operator Hemant Kumawat et.al. 2409.03107 null
2024-09-04 Resilient Two-Time-Scale Local Stochastic Gradient Descent for Byzantine Federated Learning Amit Dutta et.al. 2409.03092 null
2024-09-04 An Introduction to Centralized Training for Decentralized Execution in Cooperative Multi-Agent Reinforcement Learning Christopher Amato et.al. 2409.03052 null
2024-09-04 Large Language Model-Based Agents for Software Engineering: A Survey Junwei Liu et.al. 2409.02977 link
2024-09-03 Managing multiple agents by automatically adjusting incentives Shunichi Akatsuka et.al. 2409.02960 null
2024-09-04 LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via Hybrid Architecture Xidong Wang et.al. 2409.02889 link
2024-09-04 Bioinformatics Retrieval Augmentation Data (BRAD) Digital Assistant Joshua Pickard et.al. 2409.02864 null
2024-09-06 Language Understanding as a Constraint on Consensus Size in LLM Societies Giordano De Marzo et.al. 2409.02822 null
2024-09-04 Ion-specific Stability of Gold Nanoparticle Suspensions Philipp Ritzert et.al. 2409.02762 null
2024-09-04 Adaptive Formation Learning Control for Cooperative AUVs under Complete Uncertainty Emadodin Jandaghi et.al. 2409.02745 null
2024-09-04 Creating a Gen-AI based Track and Trace Assistant MVP (SuperTracy) for PostNL Mohammad Reshadati et.al. 2409.02711 null
2024-09-04 Decision Transformer for Enhancing Neural Local Search on the Job Shop Scheduling Problem Constantin Waubert de Puiseau et.al. 2409.02697 null
2024-09-04 Generalized Individual Q-learning for Polymatrix Games with Partial Observations Ahmed Said Donmez et.al. 2409.02663 null
2024-09-04 A Survey on Emergent Language Jannik Peters et.al. 2409.02645 null
2024-09-04 Evaluating Environments Using Exploratory Agents Bobby Khaleque et.al. 2409.02632 null
2024-09-04 Advancing Cyber Incident Timeline Analysis Through Rule Based AI and Large Language Models Fatma Yasmine Loumachi et.al. 2409.02572 null
2024-09-04 Vision-Language Navigation with Continual Learning Zhiyuan Li et.al. 2409.02561 null
2024-09-05 A Sequential Decision-Making Model for Perimeter Identification Ayal Taitler et.al. 2409.02549 null
2024-09-04 Astrochemistry on Galactic scales L. Colzi et.al. 2409.02537 null
2024-09-04 Cog-GA: A Large Language Models-based Generative Agent for Vision-Language Navigation in Continuous Environments Zhiyuan Li et.al. 2409.02522 null
2024-09-04 Continual Diffuser (CoD): Mastering Continual Offline Reinforcement Learning with Experience Rehearsal Jifeng Hu et.al. 2409.02512 link
2024-09-04 Occlusion-Based Cooperative Transport for Concave Objects with a Swarm of Miniature Mobile Robots Sanjuksha Nirgude et.al. 2409.02436 null
2024-09-04 Context-Aware Agent-based Model for Smart Long Distance Transport System Muhammad Raees et.al. 2409.02434 null
2024-09-04 Building Math Agents with Multi-Turn Iterative Preference Learning Wei Xiong et.al. 2409.02392 null
2024-09-04 Multi-modal Situated Reasoning in 3D Scenes Xiongkun Linghu et.al. 2409.02389 null
2024-09-04 Neighbourhood conditions for network stability with link uncertainty Simone Mariano et.al. 2409.02350 null
2024-09-03 Kinesthetic Teaching in Robotics: a Mixed Reality Approach Simone Macciò et.al. 2409.02305 null
2024-09-03 Multi-Agent Reinforcement Learning for Joint Police Patrol and Dispatch Matthew Repasky et.al. 2409.02246 null
2024-09-02 AutoEncoder Convolutional Neural Network for Pneumonia Detection Michael Nosa-Omoruyi et.al. 2409.02142 null
2024-09-01 TrajWeaver: Trajectory Recovery with State Propagation Diffusion Model Jinming Wang et.al. 2409.02124 null
2024-09-05 Noise-free comparison of stochastic agent-based simulations using common random numbers Daniel J. Klein et.al. 2409.02086 null
2024-09-03 A Modern Take on Visual Relationship Reasoning for Grasp Planning Paolo Rabino et.al. 2409.02035 null
2024-09-03 Optimal allocations with capacity constrained verification Albin Erlanson et.al. 2409.02031 null
2024-09-03 Planning to avoid ambiguous states through Gaussian approximations to non-linear sensors in active inference agents Wouter M. Kouw et.al. 2409.01974 null
2024-09-03 Snapshot: Towards Application-centered Models for Pedestrian Trajectory Prediction in Urban Traffic Environments Nico Uhlemann et.al. 2409.01971 null
2024-09-03 Achieving Maximin Share and EFX/EF1 Guarantees Simultaneously Hannaneh Akrami et.al. 2409.01963 null
2024-09-03 Learning Resilient Formation Control of Drones with Graph Attention Network Jiaping Xiao et.al. 2409.01953 null
2024-09-03 From Grounding to Planning: Benchmarking Bottlenecks in Web Agents Segev Shlomov et.al. 2409.01927 null
2024-09-03 Focus Agent: LLM-Powered Virtual Focus Group Taiyu Zhang et.al. 2409.01907 null
2024-09-03 What are the Essential Factors in Crafting Effective Long Context Multi-Hop Instruction Datasets? Insights and Best Practices Zhi Chen et.al. 2409.01893 link
2024-09-03 AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction Yuchen Shi et.al. 2409.01854 link
2024-09-03 Segmenting Object Affordances: Reproducibility and Sensitivity to Scale Tommaso Apicella et.al. 2409.01814 link
2024-09-03 Empirical evidence of Large Language Model's influence on human spoken communication Hiromu Yakura et.al. 2409.01754 null
2024-09-03 4D-CAT: Synthesis of 4D Coronary Artery Trees from Systole and Diastole Daosong Hu et.al. 2409.01725 null
2024-09-03 VProChart: Answering Chart Question through Visual Perception Alignment Agent and Programmatic Solution Reasoning Muye Huang et.al. 2409.01667 null
2024-09-03 T1-contrast Enhanced MRI Generation from Multi-parametric MRI for Glioma Patients with Latent Tumor Conditioning Zach Eidex et.al. 2409.01622 null
2024-09-03 A Time-Intensity Aware Pipeline for Generating Late-Stage Breast DCE-MRI using Generative Adversarial Models Ruben D. Fonnegra et.al. 2409.01596 null
2024-09-03 Convergence of the Heterogeneous Deffuant-Weisbuch Model: A Complete Proof and Some Extensions Ge Chen et.al. 2409.01593 null
2024-09-03 An Implementation of Werewolf Agent That does not Truly Trust LLMs Takehiro Sato et.al. 2409.01575 null
2024-09-03 Purification-Agnostic Proxy Learning for Agentic Copyright Watermarking against Adversarial Evidence Forgery Erjin Bao et.al. 2409.01541 null
2024-09-03 Bridging the Gap Between Central and Local Decision-Making: The Efficacy of Collaborative Equilibria in Altruistic Congestion Games Bryce L Ferguson et.al. 2409.01525 null
2024-09-02 The Compressor-Retriever Architecture for Language Model OS Yuan Yang et.al. 2409.01495 link
2024-09-02 Watermarking of Quantum Circuits Rupshali Roy et.al. 2409.01484 null
2024-09-02 Irreversible investment under weighted discounting: effects of decreasing impatience Pengyu Wei et.al. 2409.01478 null
2024-09-02 Real-Time Recurrent Learning using Trace Units in Reinforcement Learning Esraa Elelimy et.al. 2409.01449 null
2024-09-02 Performance-Aware Self-Configurable Multi-Agent Networks: A Distributed Submodular Approach for Simultaneous Coordination and Network Design Zirui Xu et.al. 2409.01411 link
2024-09-02 GenAgent: Build Collaborative AI Systems with Automated Workflow Generation -- Case Studies on ComfyUI Xiangyuan Xue et.al. 2409.01392 null
2024-09-02 Modeling contagious disease spreading Dipak Patra et.al. 2409.01103 null
2024-09-02 Two-Timescale Synchronization and Migration for Digital Twin Networks: A Multi-Agent Deep Reinforcement Learning Approach Wenshuai Liu et.al. 2409.01092 null
2024-09-02 Learning in Hybrid Active Inference Models Poppy Collis et.al. 2409.01066 null
2024-09-02 Multiagent Reinforcement Learning Enhanced Decision-making of Crew Agents During Floor Construction Process Bin Yang et.al. 2409.01060 null
2024-09-02 Accelerated Multi-objective Task Learning using Modified Q-learning Algorithm Varun Prakash Rajamohan et.al. 2409.01046 null
2024-09-02 Federated Deep Reinforcement Learning-Based Intelligent Channel Access in Dense Wi-Fi Deployments Xinyang Du et.al. 2409.01004 null
2024-09-02 Evolution of Social Norms in LLM Agents using Natural Language Ilya Horiguchi et.al. 2409.00993 null
2024-09-02 Co-Learning: Code Learning for Multi-Agent Reinforcement Collaborative Framework with Conversational Natural Language Interfaces Jiapeng Yu et.al. 2409.00985 link
2024-09-02 Semantically Controllable Augmentations for Generalizable Robot Learning Zoey Chen et.al. 2409.00951 null
2024-09-02 Distributed Optimization under Edge Agreement with Application in Battery Network Management Zehui Lu et.al. 2409.00936 null
2024-09-02 ToolACE: Winning the Points of LLM Function Calling Weiwen Liu et.al. 2409.00920 null
2024-09-04 MarsCode Agent: AI-native Automated Bug Fixing Yizhou Liu et.al. 2409.00899 null
2024-09-02 Whole-Body Control Through Narrow Gaps From Pixels To Action Tianyue Wu et.al. 2409.00895 null
2024-09-01 Self-evolving Agents with reflective and memory-augmented abilities Xuechen Liang et.al. 2409.00872 null
2024-09-01 JaxLife: An Open-Ended Agentic Simulator Chris Lu et.al. 2409.00853 link
2024-09-01 Satisficing Equilibrium Bary S. R. Pradelski et.al. 2409.00832 null
2024-09-01 Digital Homunculi: Reimagining Democracy Research with Generative Agents Petr Specian et.al. 2409.00826 null
2024-09-01 Accelerating Hybrid Agent-Based Models and Fuzzy Cognitive Maps: How to Combine Agents who Think Alike? Philippe J. Giabbanelli et.al. 2409.00824 null
2024-09-01 Cooperative Path Planning with Asynchronous Multiagent Reinforcement Learning Jiaming Yin et.al. 2409.00754 null
2024-09-01 Simulation of Social Media-Driven Bubble Formation in Financial Markets using an Agent-Based Model with Hierarchical Influence Network Gonzalo Bohorquez et.al. 2409.00742 link
2024-09-01 Fair Reciprocal Recommendation in Matching Markets Yoji Tomita et.al. 2409.00720 link
2024-09-04 Multi-Agent Reinforcement Learning from Human Feedback: Data Coverage and Algorithmic Techniques Natalia Zhang et.al. 2409.00717 null
2024-09-01 Universal Finite-State and Self-Stabilizing Computation in Anonymous Dynamic Networks Giuseppe A. Di Luna et.al. 2409.00688 null
2024-09-01 A Learnable Agent Collaboration Network Framework for Personalized Multimodal AI Search Engine Yunxiao Shi et.al. 2409.00636 null
2024-09-01 Roundabout Dilemma Zone Data Mining and Forecasting with Trajectory Prediction and Graph Neural Networks Manthan Chelenahalli Satish et.al. 2409.00622 null
2024-09-01 TinyAgent: Function Calling at the Edge Lutfi Eren Erdogan et.al. 2409.00608 null
2024-09-01 Average-case optimization analysis for distributed consensus algorithms on regular graphs Nhat Trung Nguyen et.al. 2409.00605 null
2024-09-04 GenAI-powered Multi-Agent Paradigm for Smart Urban Mobility: Opportunities and Challenges for Integrating Large Language Models (LLMs) and Retrieval-Augmented Generation (RAG) with Intelligent Transportation Systems Haowen Xu et.al. 2409.00494 null
2024-08-31 Online Learning of Interaction Dynamics with Dual Model Predictive Control for Multi-Agent Systems Using Gaussian Processes T. M. J. T. Baltussen et.al. 2409.00432 null
2024-08-31 Chatting Up Attachment: Using LLMs to Predict Adult Bonds Paulo Soares et.al. 2409.00347 null
2024-08-29 PrivacyLens: Evaluating Privacy Norm Awareness of Language Models in Action Yijia Shao et.al. 2409.00138 link
2024-08-29 HoneyComb: A Flexible LLM-Based Agent System for Materials Science Huan Zhang et.al. 2409.00135 null
2024-08-29 MAPF-GPT: Imitation Learning for Multi-Agent Pathfinding at Scale Anton Andreychuk et.al. 2409.00134 link
2024-08-27 Evaluating the Impact of Multiple DER Aggregators on Wholesale Energy Markets: A Hybrid Mean Field Approach Jun He et.al. 2409.00107 null
2024-08-27 Collective Predictive Coding as Model of Science: Formalizing Scientific Activities Towards Generative Science Tadahiro Taniguchi et.al. 2409.00102 null
2024-08-27 Modelisation a base d'Agent Augmentes par LLM pour les Simulations Sociales: Defis et Opportunites Önder Gürcan et.al. 2409.00100 null
2024-08-24 Towards Human-Level Understanding of Complex Process Engineering Schematics: A Pedagogical, Introspective Multi-Agent Framework for Open-Domain Question Answering Sagar Srinivas Sakhinana et.al. 2409.00082 null
2024-08-30 Robust Technology Regulation Andrew Koh et.al. 2408.17398 null
2024-08-30 Traffic expertise meets residual RL: Knowledge-informed model-based residual reinforcement learning for CAV trajectory control Zihao Sheng et.al. 2408.17380 link
2024-08-30 EMPOWER: Embodied Multi-role Open-vocabulary Planning with Online Grounding and Execution Francesco Argenziano et.al. 2408.17379 null
2024-08-30 Non-reciprocal spin-glass transition and aging Giulia Garcia Lorenzana et.al. 2408.17360 null
2024-08-30 Why do elites extend property rights: unlocking investment and the switch to public goods Alastair Langtry et.al. 2408.17335 null
2024-08-30 All You Need is Group Actions: Advancing Robust Autonomous Planning Vincenzo Basco et.al. 2408.17295 null
2024-08-30 Predicting the Impact of Generative AI Using an Agent-Based Model Joao Tiago Aparicio et.al. 2408.17268 null
2024-08-30 Using Quantum Solved Deep Boltzmann Machines to Increase the Data Efficiency of RL Agents Daniel Kent et.al. 2408.17240 null
2024-08-30 Asynchronous Distributed Learning with Quantized Finite-Time Coordination Nicola Bastianello et.al. 2408.17156 null
2024-08-30 Efficient Camera Exposure Control for Visual Odometry via Deep Reinforcement Learning Shuyang Zhang et.al. 2408.17005 link
2024-08-30 Characterizing User Platforms for Video Streaming in Broadband Networks Yifan Wang et.al. 2408.16995 link
2024-08-30 Tool-Assisted Agent on SQL Inspection and Refinement in Real-World Scenarios Zhongyuan Wang et.al. 2408.16991 null
2024-08-30 Beyond Preferences in AI Alignment Tan Zhi-Xuan et.al. 2408.16984 null
2024-08-30 The Sample-Communication Complexity Trade-off in Federated Q-Learning Sudeep Salgia et.al. 2408.16981 null
2024-08-30 Discovery of False Data Injection Schemes on Frequency Controllers with Reinforcement Learning Romesh Prasad et.al. 2408.16958 null
2024-08-29 Robotic warehousing operations: a learn-then-optimize approach to large-scale neighborhood search Cynthia Barnhart et.al. 2408.16890 null
2024-08-29 Learning Multi-agent Multi-machine Tending by Mobile Robots Abdalwhab Abdalwhab et.al. 2408.16875 null
2024-08-29 A framework for training and benchmarking algorithms that schedule robot tasks Wojciech Dudek et.al. 2408.16844 null
2024-08-29 AdapShare: An RL-Based Dynamic Spectrum Sharing Solution for O-RAN Sneihil Gopal et.al. 2408.16842 null
2024-08-29 A Bibliometric Analysis of Trust in Conversational Agents over the Past Fifteen Years Meltem Aksoy et.al. 2408.16837 null
2024-08-29 Maelstrom Networks Matthew Evanusa et.al. 2408.16632 null
2024-08-29 On the data-sparsity of the solution of Riccati equations with applications to feedback control Stefano Massei et.al. 2408.16569 null
2024-08-29 CooTest: An Automated Testing Approach for V2X Communication Systems An Guo et.al. 2408.16470 null
2024-08-29 Consensus Planning with Primal, Dual, and Proximal Agents Alvaro Maggiar et.al. 2408.16462 null
2024-08-29 3D Topological Modeling and Multi-Agent Movement Simulation for Viral Infection Risk Analysis Wassim Jabi et.al. 2408.16417 null
2024-09-04 Efficient Multi-agent Navigation with Lightweight DRL Policy Xingrong Diao et.al. 2408.16370 null
2024-08-29 Guided Reasoning: A Non-Technical Introduction Gregor Betz et.al. 2408.16331 link
2024-08-29 Autocorrelation properties of temporal networks governed by dynamic node variables Harrison Hartle et.al. 2408.16270 null
2024-08-29 Action potential dynamics on heterogenous neural networks: from kinetic to macroscopic equations Marzia Bisi et.al. 2408.16214 null
2024-08-28 DECAF: a Discrete-Event based Collaborative Human-Robot Framework for Furniture Assembly Giulio Giacomuzzo et.al. 2408.16125 null
2024-08-28 EPO: Hierarchical LLM Agents with Environment Preference Optimization Qi Zhao et.al. 2408.16090 null
2024-08-28 Logic-Enhanced Language Model Agents for Trustworthy Social Simulations Agnieszka Mensfelt et.al. 2408.16081 link
2024-08-28 Hitting the Gym: Reinforcement Learning Control of Exercise-Strengthened Biohybrid Robots in Simulation Saul Schaffer et.al. 2408.16069 null
2024-08-28 An Extremely Data-efficient and Generative LLM-based Reinforcement Learning Agent for Recommenders Shuang Feng et.al. 2408.16032 null
2024-08-28 Thoughtseeds: Evolutionary Priors, Nested Markov Blankets, and the Emergence of Embodied Cognition Prakash Chandra Kavi et.al. 2408.15982 null
2024-08-28 WebPilot: A Versatile and Autonomous Multi-Agent System for Web Task Execution with Strategic Exploration Yao Zhang et.al. 2408.15978 null
2024-08-28 BattleAgentBench: A Benchmark for Evaluating Cooperation and Competition Capabilities of Language Models in Multi-Agent Systems Wei Wang et.al. 2408.15971 null
2024-08-28 Atari-GPT: Investigating the Capabilities of Multimodal Large Language Models as Low-Level Policies for Atari Games Nicholas R. Waytowich et.al. 2408.15950 null
2024-08-28 Auxiliary Input in Training: Incorporating Catheter Features into Deep Learning Models for ECG-Free Dynamic Coronary Roadmapping Yikang Liu et.al. 2408.15947 null
2024-09-02 Persuasion Games using Large Language Models Ganesh Prasath Ramani et.al. 2408.15879 null
2024-08-28 Retrieval-Augmented Instruction Tuning for Automated Process Engineering Calculations : A Tool-Chaining Problem-Solving Framework with Attributable Reflection Sagar Srinivas Sakhinana et.al. 2408.15866 null
2024-08-28 FlowAct: A Proactive Multimodal Human-robot Interaction System with Continuous Flow of Perception and Modular Action Sub-systems Timothée Dhaussy et.al. 2408.15864 null
2024-08-28 Interactive Agents: Simulating Counselor-Client Psychological Counseling via Role-Playing LLM-to-LLM Interactions Huachuan Qiu et.al. 2408.15787 link
2024-09-05 LogicGame: Benchmarking Rule-Based Reasoning Abilities of Large Language Models Jiayi Gui et.al. 2408.15778 null
2024-08-28 A Survey on Evaluation of Multimodal Large Language Models Jiaxing Huang et.al. 2408.15769 null
2024-08-28 Evaluating and Comparing Crowd Simulations: Perspectives from a Crowd Authoring Tool Gabriel Fonseca Silva et.al. 2408.15762 null
2024-09-01 Reinforcement Learning for Adaptive Traffic Signal Control: Turn-Based and Time-Based Approaches to Reduce Congestion Muhammad Tahir Rafique et.al. 2408.15751 null
2024-08-28 Different Facets for Different Experts: A Framework for Streamlining The Integration of Qualitative Insights into ABM Development Vivek Nallur et.al. 2408.15725 null
2024-08-28 Skills Regularized Task Decomposition for Multi-task Offline Reinforcement Learning Minjong Yoo et.al. 2408.15593 null
2024-08-28 TrafficGamer: Reliable and Flexible Traffic Simulation for Safety-Critical Scenarios with Game-Theoretic Oracles Guanren Qiao et.al. 2408.15538 link
2024-08-28 Towards Fully Autonomous Research Powered by LLMs: Case Study on Simulations Zhihan Liu et.al. 2408.15512 link
2024-08-28 AeroVerse: UAV-Agent Benchmark Suite for Simulating, Pre-training, Finetuning, and Evaluating Aerospace Embodied World Models Fanglong Yao et.al. 2408.15511 null
2024-08-28 Infinite-Horizon Optimal Wireless Control Over Shared State-Dependent Fading Channels for IIoT Systems Shuling Wang et.al. 2408.15492 null
2024-08-27 Graph Attention Inference of Network Topology in Multi-Agent Systems Akshay Kolli et.al. 2408.15449 null
2024-08-27 Fast and Modular Autonomy Software for Autonomous Racing Vehicles Andrew Saba et.al. 2408.15425 null
2024-09-04 Simultaneous Training of First- and Second-Order Optimizers in Population-Based Reinforcement Learning Felix Pfeiffer et.al. 2408.15421 null
2024-08-27 On Stateful Value Factorization in Multi-Agent Reinforcement Learning Enrico Marchesini et.al. 2408.15381 null
2024-08-27 A Multi-Agent Reinforcement Learning Scheme for SFC Placement in Edge Computing Networks Congzhou Li et.al. 2408.15337 null
2024-08-27 Artificially intelligent Maxwell's demon for optimal control of open quantum systems Paolo Andrea Erdman et.al. 2408.15328 null
2024-08-27 TourSynbio: A Multi-Modal Large Model and Agent Framework to Bridge Text and Protein Sequences for Protein Engineering Yiqing Shen et.al. 2408.15299 link
2024-08-27 Into the Unknown Unknowns: Engaged Human Learning through Participation in Language Model Agent Conversations Yucheng Jiang et.al. 2408.15232 null
2024-08-27 Exploiting Approximate Symmetry for Efficient Multi-Agent Reinforcement Learning Batuhan Yardim et.al. 2408.15173 null
2024-08-27 Applications in CityLearn Gym Environment for Multi-Objective Control Benchmarking in Grid-Interactive Buildings and Districts Kingsley Nweye et.al. 2408.15170 null
2024-08-27 Delay as Payoff in MAB Ofir Schlisselberg et.al. 2408.15158 null
2024-08-27 muPRL: A Mutation Testing Pipeline for Deep Reinforcement Learning based on Real Faults Deepak-George Thomas et.al. 2408.15150 link
2024-08-29 No Regrets: Investigating and Improving Regret Approximations for Curriculum Discovery Alexander Rutherford et.al. 2408.15099 link
2024-08-23 Flexible categorization using formal concept analysis and Dempster-Shafer theory Marcel Boersma et.al. 2408.15012 null
2024-08-27 AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems Chi-Min Chan et.al. 2408.14972 link
2024-08-27 The Asymptotic Cost of Complexity Martin W Cripps et.al. 2408.14949 null
2024-08-27 Decentralized Unlabeled Multi-agent Pathfinding Via Target And Priority Swapping (With Supplementary) Stepan Dergachev et.al. 2408.14948 link
2024-08-27 Learning Robust Reward Machines from Noisy Labels Roko Parac et.al. 2408.14871 link
2024-08-27 Diffusion Models Are Real-Time Game Engines Dani Valevski et.al. 2408.14837 null
2024-08-27 Sequential-Scanning Dual-Energy CT Imaging Using High Temporal Resolution Image Reconstruction and Error-Compensated Material Basis Image Generation Qiaoxin Li et.al. 2408.14754 null
2024-08-27 Sub-Riemannian Geometry, Mixing, and the Holonomy of Optimal Mass Transport Mahmoud Abdelgalil et.al. 2408.14707 null
2024-08-26 Model-Based Reinforcement Learning for Control of Strongly-Disturbed Unsteady Aerodynamic Flows Zhecheng Liu et.al. 2408.14685 null
2024-08-26 Emergent Language in Open-Ended Environments Cornelius Wolff et.al. 2408.14649 null
2024-08-26 Biased Dueling Bandits with Stochastic Delayed Feedback Bongsoo Yi et.al. 2408.14603 null
2024-08-26 On Centralized Critics in Multi-Agent Reinforcement Learning Xueguang Lyu et.al. 2408.14597 link
2024-08-26 Multi-Agent Path Finding with Real Robot Dynamics and Interdependent Tasks for Automated Warehouses Vassilissa Lehoux-Lebacque et.al. 2408.14527 null
2024-08-26 A Survey on Reinforcement Learning Applications in SLAM Mohammad Dehghani Tezerjani et.al. 2408.14518 null
2024-08-24 Artificial intelligence for science: The easy and hard problems Ruairidh M. Battleday et.al. 2408.14508 null
2024-08-23 Knowledge Graph Modeling-Driven Large Language Model Operating System (LLM OS) for Task Automation in Process Engineering Problem-Solving Sakhinana Sagar Srinivas et.al. 2408.14494 null
2024-08-18 Agentic Retrieval-Augmented Generation for Time Series Analysis Chidaksh Ravuru et.al. 2408.14484 null
2024-08-26 Employing Artificial Intelligence to Steer Exascale Workflows with Colmena Logan Ward et.al. 2408.14434 null
2024-08-26 SWE-bench-java: A GitHub Issue Resolving Benchmark for Java Daoguang Zan et.al. 2408.14354 link
2024-09-03 Foundation Models for Music: A Survey Yinghao Ma et.al. 2408.14340 link
2024-08-26 Equivariant Reinforcement Learning under Partial Observability Hai Nguyen et.al. 2408.14336 null
2024-08-26 LLM-3D Print: Large Language Models To Monitor and Control 3D Printing Yayati Jadhav et.al. 2408.14307 null
2024-08-26 Fact Probability Vector Based Goal Recognition Nils Wilken et.al. 2408.14224 link
2024-08-26 Robot Navigation with Entity-Based Collision Avoidance using Deep Reinforcement Learning Yury Kolomeytsev et.al. 2408.14183 null
2024-08-26 "Hi. I'm Molly, Your Virtual Interviewer!" -- Exploring the Impact of Race and Gender in AI-powered Virtual Interview Experiences Shreyan Biswas et.al. 2408.14159 null
2024-08-26 Investigating the effect of Mental Models in User Interaction with an Adaptive Dialog Agent Lindsey Vanderlyn et.al. 2408.14154 null
2024-09-02 MLR-Copilot: Autonomous Machine Learning Research based on Large Language Models Agents Ruochen Li et.al. 2408.14033 link
2024-08-26 Optimizing TD3 for 7-DOF Robotic Arm Grasping: Overcoming Suboptimality with Exploration-Enhanced Contrastive Learning Wen-Han Hsieh et.al. 2408.14009 null
2024-08-26 Decentralized Federated Learning with Model Caching on Mobile Agents Xiaoyu Wang et.al. 2408.14001 null
2024-08-26 Quantitative Representation of Scenario Difficulty for Autonomous Driving Based on Adversarial Policy Search Shuo Yang et.al. 2408.14000 null
2024-08-26 AgentMove: Predicting Human Mobility Anywhere Using Large Language Model based Agentic Framework Jie Feng et.al. 2408.13986 link
2024-08-25 CoT Rerailer: Enhancing the Reliability of Large Language Models in Complex Reasoning Tasks through Error Detection and Correction Guangya Wan et.al. 2408.13940 null
2024-08-25 Safe Policy Exploration Improvement via Subgoals Brian Angulo et.al. 2408.13881 null
2024-08-25 Flexible game-playing AI with AlphaViT: adapting to multiple games and board sizes Kazuhisa Fujita et.al. 2408.13871 null
2024-08-25 Informativeness and Trust in Bayesian Persuasion Reema Deori et.al. 2408.13822 null
2024-08-25 Optical Inversion Using Plasmonic Contrast Agents Xinlin Cao et.al. 2408.13793 null
2024-08-25 Demo: Generative Open xG Network Simulation with Multi-Agent LLM and ns-3 (GenOnet) Farhad Rezazadeh et.al. 2408.13781 null
2024-08-25 MASQ: Multi-Agent Reinforcement Learning for Single Quadruped Robot Locomotion Qi Liu et.al. 2408.13759 null
2024-08-25 Localization and Expansion: A Decoupled Framework for Point Cloud Few-shot Semantic Segmentation Zhaoyang Li et.al. 2408.13752 null
2024-08-25 Multi-Agent Target Assignment and Path Finding for Intelligent Warehouse: A Cooperative Multi-Agent Deep Reinforcement Learning Perspective Qi Liu et.al. 2408.13750 null
2024-08-25 Count-based Novelty Exploration in Classical Planning Giacomo Rosa et.al. 2408.13719 null
2024-08-24 How to guide a present-biased agent through prescribed tasks? Tatiana Belova et.al. 2408.13675 null
2024-08-24 Temporal Elections: Welfare, Strategyproofness, and Proportionality Edith Elkind et.al. 2408.13637 null
2024-08-24 DeepVoting: Learning Voting Rules with Tailored Embeddings Leonardo Matone et.al. 2408.13630 null
2024-08-24 Reaching New Heights in Multi-Agent Collective Construction Martin Rameš et.al. 2408.13615 null
2024-08-24 Hybrid Training for Enhanced Multi-task Generalization in Multi-agent Reinforcement Learning Mingliang Zhang et.al. 2408.13567 null
2024-08-27 Control-Informed Reinforcement Learning for Chemical Processes Maximilian Bloor et.al. 2408.13566 link
2024-08-24 IQA-EVAL: Automatic Evaluation of Human-Model Interactive Question Answering Ruosen Li et.al. 2408.13545 null
2024-08-24 Unleashing Collaborative Computing for Adaptive Video Streaming with Multi-objective Optimization in Satellite Terrestrial Networks Zhishu Shen et.al. 2408.13512 null
2024-08-23 Optimizing Collaboration of LLM based Agents for Finite Element Analysis Chuan Tian et.al. 2408.13406 null
2024-08-23 DrugAgent: Explainable Drug Repurposing Agent with Large Language Model-based Reasoning Yoshitaka Inoue et.al. 2408.13378 null
2024-08-23 Generative Blockchain: Transforming Blockchain from Transaction Recording to Transaction Generation through Proof-of-Merit Haozhao Zhang et.al. 2408.13367 null
2024-08-23 Reconciling Different Theories of Learning with an Agent-based Model of Procedural Learning Sina Rismanchian et.al. 2408.13364 null
2024-08-23 Oscillatory and Excitable Dynamics in an Opinion Model with Group Opinions Corbit R. Sampson et.al. 2408.13336 link
2024-08-23 Mastering the Digital Art of War: Developing Intelligent Combat Simulation Agents for Wargaming Using Hierarchical Reinforcement Learning Scotty Black et.al. 2408.13333 null
2024-08-23 Localized Observation Abstraction Using Piecewise Linear Spatial Decay for Reinforcement Learning in Combat Simulations Scotty Black et.al. 2408.13328 null
2024-08-23 Large Language Models for Zero Touch Network Configuration Management Oscar G. Lira et.al. 2408.13298 null
2024-08-23 The Ultimate Guide to Fine-Tuning LLMs from Basics to Breakthroughs: An Exhaustive Review of Technologies, Research, Best Practices, Applied Research Challenges and Opportunities Venkatesh Balavadhani Parthasarathy et.al. 2408.13296 null
2024-08-23 Optimally Solving Simultaneous-Move Dec-POMDPs: The Sequential Central Planning Approach Johan Peralez et.al. 2408.13139 null
2024-08-18 An Introduction to Cognidynamics Marco Gori et.al. 2408.13112 null
2024-08-23 Diffusion-based Episodes Augmentation for Offline Multi-Agent Reinforcement Learning Jihwan Oh et.al. 2408.13092 null
2024-09-01 Controllable Financial Market Generation with Diffusion Guided Meta Agent Yu-Hao Huang et.al. 2408.12991 null
2024-08-26 Zeoformer: Coarse-Grained Periodic Graph Transformer for OSDA-Zeolite Affinity Prediction Xiangxiang Shen et.al. 2408.12984 null
2024-08-23 Informational Embodiment: Computational role of information structure in codes and robots Alexandre Pitti et.al. 2408.12950 null
2024-08-23 Complete Graph Identification in Population Protocols Haruki Kanaya et.al. 2408.12862 null
2024-08-23 Online Fair Division with Contextual Bandits Arun Verma et.al. 2408.12845 null
2024-08-23 LIMP: Large Language Model Enhanced Intent-aware Mobility Prediction Songwei Li et.al. 2408.12832 link
2024-08-23 From Mobilisation to Radicalisation: Probing the Persistence and Radicalisation of Social Movements Using an Agent-Based Model Emma F. Thomas et.al. 2408.12795 null
2024-08-23 Environment-Centric Active Inference Kanako Esaki et.al. 2408.12777 null
2024-08-27 Intelligent OPC Engineer Assistant for Semiconductor Manufacturing Guojin Chen et.al. 2408.12775 null
2024-08-22 Free-breathing 3D cardiac extracellular volume (ECV) mapping using a linear tangent space alignment (LTSA) model Wonil Lee et.al. 2408.12706 null
2024-09-01 Can LLMs Understand Social Norms in Autonomous Driving Games? Boxuan Wang et.al. 2408.12680 null
2024-08-22 Integrating an agent-based behavioral model in microtransit forecasting and revenue management Xiyuan Ren et.al. 2408.12577 null
2024-08-25 MuMA-ToM: Multi-modal Multi-Agent Theory of Mind Haojun Shi et.al. 2408.12574 link
2024-08-22 PCGRL+: Scaling, Control and Generalization in Reinforcement Learning Level Generators Sam Earle et.al. 2408.12525 null
2024-08-22 Stochastic Online Correlated Selection Ziyun Chen et.al. 2408.12524 null
2024-08-22 Weighted Envy-Freeness in House Allocation Sijia Dai et.al. 2408.12523 null
2024-08-22 MEDCO: Medical Education Copilots Based on A Multi-Agent Framework Hao Wei et.al. 2408.12496 null
2024-08-22 Multi Agent Framework for Collective Intelligence Research Alexandru Dochian et.al. 2408.12391 link
2024-08-22 Recursive Distributed Collaborative Aided Inertial Navigation Roland Jung et.al. 2408.12360 link
2024-09-04 Graph Retrieval Augmented Trustworthiness Reasoning Ying Zhu et.al. 2408.12333 link
2024-08-22 Can Artificial Intelligence Embody Moral Values? Torben Swoboda et.al. 2408.12250 null
2024-08-22 Time Optimal Distance-$k$-Dispersion on Dynamic Ring Brati Mondal et.al. 2408.12220 null
2024-08-22 MDD-5k: A New Diagnostic Conversation Dataset for Mental Disorders Synthesized via Neuro-Symbolic LLM Agents Congchi Yin et.al. 2408.12142 link
2024-08-22 An evidence-accumulating drift-diffusion model of competing information spread on networks Julien Corsin et.al. 2408.12127 null
2024-08-22 Emotion-Agent: Unsupervised Deep Reinforcement Learning with Distribution-Prototype Reward for Continuous Emotional EEG Analysis Zhihao Zhou et.al. 2408.12121 null
2024-08-22 Balancing Act: Prioritization Strategies for LLM-Designed Restless Bandit Rewards Shresth Verma et.al. 2408.12112 null
2024-08-22 Distributed Noncoherent Joint Transmission Based on Multi-Agent Reinforcement Learning for Dense Small Cell MISO Systems Shaozhuang Bai et.al. 2408.12067 null
2024-08-21 Empirical Equilibria in Agent-based Economic systems with Learning agents Kshama Dwarakanath et.al. 2408.12038 null
2024-08-21 Reasoning and Tools for Human-Level Forecasting Elvis Hsieh et.al. 2408.12036 null
2024-08-21 Understanding Epistemic Language with a Bayesian Theory of Mind Lance Ying et.al. 2408.12022 null
2024-08-21 Controlling nonergodicity in quantum many-body systems by reinforcement learning Li-Li Ye et.al. 2408.11989 link
2024-08-21 Advances in Preference-based Reinforcement Learning: A Review Youssef Abdelkareem et.al. 2408.11943 null
2024-08-21 Distributed alternating gradient descent for convex semi-infinite programs over a network Ashwin Aravind et.al. 2408.11937 null
2024-08-21 Spline tie-decay temporal networks Chanon Thongprayoon et.al. 2408.11913 null
2024-08-21 Efficient Exploration and Discriminative World Model Learning with an Object-Centric Abstraction Anthony GX-Chen et.al. 2408.11816 null
2024-08-21 EmbodiedSAM: Online Segment Any 3D Thing in Real Time Xiuwei Xu et.al. 2408.11811 null
2024-08-21 Story3D-Agent: Exploring 3D Storytelling Visualization with Large Language Models Yuzhou Huang et.al. 2408.11801 null
2024-08-21 Leveraging Chemistry Foundation Models to Facilitate Structure Focused Retrieval Augmented Generation in Multi-Agent Workflows for Catalyst and Materials Design Nathaniel H. Park et.al. 2408.11793 null
2024-08-21 DreamFactory: Pioneering Multi-Scene Long Video Generation with a Multi-Agent Framework Zhifei Xie et.al. 2408.11788 null
2024-08-21 Leveraging Fine-Tuned Retrieval-Augmented Generation with Long-Context Support: For 3GPP Standards Omar Erak et.al. 2408.11775 link
2024-08-21 Deviations from the Nash equilibrium and emergence of tacit collusion in a two-player optimal execution game with reinforcement learning Fabrizio Lillo et.al. 2408.11773 null
2024-08-21 VIRIS: Simulating indoor airborne transmission combining architectural design and people movement Yidan Xue et.al. 2408.11772 link
2024-08-23 Consensus over Clustered Networks Using Intermittent and Asynchronous Output Feedback Federico M. Zegers et.al. 2408.11752 null
2024-08-21 Bayesian Optimization Framework for Efficient Fleet Design in Autonomous Multi-Robot Exploration David Molina Concha et.al. 2408.11751 null
2024-08-21 Open-Ended 3D Point Cloud Instance Segmentation Phuc D. A. Nguyen et.al. 2408.11747 null
2024-08-21 Less is more: AI Decision-Making using Dynamic Deep Neural Networks for Short-Term Stock Index Prediction CJ Finnegan et.al. 2408.11740 null
2024-08-22 LLM4VV: Exploring LLM-as-a-Judge for Validation and Verification Testsuites Zachariah Sollenberger et.al. 2408.11729 null
2024-08-21 Networked Communication for Mean-Field Games with Function Approximation and Empirical Mean-Field Estimation Patrick Benjamin et.al. 2408.11607 null
2024-08-21 Optimizing QoS in HD Map Updates: Cross-Layer Multi-Agent with Hierarchical and Independent Learning Jeffrey Redondo et.al. 2408.11605 null
2024-08-21 Cause-Aware Empathetic Response Generation via Chain-of-Thought Fine-Tuning Xinhao Chen et.al. 2408.11599 null
2024-08-21 Drama Engine: A Framework for Narrative Agents Martin Pichlmair et.al. 2408.11574 null
2024-08-21 AutoDirector: Online Auto-scheduling Agents for Multi-sensory Composition Minheng Ni et.al. 2408.11564 null
2024-08-21 Semi-supervised 3D Semantic Scene Completion with 2D Vision Foundation Model Guidance Duc-Hai Pham et.al. 2408.11559 null
2024-08-21 Fixation of leadership in non-Markovian growth processes Tejas Iyer et.al. 2408.11516 null
2024-08-21 Verifying Approximate Equilibrium in Auctions Fabian R. Pieroth et.al. 2408.11445 null
2024-08-21 Subgoal-based Hierarchical Reinforcement Learning for Multi-Agent Collaboration Cheng Xu et.al. 2408.11416 link
2024-08-21 Towards "Differential AI Psychology" and in-context Value-driven Statement Alignment with Moral Foundations Theory Simon Münker et.al. 2408.11415 null
2024-08-21 Swarm Intelligence in Geo-Localization: A Multi-Agent Large Vision-Language Model Collaborative Framework Xiao Han et.al. 2408.11312 null
2024-08-20 CooPre: Cooperative Pretraining for V2X Cooperative Perception Seth Z. Zhao et.al. 2408.11241 null
2024-08-20 Optimization of Multi-Agent Flying Sidekick Traveling Salesman Problem over Road Networks Ruixiao Yang et.al. 2408.11187 null
2024-08-20 Autonomous Negotiation Using Comparison-Based Gradient Estimation Surya Murthy et.al. 2408.11186 link
2024-08-20 Range-based Multi-Robot Integrity Monitoring Against Cyberattacks and Faults: An Anchor-Free Approach Vishnu Vijay et.al. 2408.11155 null
2024-08-20 Accelerating Goal-Conditioned RL Algorithms and Research Michał Bortkiewicz et.al. 2408.11052 link
2024-08-20 FLAME: Learning to Navigate with Multimodal LLM in Urban Environments Yunzhe Xu et.al. 2408.11051 link
2024-08-23 MagicDec: Breaking the Latency-Throughput Tradeoff for Long Context Generation with Speculative Decoding Jian Chen et.al. 2408.11049 link
2024-08-20 Athena: Safe Autonomous Agents with Verbal Contrastive Learning Tanmana Sadhu et.al. 2408.11021 null
2024-08-20 The Evolution of Reinforcement Learning in Quantitative Finance Nikolaos Pippas et.al. 2408.10932 null
2024-08-20 All Robots in One: A New Standard and Unified Dataset for Versatile, General-Purpose Embodied Agents Zhiqiang Wang et.al. 2408.10899 null
2024-08-23 DBHP: Trajectory Imputation in Multi-Agent Sports Using Derivative-Based Hybrid Prediction Hanjun Choi et.al. 2408.10878 null
2024-08-20 More Options for Prelabor Rupture of Membranes, A Bayesian Analysis Ashley Klein et.al. 2408.10876 null
2024-08-20 Multi-agent Multi-armed Bandits with Stochastic Sharable Arm Capacities Hong Xie et.al. 2408.10865 null
2024-08-20 Knowledge Sharing and Transfer via Centralized Reward Agent for Multi-Task Reinforcement Learning Haozhe Ma et.al. 2408.10858 link
2024-08-20 Learning Randomized Algorithms with Transformers Johannes von Oswald et.al. 2408.10818 null
2024-08-20 Multi-Agent Based Simulation for Decentralized Electric Vehicle Charging Strategies and their Impacts Kristoffer Christensen et.al. 2408.10790 null
2024-08-20 Multi-agent based modeling for investigating excess heat utilization from electrolyzer production to district heating network Kristoffer Christensen et.al. 2408.10783 null
2024-08-20 Multi-Agent Based Simulation for Investigating Centralized Charging Strategies and their Impact on Electric Vehicle Home Charging Ecosystem Kristoffer Christensen et.al. 2408.10773 null
2024-08-20 PhishAgent: A Robust Multimodal Agent for Phishing Webpage Detection Tri Cao et.al. 2408.10738 null
2024-08-20 Investigating Context Effects in Similarity Judgements in Large Language Models Sagar Uprety et.al. 2408.10711 null
2024-08-20 Genesis: Towards the Automation of Systems Biology Research Ievgeniia A. Tiukova et.al. 2408.10689 null
2024-08-20 Neural Exploratory Landscape Analysis Zeyuan Ma et.al. 2408.10672 null
2024-08-20 Incorporating a 'ladder of trust' into dynamic Allocation of Function in Human-Autonomous Agent Collectives Chris Baber et.al. 2408.10654 null
2024-08-20 Variations on distributed belief John Lindqvist et.al. 2408.10637 null
2024-08-20 Strategist: Learning Strategic Skills by LLMs via Bi-Level Tree Search Jonathan Light et.al. 2408.10635 null
2024-08-21 MUSES: 3D-Controllable Image Generation via Multi-Modal Agent Collaboration Yanbo Ding et.al. 2408.10605 null
2024-08-20 Fast Collective Evasion in Self-Localized Swarms of Unmanned Aerial Vehicles Filip Novák et.al. 2408.10596 null
2024-08-20 Synchronization behind Learning in Periodic Zero-Sum Games Triggers Divergence from Nash equilibrium Yuma Fujimoto et.al. 2408.10595 null
2024-08-20 Bidirectional Intent Communication: A Role for Large Foundation Models Tim Schreiter et.al. 2408.10589 null
2024-08-20 DEGAS: Detailed Expressions on Full-Body Gaussian Avatars Zhijing Shao et.al. 2408.10588 null
2024-08-20 Hokoff: Real Game Dataset from Honor of Kings and its Offline Reinforcement Learning Benchmarks Yun Qu et.al. 2408.10556 link
2024-08-20 Semi-on-Demand Off-Peak Transit Services with Shared Autonomous Vehicles -- Service Planning, Simulation, and Analysis in Munich, Germany Max T. M. Ng et.al. 2408.10547 null
2024-08-20 Subspace Prototype Guidance for Mitigating Class Imbalance in Point Cloud Semantic Segmentation Jiawei Han et.al. 2408.10537 link
2024-08-20 Approximate Estimation of High-dimension Execution Skill for Dynamic Agents in Continuous Domains Delma Nieves-Rivera et.al. 2408.10512 null
2024-08-20 Evaluation Framework for AI-driven Molecular Design of Multi-target Drugs: Brain Diseases as a Case Study Arthur Cerveira et.al. 2408.10482 link
2024-08-24 IDEA: Enhancing the Rule Learning Ability of Language Agents through Induction, Deduction, and Abduction Kaiyu He et.al. 2408.10455 null
2024-08-19 Kubrick: Multimodal Agent Collaborations for Synthetic Video Generation Liu He et.al. 2408.10453 null
2024-08-19 Tax Credits and Household Behavior: The Roles of Myopic Decision-Making and Liquidity in a Simulated Economy Jialin Dong et.al. 2408.10391 null
2024-08-19 Narrowing the Gap between Vision and Action in Navigation Yue Zhang et.al. 2408.10388 link
2024-08-19 Competing Social Contagions with Opinion Dependent Infectivity Corbit R. Sampson et.al. 2408.10373 link
2024-08-19 Toward Fair and Strategyproof Tournament Rules for Tournaments with Partially Transferable Utilities David Pennock et.al. 2408.10346 null
2024-08-17 Why and How do Complex Systems Self-Organize at All? Average Action Efficiency as a Predictor, Measure, Driver, and Mechanism of Self-Organization Matthew J Brouillet et.al. 2408.10278 null
2024-08-19 Don't Get Stuck: A Deadlock Recovery Approach Francesca Baldini et.al. 2408.10167 null
2024-08-19 Learning Precise Affordances from Egocentric Videos for Robotic Manipulation Gen Li et.al. 2408.10123 null
2024-08-19 Enhancing Reinforcement Learning Through Guided Search Jérôme Arjonilla et.al. 2408.10113 null
2024-08-19 No Screening is More Efficient with Multiple Objects Shunya Noda et.al. 2408.10077 null
2024-08-19 Synthesis of Reward Machines for Multi-Agent Equilibrium Design (Full Version) Muhammad Najib et.al. 2408.10074 null
2024-08-19 Near-Optimal Mechanisms for Resource Allocation Without Monetary Transfers Moise Blanchard et.al. 2408.10066 null
2024-08-19 The Practimum-Optimum Algorithm for Manufacturing Scheduling: A Paradigm Shift Leading to Breakthroughs in Scale and Performance Moshe BenBassat et.al. 2408.10040 null
2024-08-19 The Expressive Power of Uniform Population Protocols with Logarithmic Space Philipp Czerner et.al. 2408.10027 null
2024-08-19 Adaptive BESS and Grid Setpoints Optimization: A Model-Free Framework for Efficient Battery Management under Dynamic Tariff Pricing Alaa Selim et.al. 2408.09989 null
2024-08-19 The Exploration-Exploitation Dilemma Revisited: An Entropy Perspective Renye Yan et.al. 2408.09974 null
2024-08-20 MegaAgent: A Practical Framework for Autonomous Cooperation in Large-Scale LLM Agent Systems Qian Wang et.al. 2408.09955 null
2024-08-19 Boltzmann approach to collective motion via non-local visual interaction Susumu Ito et.al. 2408.09917 null
2024-08-19 Multi-layer diffusion model of photovoltaic installations Tomasz Weron et.al. 2408.09904 null
2024-08-19 Demystifying Reinforcement Learning in Production Scheduling via Explainable AI Daniel Fischer et.al. 2408.09841 null
2024-08-19 Mitigating the Stability-Plasticity Dilemma in Adaptive Train Scheduling with Curriculum-Driven Continual DQN Expansion Achref Jaziri et.al. 2408.09838 null
2024-08-20 World Models Increase Autonomy in Reinforcement Learning Zhao Yang et.al. 2408.09807 null
2024-08-19 Anim-Director: A Large Multimodal Model Powered Agent for Controllable Animation Video Generation Yunxin Li et.al. 2408.09787 link
2024-08-19 GoNoGo: An Efficient LLM-based Multi-Agent System for Streamlining Automotive Software Release Decision-Making Arsham Gholamzadeh Khoee et.al. 2408.09785 null
2024-08-19 Targeted Drug Delivery: Algorithmic Methods for Collecting a Swarm of Particles with Uniform External Forces Aaron T. Becker et.al. 2408.09729 null
2024-08-19 Algorithmic Contract Design with Reinforcement Learning Agents David Molina Concha et.al. 2408.09686 null
2024-08-19 Multi-Agent Reinforcement Learning for Autonomous Driving: A Survey Ruiqi Zhang et.al. 2408.09675 link
2024-08-20 BLADE: Benchmarking Language Model Agents for Data-Driven Science Ken Gu et.al. 2408.09667 link
2024-08-19 Linear-Quadratic Mean-Field Game for Stochastic Systems with Partial Observation Min Li et.al. 2408.09652 null
2024-08-18 Prescribed-time Convergent Distributed Multiobjective Optimization with Dynamic Event-triggered Communication Tengyang Gong et.al. 2408.09602 null
2024-08-21 Löb-Safe Logics for Reflective Agents Seth Ahrenbach et.al. 2408.09590 null
2024-08-18 HiAgent: Hierarchical Working Memory Management for Solving Long-Horizon Agent Tasks with Large Language Model Mengkang Hu et.al. 2408.09559 link
2024-08-18 Enhancing Population-based Search with Active Inference Nassim Dehouche et.al. 2408.09548 null
2024-08-18 A Logic for Policy Based Resource Exchanges in Multiagent Systems Lorenzo Ceragioli et.al. 2408.09516 null
2024-08-18 Beyond Local Views: Global State Inference with Diffusion Models for Cooperative Multi-Agent Reinforcement Learning Zhiwei Xu et.al. 2408.09501 null
2024-08-18 Ancestral Reinforcement Learning: Unifying Zeroth-Order Optimization and Genetic Algorithms for Reinforcement Learning So Nakashima et.al. 2408.09493 null
2024-08-18 HySem: A context length optimized LLM pipeline for unstructured tabular extraction Narayanan PP et.al. 2408.09434 null
2024-08-18 Value-Enriched Population Synthesis: Integrating a Motivational Layer Alba Aguilera et.al. 2408.09407 null
2024-08-18 Optimal stopping and divestment timing under scenario ambiguity and learning Andrea Mazzon et.al. 2408.09349 null
2024-08-17 How to Make an Action Better Marilyn Pease et.al. 2408.09294 null
2024-08-17 GSLAMOT: A Tracklet and Query Graph-based Simultaneous Locating, Mapping, and Multiple Object Tracking System Shuo Wang et.al. 2408.09191 null
2024-08-17 Generative Agent-Based Models for Complex Systems Research: a review Yikang Lu et.al. 2408.09175 null
2024-08-17 Worst- and Average-Case Robustness of Stable Matchings: (Counting) Complexity and Experiments Kimon Boehmer et.al. 2408.09160 null
2024-08-17 Training Verifiably Robust Agents Using Set-Based Reinforcement Learning Manuel Wendl et.al. 2408.09112 null
2024-08-17 Me want cookie! Towards automated and transparent data governance on the Web Jesse Wright et.al. 2408.09071 null
2024-08-16 On the Completeness of Conflict-Based Search: Temporally-Relative Duplicate Pruning Thayne T Walker et.al. 2408.09028 null
2024-08-16 Visual Agents as Fast and Slow Thinkers Guangyan Sun et.al. 2408.08862 link
2024-08-16 CPS-TaskForge: Generating Collaborative Problem Solving Environments for Diverse Communication Tasks Nikita Haduong et.al. 2408.08853 null
2024-08-16 A Novel Quantum Algorithm for Efficient Attractor Search in Gene Regulatory Networks Mirko Rossini et.al. 2408.08814 link
2024-08-16 CAT: Caution Aware Transfer in Reinforcement Learning via Distributional Risk Mohamad Fares El Hajj Chehade et.al. 2408.08812 null
2024-08-16 EmoDynamiX: Emotional Support Dialogue Strategy Prediction by Modelling MiXed Emotions and Discourse Dynamics Chenwei Wan et.al. 2408.08782 link
2024-08-16 Beyond Proportional Individual Guarantees for Binary Perpetual Voting Yotam Gafni et.al. 2408.08767 null
2024-08-16 Rethinking Generative Semantic Communication for Multi-User Systems with Multi-Modal LLM Wanting Yang et.al. 2408.08765 null
2024-08-16 SYMPOL: Symbolic Tree-Based On-Policy Reinforcement Learning Sascha Marton et.al. 2408.08761 link
2024-08-16 Weighted Envy-free Allocation with Subsidy Haris Aziz et.al. 2408.08711 null
2024-08-16 Explore-then-Commit Algorithms for Decentralized Two-Sided Matching Markets Tejas Pagare et.al. 2408.08690 null
2024-08-24 The Fellowship of the LLMs: Multi-Agent Workflows for Synthetic Preference Optimization Dataset Generation Samee Arif et.al. 2408.08688 link
2024-08-16 Neural Reward Machines Elena Umili et.al. 2408.08677 link
2024-08-16 Fine-tuning LLMs for Autonomous Spacecraft Control: A Case Study Using Kerbal Space Program Alejandro Carrasco et.al. 2408.08676 link
2024-08-16 An End-to-End Model for Photo-Sharing Multi-modal Dialogue Generation Peiming Guo et.al. 2408.08650 null
2024-08-16 A survey on secure decentralized optimization and learning Changxin Liu et.al. 2408.08628 null
2024-08-16 DeepREST: Automated Test Case Generation for REST APIs Exploiting Deep Reinforcement Learning Davide Corradini et.al. 2408.08594 null

(back to top)
