GitHub - Lyz103/LLM-Agent-Paper-daily: Automatically Update LLM-Agent Papers Daily using Github Actions (Update Every 12th hours)

Updated on 2024.09.20

Usage instructions: here

Table of Contents

Agents

Agents

Publish Date	Title	Authors	PDF	Code
2024-09-18	Residual Descent Differential Dynamic Game (RD3G) -- A Fast Newton Solver for Constrained General Sum Games	Zhiyuan Zhang et.al.	2409.12152	null
2024-09-18	MAgICoRe: Multi-Agent, Iterative, Coarse-to-Fine Refinement for Reasoning	Justin Chih-Yao Chen et.al.	2409.12147	link
2024-09-19	The Impact of Element Ordering on LM Agent Performance	Wayne Chi et.al.	2409.12089	link
2024-09-19	Using Large Language Models to Generate Clinical Trial Tables and Figures	Yumeng Yang et.al.	2409.12046	null
2024-09-19	Representing Positional Information in Generative World Models for Object Manipulation	Stefano Ferraro et.al.	2409.12005	null
2024-09-18	Putting Data at the Centre of Offline Multi-Agent Reinforcement Learning	Claude Formanek et.al.	2409.12001	null
2024-09-18	On the Stability of Consensus Control under Rotational Ambiguities	Zhonggang Li et.al.	2409.11979	null
2024-09-18	Anomalous behavior of Replicator dynamics for the Prisoner's Dilemma on diluted lattices	Fernanda R. Leivas et.al.	2409.11955	null
2024-09-18	Reinforcement Learning as an Improvement Heuristic for Real-World Production Scheduling	Arthur Müller et.al.	2409.11933	null
2024-09-18	Secure Control Systems for Autonomous Quadrotors against Cyber-Attacks	Samuel Belkadi et.al.	2409.11897	link
2024-09-18	Motivations, Challenges, Best Practices, and Benefits for Bots and Conversational Agents in Software Engineering: A Multivocal Literature Review	Stefano Lambiase et.al.	2409.11864	null
2024-09-18	XP-MARL: Auxiliary Prioritization in Multi-Agent Reinforcement Learning to Address Non-Stationarity	Jianye Xu et.al.	2409.11852	null
2024-09-18	Optimizing Job Shop Scheduling in the Furniture Industry: A Reinforcement Learning Approach Considering Machine Setup, Batch Variability, and Intralogistics	Malte Schneevogt et.al.	2409.11820	null
2024-09-18	Distributed Resilient Secondary Control for Microgrids with Attention-based Weights against High-density Misbehaving Agents	Yutong Li et.al.	2409.11812	null
2024-09-18	Synthesizing Evolving Symbolic Representations for Autonomous Systems	Gabriele Sartor et.al.	2409.11756	link
2024-09-18	HARP: Human-Assisted Regrouping with Permutation Invariant Critic for Multi-Agent Reinforcement Learning	Huawen Hu et.al.	2409.11741	null
2024-09-18	Revealing the Challenge of Detecting Character Knowledge Errors in LLM Role-Playing	Wenyuan Zhang et.al.	2409.11726	link
2024-09-18	Discovering Conceptual Knowledge with Analytic Ontology Templates for Articulated Objects	Jianhua Sun et.al.	2409.11702	null
2024-09-18	RMP-YOLO: A Robust Motion Predictor for Partially Observable Scenarios even if You Only Look Once	Jiawei Sun et.al.	2409.11696	null
2024-09-18	Towards Explainable Goal Recognition Using Weight of Evidence (WoE): A Human-Centered Approach	Abeer Alshehri et.al.	2409.11675	null
2024-09-18	Agent Aggregator with Mask Denoise Mechanism for Histopathology Whole Slide Image Analysis	Xitong Ling et.al.	2409.11664	null
2024-09-18	From Data Stories to Dialogues: A Randomised Controlled Trial of Generative AI Agents and Data Storytelling in Enhancing Data Visualisation Comprehension	Lixiang Yan et.al.	2409.11645	null
2024-09-17	Context-Generative Default Policy for Bounded Rational Agent	Durgakant Pushp et.al.	2409.11604	null
2024-09-17	React to This! How Humans Challenge Interactive Agents using Nonverbal Behaviors	Chuxuan Zhang et.al.	2409.11602	null
2024-09-17	Distributed Deep Koopman Learning for Nonlinear Dynamics	Wenjian Hao et.al.	2409.11586	null
2024-09-17	PLATO: Planning with LLMs and Affordances for Tool Manipulation	Arvind Car et.al.	2409.11580	null
2024-09-17	Optimal Investment with Costly Expert Opinions	Christoph Knochenhauer et.al.	2409.11569	null
2024-09-17	Hyper-SAMARL: Hypergraph-based Coordinated Task Allocation and Socially-aware Navigation for Multi-Robot Systems	Weizheng Wang et.al.	2409.11561	null
2024-09-17	Improving LLM Reasoning with Multi-Agent Tree-of-Thought Validator Agent	Fatemeh Haji et.al.	2409.11527	null
2024-09-17	Diffusion of knowledge and the lottery society	Henri Berestycki et.al.	2409.11479	null
2024-09-17	Consensus decision making on a complete graph: complex behaviour from simple assumptions	P. Sarkanych et.al.	2409.11475	null
2024-09-12	Towards Opinion Shaping: A Deep Reinforcement Learning Approach in Bot-User Interactions	Farbod Siahkali et.al.	2409.11426	null
2024-09-17	Ising model with varying spin strength on a scale-free network: scaling functions and critical amplitude ratios	M. Krasnytska et.al.	2409.11396	null
2024-09-17	Distributed Perception Aware Safe Leader Follower System via Control Barrier Methods	Richie R. Suganda et.al.	2409.11394	null
2024-09-17	LLM-Agent-UMF: LLM-based Agent Unified Modeling Framework for Seamless Integration of Multi Active/Passive Core-Agents	Amine B. Hassouna et.al.	2409.11393	null
2024-09-17	CORE-Bench: Fostering the Credibility of Published Research Through a Computational Reproducibility Agent Benchmark	Zachary S. Siegel et.al.	2409.11363	link
2024-09-17	A Scalable Game Theoretic Approach for Coordination of Multiple Dynamic Systems	Mostafa M. Shibl et.al.	2409.11358	null
2024-09-17	EIA: Environmental Injection Attack on Generalist Web Agents for Privacy Leakage	Zeyi Liao et.al.	2409.11295	null
2024-09-17	P-RAG: Progressive Retrieval Augmented Generation For Planning on Embodied Everyday Task	Weiye Xu et.al.	2409.11279	null
2024-09-17	Hackphyr: A Local Fine-Tuned LLM Agent for Network Security Environments	Maria Rigaki et.al.	2409.11276	null
2024-09-19	The Art of Storytelling: Multi-Agent Generative AI for Dynamic Multimodal Narratives	Samee Arif et.al.	2409.11261	link
2024-09-17	To What Extent do Open-loop and Feedback Nash Equilibria Diverge in General-Sum Linear Quadratic Dynamic Games?	Chih-Yuan Chiu et.al.	2409.11257	null
2024-09-17	A Continuous-time Tractable Model for Present-biased Agents	Yasunori Akagi et.al.	2409.11225	null
2024-09-17	Bearing-based Target Localisation in Search and Rescue Scenarios	Giulia Michieletto et.al.	2409.11221	null
2024-09-17	SuperCoder2.0: Technical Report on Exploring the feasibility of LLMs as Autonomous Programmer	Anmol Gautam et.al.	2409.11190	null
2024-09-18	Annealed Winner-Takes-All for Motion Forecasting	Yihong Xu et.al.	2409.11172	link
2024-09-17	Preventing Unconstrained CBF Safety Filters Caused by Invalid Relative Degree Assumptions	Lukas Brunke et.al.	2409.11171	null
2024-09-17	Reactive Environments for Active Inference Agents with RxEnvironments.jl	Wouter W. L. Nuijten et.al.	2409.11087	link
2024-09-17	Data-driven Dynamic Intervention Design in Network Games	Xiupeng Chen et.al.	2409.11069	null
2024-09-17	A logical alarm for misaligned binary classifiers	Andrés Corrada-Emmanuel et.al.	2409.11052	null
2024-09-17	Improving Speech Emotion Recognition in Under-Resourced Languages via Speech-to-Speech Translation with Bootstrapping Data Selection	Hsi-Che Lin et.al.	2409.10985	null
2024-09-17	Label-free correlative morpho-chemical tomography of 3D kidney mesangial cells	Ankit Butola et.al.	2409.10971	null
2024-09-17	Frontier Shepherding: A Bio-Mimetic Multi-robot Framework for Large-Scale Exploration	John Lewis et.al.	2409.10931	null
2024-09-17	Multi-Floor Zero-Shot Object Navigation Policy	Lingfeng Zhang et.al.	2409.10906	null
2024-09-17	Distributed Optimization for Traffic Light Control and Connected Automated Vehicle Coordination in Mixed-Traffic Intersections	Viet-Anh Le et.al.	2409.10864	null
2024-09-17	SIFToM: Robust Spoken Instruction Following through Theory of Mind	Lance Ying et.al.	2409.10849	null
2024-09-17	Improving Interface Design in Interactive Task Learning for Hierarchical Tasks based on a Qualitative Study	Jieyu Zhou et.al.	2409.10826	null
2024-09-17	Consensus in Models for Opinion Dynamics with Generalized-Bias	Juan Paz et.al.	2409.10809	null
2024-09-16	AutoSafeCoder: A Multi-Agent Framework for Securing LLM Code Generation through Static Analysis and Fuzz Testing	Ana Nunez et.al.	2409.10737	null
2024-09-16	CoMamba: Real-time Cooperative Perception Unlocked with State Space Models	Jinlong Li et.al.	2409.10699	null
2024-09-16	Mitigating Partial Observability in Adaptive Traffic Signal Control with Transformers	Xiaoyu Wang et.al.	2409.10693	null
2024-09-16	Multi-agent Path Finding in Continuous Environment	Kristýna Janovská et.al.	2409.10680	null
2024-09-16	Motion Forecasting via Model-Based Risk Minimization	Aron Distelzweig et.al.	2409.10585	null
2024-09-16	Reinforcement Learning with Quasi-Hyperbolic Discounting	S. R. Eshwar et.al.	2409.10583	null
2024-09-14	On the limits of agency in agent-based models	Ayush Chopra et.al.	2409.10568	link
2024-09-13	Applying Action Masking and Curriculum Learning Techniques to Improve Data Efficiency and Overall Performance in Operational Technology Cyber Security using Reinforcement Learning	Alec Wilson et.al.	2409.10563	null
2024-09-16	On interactive anisotropic walks in two dimensions generated from a three state opinion dynamics model	Surajit Saha et.al.	2409.10413	null
2024-09-16	Reducing Leximin Fairness to Utilitarian Optimization	Eden Hartman et.al.	2409.10395	null
2024-09-16	Decentralized and Asymmetric Multi-Agent Learning in Construction Sites	Yakov Miron et.al.	2409.10375	null
2024-09-19	Instigating Cooperation among LLM Agents Using Adaptive Information Modulation	Qiliang Chen et.al.	2409.10372	null
2024-09-16	2D or not 2D: How Does the Dimensionality of Gesture Representation Affect 3D Co-Speech Gesture Generation?	Téo Guichoux et.al.	2409.10357	null
2024-09-16	Partial Ordering Bayesian Logistic Regression Model for Phase I Combination Trials and Computationally Efficient Approach to Operational Prior Specification	Weishi Chen et.al.	2409.10352	null
2024-09-16	Cognitive Kernel: An Open-source Agent System towards Generalist Autopilots	Hongming Zhang et.al.	2409.10277	link
2024-09-16	Synchronization-Based Cooperative Distributed Model Predictive Control	Julius Beerwerth et.al.	2409.10215	null
2024-09-16	Maneuver Decision-Making with Trajectory Streams Prediction for Autonomous Vehicles	Mais Jamal et.al.	2409.10165	null
2024-09-16	Multi-Agent Obstacle Avoidance using Velocity Obstacles and Control Barrier Functions	Alejandro Sánchez Roncero et.al.	2409.10117	null
2024-09-16	Robust Reinforcement Learning with Dynamic Distortion Risk Measures	Anthony Coache et.al.	2409.10096	link
2024-09-16	Cross-modality image synthesis from TOF-MRA to CTA using diffusion-based models	Alexander Koch et.al.	2409.10089	null
2024-09-19	Towards Physically-Realizable Adversarial Attacks in Embodied Vision Navigation	Meng Chen et.al.	2409.10071	link
2024-09-16	A Social Force Model for Multi-Agent Systems With Application to Robots Traversal in Cluttered Environments	Chenxi Li et.al.	2409.10049	null
2024-09-16	Audio-Driven Reinforcement Learning for Head-Orientation in Naturalistic Environments	Wessel Ledder et.al.	2409.10048	null
2024-09-16	Bearing-Distance Based Flocking with Zone-Based Interactions	Hossein B. Jond et.al.	2409.10047	null
2024-09-16	E2Map: Experience-and-Emotion Map for Self-Reflective Robot Navigation with Language Models	Chan Kim et.al.	2409.10027	null
2024-09-16	Reinforcement learning-based statistical search strategy for an axion model from flavor	Satsuki Nishimura et.al.	2409.10023	null
2024-09-16	SHIRE: Enhancing Sample Efficiency using Human Intuition in REinforcement Learning	Amogh Joshi et.al.	2409.09990	null
2024-09-16	Optimality Gap of Decentralized Submodular Maximization under Probabilistic Communication	Joan Vendrell et.al.	2409.09979	null
2024-09-16	Constrained Bandwidth Observation Sharing for Multi-Robot Navigation in Dynamic Environments via Intelligent Knapsack	Anirudh Chari et.al.	2409.09975	null
2024-09-16	Solving Monotone Variational Inequalities with Best Response Dynamics	Yu-Wen Chen et.al.	2409.09961	null
2024-09-16	Context-aware Advertisement Modeling and Applications in Rapid Transit Systems	Afzal Ahmed et.al.	2409.09956	null
2024-09-15	Critic as Lyapunov function (CALF): a model-free, stability-ensuring agent	Pavel Osinenko et.al.	2409.09869	null
2024-09-15	A Complete Algorithm for a Moving Target Traveling Salesman Problem with Obstacles	Anoop Bhat et.al.	2409.09852	null
2024-09-15	On the Effect of Robot Errors on Human Teaching Dynamics	Jindan Huang et.al.	2409.09827	null
2024-09-15	Revisiting the state-space model of unawareness	Alex A. T. Rathke et.al.	2409.09818	null
2024-09-15	Social Influence and Consensus Building: Introducing a q-Voter Model with Weighted Influence	Pratik Mullick et.al.	2409.09817	null
2024-09-17	Large Language Model Based Generative Error Correction: A Challenge and Baselines for Speech Recognition, Speaker Tagging, and Emotion Recognition	Chao-Han Huck Yang et.al.	2409.09785	null
2024-09-15	DiFSD: Ego-Centric Fully Sparse Paradigm with Uncertainty Denoising and Iterative Refinement for Efficient End-to-End Autonomous Driving	Haisheng Su et.al.	2409.09777	null
2024-09-15	Range-SLAM: Ultra-Wideband-Based Smoke-Resistant Real-Time Localization and Mapping	Yi Liu et.al.	2409.09763	null
2024-09-15	Automatic Control With Human-Like Reasoning: Exploring Language Model Embodied Air Traffic Agents	Justas Andriuškevičius et.al.	2409.09717	null
2024-09-15	Unveiling Gender Bias in Large Language Models: Using Teacher's Evaluation in Higher Education As an Example	Yuanning Huang et.al.	2409.09652	link
2024-09-15	RethinkMCTS: Refining Erroneous Thoughts in Monte Carlo Tree Search for Code Generation	Qingyao Li et.al.	2409.09584	null
2024-09-15	Traffic Scene Generation from Natural Language Description for Autonomous Vehicles with Large Language Model	Bo-Kai Ruan et.al.	2409.09575	null
2024-09-15	Decentralized Safe and Scalable Multi-Agent Control under Limited Actuation	Vrushabh Zinage et.al.	2409.09573	null
2024-09-14	Swarm Algorithms for Dynamic Task Allocation in Unknown Environments	Adithya Balachandran et.al.	2409.09550	null
2024-09-14	Autonomous Goal Detection and Cessation in Reinforcement Learning: A Case Study on Source Term Estimation	Yiwei Shi et.al.	2409.09541	null
2024-09-14	Ensuring System-Level Protection against Eavesdropping Adversaries in Distributed Dynamical Systems	Dipankar Maity et.al.	2409.09539	null
2024-09-14	Planning Transformer: Long-Horizon Offline Reinforcement Learning with Planning Tokens	Joseph Clinton et.al.	2409.09513	null
2024-09-14	Learning Nudges for Conditional Cooperation: A Multi-Agent Reinforcement Learning Model	Shatayu Kulkarni et.al.	2409.09509	null
2024-09-14	Learning Keypoints for Multi-Agent Behavior Analysis using Self-Supervision	Daniel Khalil et.al.	2409.09455	null
2024-09-14	Initial Error Affection and Error Correction in Linear Quadratic Mean Field Games under Erroneous Initial Information	Yuxin Jin et.al.	2409.09375	null
2024-09-14	The (n,k) game with heterogeneous agents	Hsin-Lun Li et.al.	2409.09364	null
2024-09-14	PeriGuru: A Peripheral Robotic Mobile App Operation Assistant based on GUI Image Understanding and Prompting with LLM	Kelin Fu et.al.	2409.09354	link
2024-09-14	Enhancing Decision-Making for LLM Agents via Step-Level Q-Value Models	Yuanzhao Zhai et.al.	2409.09345	null
2024-09-14	Capability Augmentation for Heterogeneous Dynamic Teaming with Temporal Logic Tasks	Carter Berlind et.al.	2409.09285	null
2024-09-14	Python Symbolic Execution with LLM-powered Code Generation	Wenhan Wang et.al.	2409.09271	null
2024-09-14	High-Fidelity Data-Driven Dynamics Model for Reinforcement Learning-based Magnetic Control in HL-3 Tokamak	Niannian Wu et.al.	2409.09238	null
2024-09-19	Curricula for Learning Robust Policies with Factored State Representations in Changing Environments	Panayiotis Panayiotou et.al.	2409.09169	null
2024-09-13	Measure Preserving Flows for Ergodic Search in Convoluted Environments	Albert Xu et.al.	2409.09164	null
2024-09-08	ELMS: Elasticized Large Language Models On Mobile Devices	Wangsong Yin et.al.	2409.09071	null
2024-09-13	The unknotting number, hard unknot diagrams, and reinforcement learning	Taylor Applebaum et.al.	2409.09032	null
2024-09-13	Optically-Validated Microvascular Phantom for Super-Resolution Ultrasound Imaging	Jaime Parra Raad et.al.	2409.09031	null
2024-09-13	Agents in Software Engineering: Survey, Landscape, and Vision	Yanxian Huang et.al.	2409.09030	link
2024-09-13	AI-LieDar: Examine the Trade-off Between Utility and Truthfulness in LLM Agents	Zhe Su et.al.	2409.09013	null
2024-09-13	Mechanism Design for Extending the Accessibility of Facilities	Hau Chan et.al.	2409.08993	null
2024-09-13	Safeguarding Decentralized Social Media: LLM Agents for Automating Community Rule Compliance	Lucio La Cava et.al.	2409.08963	null
2024-09-13	Emerging Reliance Behaviors in Human-AI Text Generation: Hallucinations, Data Quality Assessment, and Cognitive Forcing Functions	Zahra Ashktorab et.al.	2409.08937	null
2024-09-13	Farmer.Chat: Scaling AI-Powered Agricultural Services for Smallholder Farmers	Namita Singh et.al.	2409.08916	null
2024-09-13	Exploring Action-Centric Representations Through the Lens of Rate-Distortion Theory	Miguel de Llanza Varona et.al.	2409.08892	null
2024-09-13	Using The Concept Hierarchy for Household Action Recognition	Andrei Costinescu et.al.	2409.08853	null
2024-09-13	Deep reinforcement learning for tracking a moving target in jellyfish-like swimming	Yihao Chen et.al.	2409.08815	null
2024-09-13	Mutual Theory of Mind in Human-AI Collaboration: An Empirical Study with LLM-driven AI Agents in a Real-time Shared Workspace Task	Shao Zhang et.al.	2409.08811	null
2024-09-13	HOLA-Drone: Hypergraphic Open-ended Learning for Zero-Shot Multi-Drone Cooperative Pursuit	Yang Li et.al.	2409.08767	null
2024-09-13	Fusing Dynamics Equation: A Social Opinions Prediction Algorithm with LLM-based Agents	Junchi Yao et.al.	2409.08717	null
2024-09-13	Systematic analysis of requirements for socially acceptable service robots	Andrea Ruo et.al.	2409.08677	null
2024-09-13	Average Consensus over Directed Networks in Open Multi-Agent Systems with Acknowledgement Feedback	Evagoras Makridis et.al.	2409.08634	null
2024-09-13	Generalization of Gershgorin's theorem. Analysis and design of control laws	Igor Furtat et.al.	2409.08576	null
2024-09-13	Expediting and Elevating Large Language Model Reasoning via Hidden Chain-of-Thought Decoding	Tianqiao Liu et.al.	2409.08561	null
2024-09-16	Can AI Prompt Humans? Multimodal Agents Prompt Players' Game Actions and Show Consequences to Raise Sustainability Awareness	Qinshi Zhang et.al.	2409.08486	null
2024-09-13	A BERT-Based Summarization approach for depression detection	Hossein Salahshoor Gavalan et.al.	2409.08483	null
2024-09-12	A Surveillance Game between a Differential Drive Robot and an Omnidirectional Agent: The Case of a Faster Evader	Rodrigo Saavedra et.al.	2409.08414	null
2024-09-12	Sequential Discrete Action Selection via Blocking Conditions and Resolutions	Liam Merz Hoffmeister et.al.	2409.08410	null
2024-09-12	Knowledge Tagging with Large Language Model based Multi-Agent System	Hang Li et.al.	2409.08406	null
2024-09-12	Self-Supervised Inference of Agents in Trustless Environments	Vladyslav Larin et.al.	2409.08386	null
2024-09-12	An Experimental Study of Competitive Market Behavior Through LLMs	Jingru Jia et.al.	2409.08357	null
2024-09-13	Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale	Rogerio Bonatti et.al.	2409.08264	link
2024-09-12	How can the tragedy of the commons be prevented?: Introducing Linear Quadratic Mixed Mean Field Games	Gokce Dayanikli et.al.	2409.08235	null
2024-09-12	Linear Complementary Dual Codes Constructed from Reinforcement Learning	Yansheng Wu et.al.	2409.08114	null
2024-09-12	MosquitoMiner: A Light Weight Rover for Detecting and Eliminating Mosquito Breeding Sites	Md. Adnanul Islam et.al.	2409.08078	link
2024-09-13	Learning Communities from Equilibria of Nonlinear Opinion Dynamics	Yu Xing et.al.	2409.08004	null
2024-09-12	Autonomous Vehicle Controllers From End-to-End Differentiable Simulation	Asen Nachkov et.al.	2409.07965	null
2024-09-12	WirelessAgent: Large Language Model Agents for Intelligent Wireless Networks	Jingwen Tong et.al.	2409.07964	link
2024-09-12	Covariance Intersection-based Invariant Kalman Filtering(DInCIKF) for Distributed Pose Estimation	Haoying Li et.al.	2409.07933	null
2024-09-12	Reinforcement Learning Discovers Efficient Decentralized Graph Path Search Strategies	Alexei Pisacane et.al.	2409.07932	null
2024-09-12	Tidal MerzA: Combining affective modelling and autonomous code generation through Reinforcement Learning	Elizabeth Wilson et.al.	2409.07918	null
2024-09-12	Tera-SpaceCom: GNN-based Deep Reinforcement Learning for Joint Resource Allocation and Task Offloading in TeraHertz Band Space Networks	Zhifeng Hu et.al.	2409.07911	null
2024-09-12	UNIT: Unsupervised Online Instance Segmentation through Time	Corentin Sautier et.al.	2409.07887	null
2024-09-12	Mapping Technical Safety Research at AI Companies: A literature review and incentives analysis	Oscar Delaney et.al.	2409.07878	null
2024-09-12	ReGentS: Real-World Safety-Critical Driving Scenario Generation Made Stable	Yuan Yin et.al.	2409.07830	null
2024-09-12	GateAttentionPose: Enhancing Pose Estimation with Agent Attention and Improved Gated Convolutions	Liang Feng et.al.	2409.07798	null
2024-09-12	A Spatiotemporal Stealthy Backdoor Attack against Cooperative Multi-Agent Deep Reinforcement Learning	Yinbo Yu et.al.	2409.07775	null
2024-09-12	Accelerated Multi-Time-Scale Stochastic Approximation: Optimal Complexity and Applications in Reinforcement Learning and Multi-Agent Games	Sihan Zeng et.al.	2409.07767	null
2024-09-12	Distributed Learning Dynamics Converging to the Core of $B$ -Matchings	Aya Hamed et.al.	2409.07754	null
2024-09-12	Self-similarity of temporal interaction networks arises from hyperbolic geometry with time-varying curvature	Subhabrata Dutta et.al.	2409.07733	link
2024-09-12	A Conceptual Framework for Understanding Empathy in Physics Faculty	Alia Hamdan et.al.	2409.07724	null
2024-09-12	CollaMamba: Efficient Collaborative Perception with Cross-Agent Spatial-Temporal State Space Model	Yang Li et.al.	2409.07714	null
2024-09-12	DSBench: How Far Are Data Science Agents to Becoming Data Science Experts?	Liqiang Jing et.al.	2409.07703	link
2024-09-11	SimulBench: Evaluating Language Models with Creative Simulation Tasks	Qi Jia et.al.	2409.07641	null
2024-09-11	HERL: Tiered Federated Learning with Adaptive Homomorphic Encryption using Reinforcement Learning	Jiaxang Tang et.al.	2409.07631	null
2024-09-11	A Survey of Inverse Constrained Reinforcement Learning: Definitions, Progress and Challenges	Guiliang Liu et.al.	2409.07569	null
2024-09-11	Connecting extended Wigner's friend arguments and noncontextuality	Laurens Walleghem et.al.	2409.07537	null
2024-09-13	MoA is All You Need: Building LLM Research Team using Mixture of Agents	Sandy Chen et.al.	2409.07487	null
2024-09-04	MarS: a Financial Market Simulation Engine Powered by Generative Foundation Model	Junjie Li et.al.	2409.07486	null
2024-09-11	"My Grade is Wrong!": A Contestable AI Framework for Interactive Feedback in Evaluating Student Essays	Shengxin Hong et.al.	2409.07453	null
2024-09-11	SUPER: Evaluating Agents on Setting Up and Executing Tasks from Research Repositories	Ben Bogin et.al.	2409.07440	link
2024-09-11	Agent Workflow Memory	Zora Zhiruo Wang et.al.	2409.07429	link
2024-09-11	Hierarchical Reinforcement Learning for Temporal Abstraction of Listwise Recommendation	Luo Ji et.al.	2409.07416	null
2024-09-11	A Contrastive Symmetric Forward-Forward Algorithm (SFFA) for Continual Learning Tasks	Erik B. Terres-Escudero et.al.	2409.07387	null
2024-09-11	Policy consequences of the new neuroeconomic framework	A. David Redish et.al.	2409.07373	null
2024-09-11	Online Decision MetaMorphFormer: A Casual Transformer-Based Reinforcement Learning Framework of Universal Embodied Intelligence	Luo Ji et.al.	2409.07341	null
2024-09-11	Explanation, Debate, Align: A Weak-to-Strong Framework for Language Model Generalization	Mehrdad Zakershahrak et.al.	2409.07335	null
2024-09-11	Using Generative Agents to Create Tip Sheets for Investigative Data Reporting	Joris Veerbeek et.al.	2409.07286	null
2024-09-11	Multi-Type Preference Learning: Empowering Preference-Based Reinforcement Learning with Equal Preferences	Ziang Liu et.al.	2409.07268	null
2024-09-11	Cross-Dialect Text-To-Speech in Pitch-Accent Language Incorporating Multi-Dialect Phoneme-Level BERT	Kazuki Yamauchi et.al.	2409.07265	null
2024-09-11	Propaganda to Hate: A Multimodal Analysis of Arabic Memes with Multi-Agent LLMs	Firoj Alam et.al.	2409.07246	null
2024-09-11	A Perspective on AI-Guided Molecular Simulations in VR: Exploring Strategies for Imitation Learning in Hyperdimensional Molecular Systems	Mohamed Dhouioui et.al.	2409.07189	null
2024-09-11	Identify Design Problems Through Questioning: Exploring Role-playing Interactions with Large Language Models to Foster Design Questioning Skills	Hyunseung Lim et.al.	2409.07178	null
2024-09-11	Learning Efficient Recursive Numeral Systems via Reinforcement Learning	Jonathan D. Thomas et.al.	2409.07170	null
2024-09-11	Randomized Strategic Facility Location with Predictions	Eric Balkanski et.al.	2409.07142	null
2024-09-11	MVLLaVA: An Intelligent Agent for Unified and Flexible Novel View Synthesis	Hanyu Jiang et.al.	2409.07129	null
2024-09-11	DCMAC: Demand-aware Customized Multi-Agent Communication via Upper Bound Training	Dongkun Huo et.al.	2409.07127	null
2024-09-17	Inefficient Alliance Formation in Coalitional Blotto Games	Vade Shah et.al.	2409.06899	null
2024-09-10	A Quality Diversity Approach to Automatically Generate Multi-Agent Path Finding Benchmark Maps	Cheng Qian et.al.	2409.06888	null
2024-09-10	Can Agents Spontaneously Form a Society? Introducing a Novel Architecture for Generative Multi-Agents to Elicit Social Emergence	H. Zhang et.al.	2409.06750	null
2024-09-19	Decentralized Neural Networks for Robust and Scalable Eigenvalue Computation	Ronald Katende et.al.	2409.06746	null
2024-09-10	Memory and Personality in Ideological Polarization: The Politico-physics of Mnemomatter	Shengkai Li et.al.	2409.06660	null
2024-09-10	Fixed-budget and Multiple-issue Quadratic Voting	Laura Georgescu et.al.	2409.06614	null
2024-09-10	On Epistemic Properties in Discrete-Event Systems: A Uniform Framework and Its Applications	Bohan Cui et.al.	2409.06588	null
2024-09-10	Think-on-Process: Dynamic Process Generation for Collaborative Development of Multi-Agent System	Leilei Lin et.al.	2409.06568	link
2024-09-10	Indirect Dynamic Negotiation in the Nash Demand Game	Tatiana V. Guy et.al.	2409.06566	null
2024-09-10	Social Mediation through Robots -- A Scoping Review on Improving Group Interactions through Directed Robot Action using an Extended Group Process Model	Thomas H. Weisswange et.al.	2409.06557	null
2024-09-10	Coordinated Motion Planning: Multi-Agent Path Finding in a Densely Packed, Bounded Domain	Sándor P. Fekete et.al.	2409.06486	null
2024-09-10	Learning Generative Interactive Environments By Trained Agent Exploration	Naser Kazemi et.al.	2409.06445	link
2024-09-10	Position Fair Mechanisms Allocating Indivisible Goods	Ryoga Mahara et.al.	2409.06423	null
2024-09-10	Exploring the Integration of Large Language Models in Industrial Test Maintenance Processes	Ludvig Lemner et.al.	2409.06416	null
2024-09-10	MAGDA: Multi-agent guideline-driven diagnostic assistance	David Bani-Harouni et.al.	2409.06351	null
2024-09-17	Foragax: An Agent-Based Modelling Framework Based on JAX	Siddharth Chaturvedi et.al.	2409.06345	link
2024-09-10	Towards Agentic AI on Particle Accelerators	Antonin Sulc et.al.	2409.06336	null
2024-09-11	Modified Meta-Thompson Sampling for Linear Bandits and Its Bayes Regret Analysis	Hao Li et.al.	2409.06329	null
2024-09-10	Automate Strategy Finding with LLM in Quant investment	Zhizhuo Kou et.al.	2409.06289	null
2024-09-10	Evidence gathering under competitive and noncompetitive rewards	Philip Brookins et.al.	2409.06248	null
2024-09-10	INTRA: Interaction Relationship-aware Weakly Supervised Affordance Grounding	Ji Ha Jang et.al.	2409.06210	null
2024-09-11	A Policy Iteration Method for Inverse Mean Field Games	Kui Ren et.al.	2409.06184	null
2024-09-10	Contrastive Federated Learning with Tabular Data Silos	Achmad Ginanjar et.al.	2409.06123	null
2024-09-14	ClarQ-LLM: A Benchmark for Models Clarifying and Requesting Information in Task-Oriented Dialog	Yujian Gan et.al.	2409.06097	link
2024-09-09	Coarse Descriptions and Cautious Preferences	Evan Piermont et.al.	2409.06054	null
2024-09-09	When Learning Meets Dynamics: Distributed User Connectivity Maximization in UAV-Based Communication Networks	Bowei Li et.al.	2409.06010	null
2024-09-09	Promptable Closed-loop Traffic Simulation	Shuhan Tan et.al.	2409.05863	null
2024-09-15	MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct	Run Luo et.al.	2409.05840	null
2024-09-09	Cooperative Decision-Making for CAVs at Unsignalized Intersections: A MARL Approach with Attention and Hierarchical Game Priors	Jiaqi Liu et.al.	2409.05712	null
2024-09-09	StratXplore: Strategic Novelty-seeking and Instruction-aligned Exploration for Vision and Language Navigation	Muraleekrishna Gopinathan et.al.	2409.05593	null
2024-09-09	Interpretable Responsibility Sharing as a Heuristic for Task and Motion Planning	Arda Sarp Yenicesu et.al.	2409.05586	link
2024-09-09	SciAgents: Automating scientific discovery through multi-agent intelligent graph reasoning	Alireza Ghafarollahi et.al.	2409.05556	link
2024-09-09	Seeing is Believing? Enhancing Vision-Language Navigation using Visual Perturbations	Xuesong Zhang et.al.	2409.05552	null
2024-09-09	A refined Frauchiger--Renner paradox based on strong contextuality	Laurens Walleghem et.al.	2409.05491	null
2024-09-09	Adaptive Multi-Layer Deployment for A Digital Twin Empowered Satellite-Terrestrial Integrated Network	Yihong Tao et.al.	2409.05480	null
2024-09-09	Reinforcement Learning for Variational Quantum Circuits Design	Simone Foderà et.al.	2409.05475	null
2024-09-09	Semifactual Explanations for Reinforcement Learning	Jasmina Gajcin et.al.	2409.05435	link
2024-09-09	Leveraging Computation of Expectation Models for Commonsense Affordance Estimation on 3D Scene Graphs	Mario Alberto Valdes Saucedo et.al.	2409.05392	null
2024-09-09	BAMDP Shaping: a Unified Theoretical Framework for Intrinsic Motivation and Reward Shaping	Aly Lidayan et.al.	2409.05358	null
2024-09-09	Obvious Strategy-proofness with Respect to a Partition	R. Pablo Arribillaga et.al.	2409.05315	null
2024-09-09	Distributed Robust Continuous-Time Optimization Algorithms for Time-Varying Constrained Cost	Zeinab Ebrahimi et.al.	2409.05293	null
2024-09-09	Towards Fast Rates for Federated and Multi-Task Reinforcement Learning	Feng Zhu et.al.	2409.05291	null
2024-09-08	COVID19-CBABM: A City-Based Agent Based Disease Spread Modeling Framework	Raunak Sarbajna et.al.	2409.05235	null
2024-09-08	Banded phases in topological flocks	Charles R. Packard et.al.	2409.05198	null
2024-09-08	Difference Between Cyclic and Distributed Approach in Stochastic Optimization for Multi-agent System	Jiahao Shi et.al.	2409.05155	null
2024-09-08	Nonlinear Cooperative Output Regulation with Input Delay Compensation	Shiqi Zheng et.al.	2409.05113	null
2024-09-11	Decentralized Control of Multi-Agent Systems Under Acyclic Spatio-Temporal Task Dependencies	Gregorio Marchesini et.al.	2409.05106	null
2024-09-08	Pareto-Optimal Peer-to-Peer Risk Sharing with Robust Distortion Risk Measures	Mario Ghossoub et.al.	2409.05103	null
2024-09-08	On final opinions of the Friedkin-Johnsen model over random graphs with partially stubborn community	Lingfei Wang et.al.	2409.05063	null
2024-09-08	Towards Multi-agent Policy-based Directed Hypergraph Learning for Traffic Signal Control	Kang Wang et.al.	2409.05037	null
2024-09-08	Cooperative Learning-Based Framework for VNF Caching and Placement Optimization over Low Earth Orbit Satellite Networks	Khai Doan et.al.	2409.05025	null
2024-09-08	A Pair Programming Framework for Code Generation via Multi-Plan Exploration and Feedback-Driven Refinement	Huan Zhang et.al.	2409.05001	link
2024-09-08	Multi-V2X: A Large Scale Multi-modal Multi-penetration-rate Dataset for Cooperative Perception	Rongsong Li et.al.	2409.04980	null
2024-09-07	DEPLOYERS: An agent based modeling tool for multi country real world data	Martin Jaraiz et.al.	2409.04876	null
2024-09-07	Adaptation Procedure in Misinformation Games	Konstantinos Varsos et.al.	2409.04854	null
2024-09-07	Context-Aware Replanning with Pre-explored Semantic Map for Object Navigation	Hung-Ting Su et.al.	2409.04837	null
2024-09-07	LMGT: Optimizing Exploration-Exploitation Balance in Reinforcement Learning through Language Model Guided Trade-offs	Yongxin Deng et.al.	2409.04744	null
2024-09-07	Algorithmic Scenario Generation as Quality Diversity Optimization	Stefanos Nikolaidis et.al.	2409.04711	null
2024-09-07	Learning Optimal Stable Matches in Decentralized Markets with Unknown Preferences	Vade Shah et.al.	2409.04669	null
2024-09-10	QueryBuilder: Human-in-the-Loop Query Development for Information Retrieval	Hemanth Kandula et.al.	2409.04667	null
2024-09-06	Stacked Universal Successor Feature Approximators for Safety in Reinforcement Learning	Ian Cannon et.al.	2409.04641	null
2024-09-06	Sparse Rewards Can Self-Train Dialogue Agents	Barrett Martin Lattimer et.al.	2409.04617	link
2024-09-15	Decentralized Learning in General-sum Markov Games	Chinmay Maheshwari et.al.	2409.04613	null
2024-09-06	Impact of Transit on Mobility, Equity, and Economy in the Chicago Metropolitan Region	Omer Verbas et.al.	2409.04568	null
2024-09-03	State and Action Factorization in Power Grids	Gianvito Losapio et.al.	2409.04467	null
2024-09-03	Here's Charlie! Realising the Semantic Web vision of Agents in the age of LLMs	Jesse Wright et.al.	2409.04465	null
2024-09-06	A Survey on Knowledge Organization Systems of Research Fields: Resources and Challenges	Angelo Salatino et.al.	2409.04432	null
2024-09-06	RLPF: Reinforcement Learning from Prediction Feedback for User Summarization with LLMs	Jiaxing Wu et.al.	2409.04421	null
2024-09-06	MATWA: A Web Toolkit for Matching under Preferences	Frederik Glitzner et.al.	2409.04402	null
2024-09-06	Cs-O $_2$ -Li as enhanced NEA surface layer with increased lifetime for GaAs photocathodes	Maximilian Herbert et.al.	2409.04319	null
2024-09-06	Safe and Efficient Path Planning under Uncertainty via Deep Collision Probability Fields	Felix Herrmann et.al.	2409.04306	null
2024-09-06	Using Large Language Models to Generate Authentic Multi-agent Knowledge Work Datasets	Desiree Heim et.al.	2409.04286	null
2024-09-06	Collective chemotactic search strategies	Hugues Meyer et.al.	2409.04262	null
2024-09-06	SPACE: A Python-based Simulator for Evaluating Decentralized Multi-Robot Task Allocation Algorithms	Inmo Jang et.al.	2409.04230	link
2024-09-06	FPT Algorithms using Minimal Parameters for a Generalized Version of Maximin Shares	Klaus Jansen et.al.	2409.04225	null
2024-09-06	Advancing Multi-Organ Disease Care: A Hierarchical Multi-Agent Reinforcement Learning Framework	Daniel J. Tan et.al.	2409.04224	null
2024-09-06	Runtime analysis of a coevolutionary algorithm on impartial combinatorial games	Alistair Benford et.al.	2409.04177	null
2024-09-06	Towards a Socially Acceptable Competitive Equilibrium in Energy Markets	Koorosh Shomalzadeh et.al.	2409.04157	null
2024-09-06	Can LLMs Generate Novel Research Ideas? A Large-Scale Human Study with 100+ NLP Researchers	Chenglei Si et.al.	2409.04109	link
2024-09-06	Tighter Analysis for Decentralized Stochastic Gradient Method: Impact of Data Homogeneity	Qiang Li et.al.	2409.04092	null
2024-09-06	Surface Patterns Shaped by Additives in Crystals	M. A. Chabowska et.al.	2409.04084	null
2024-09-05	DRAL: Deep Reinforcement Adaptive Learning for Multi-UAVs Navigation in Unknown Indoor Environment	Kangtong Mo et.al.	2409.03930	null
2024-09-05	On the Convergence Rates of Federated Q-Learning across Heterogeneous Environments	Muxing Wang et.al.	2409.03897	null
2024-09-05	Multi-agent Path Finding for Mixed Autonomy Traffic Coordination	Han Zheng et.al.	2409.03881	null
2024-09-05	PARCO: Learning Parallel Autoregressive Policies for Efficient Multi-Agent Combinatorial Optimization	Federico Berto et.al.	2409.03811	link
2024-09-04	NESTFUL: A Benchmark for Evaluating LLMs on Nested Sequences of API Calls	Kinjal Basu et.al.	2409.03797	null
2024-09-13	Safeguarding AI Agents: Developing and Analyzing Safety Architectures	Ishaan Domkundwar et.al.	2409.03793	null
2024-08-31	BreachSeek: A Multi-Agent Automated Penetration Tester	Ibrahim Alshehri et.al.	2409.03789	link
2024-09-06	RAG based Question-Answering for Contextual Response Prediction System	Sriram Veturi et.al.	2409.03708	null
2024-09-05	TRACE-cs: Trustworthy Reasoning for Contrastive Explanations in Course Scheduling Problems	Stylianos Loukas Vasileiou et.al.	2409.03671	null
2024-09-06	LLM-based multi-agent poetry generation in non-cooperative environments	Ran Zhang et.al.	2409.03659	link
2024-09-05	A Complete Landscape of EFX Allocations of Mixed Manna on Graphs	Yu Zhou et.al.	2409.03594	null
2024-09-05	CHIRPs: Change-Induced Regret Proxy metrics for Lifelong Reinforcement Learning	John Birkbeck et.al.	2409.03577	null
2024-09-05	From MOOC to MAIC: Reshaping Online Teaching and Learning through LLM-driven Agents	Jifan Yu et.al.	2409.03512	null
2024-09-05	Rx Strategist: Prescription Verification using LLM Agents System	Phuc Phan Van et.al.	2409.03440	null
2024-09-05	Reinforcement Learning Approach to Optimizing Profilometric Sensor Trajectories for Surface Inspection	Sara Roos-Hoefgeest et.al.	2409.03429	null
2024-09-05	Game On: Towards Language Models as RL Experimenters	Jingwei Zhang et.al.	2409.03402	null
2024-09-05	ELO-Rated Sequence Rewards: Advancing Reinforcement Learning Models	Qi Ju et.al.	2409.03301	link
2024-09-05	Robust synchronization and policy adaptation for networked heterogeneous agents	Miguel F. Arevalo-Castiblanco et.al.	2409.03273	null
2024-09-05	GraphInsight: Unlocking Insights in Large Language Models for Graph Structure Understanding	Yukun Cao et.al.	2409.03258	null
2024-09-05	E2CL: Exploration-based Error Correction Learning for Embodied Agents	Hanlin Wang et.al.	2409.03256	null
2024-09-05	Improving agent performance in fluid environments by perceptual pretraining	Jin Zhang et.al.	2409.03230	null
2024-09-05	xLAM: A Family of Large Action Models to Empower AI Agent Systems	Jianguo Zhang et.al.	2409.03215	link
2024-09-05	Predefined-time distributed non-convex optimization via a time-base generator	Qinlong Lin et.al.	2409.03188	null
2024-09-11	Continual Skill and Task Learning via Dialogue	Weiwei Gu et.al.	2409.03166	null
2024-09-04	Subsidy design for better social outcomes	Maria-Florina Balcan et.al.	2409.03129	null
2024-09-04	RoboKoop: Efficient Control Conditioned Representations from Visual Input in Robotics using Koopman Operator	Hemant Kumawat et.al.	2409.03107	null
2024-09-04	Resilient Two-Time-Scale Local Stochastic Gradient Descent for Byzantine Federated Learning	Amit Dutta et.al.	2409.03092	null
2024-09-04	An Introduction to Centralized Training for Decentralized Execution in Cooperative Multi-Agent Reinforcement Learning	Christopher Amato et.al.	2409.03052	null
2024-09-04	Large Language Model-Based Agents for Software Engineering: A Survey	Junwei Liu et.al.	2409.02977	link
2024-09-03	Managing multiple agents by automatically adjusting incentives	Shunichi Akatsuka et.al.	2409.02960	null
2024-09-04	LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via Hybrid Architecture	Xidong Wang et.al.	2409.02889	link
2024-09-04	Bioinformatics Retrieval Augmentation Data (BRAD) Digital Assistant	Joshua Pickard et.al.	2409.02864	null
2024-09-06	Language Understanding as a Constraint on Consensus Size in LLM Societies	Giordano De Marzo et.al.	2409.02822	null
2024-09-04	Ion-specific Stability of Gold Nanoparticle Suspensions	Philipp Ritzert et.al.	2409.02762	null
2024-09-04	Adaptive Formation Learning Control for Cooperative AUVs under Complete Uncertainty	Emadodin Jandaghi et.al.	2409.02745	null
2024-09-04	Creating a Gen-AI based Track and Trace Assistant MVP (SuperTracy) for PostNL	Mohammad Reshadati et.al.	2409.02711	null
2024-09-04	Decision Transformer for Enhancing Neural Local Search on the Job Shop Scheduling Problem	Constantin Waubert de Puiseau et.al.	2409.02697	null
2024-09-04	Generalized Individual Q-learning for Polymatrix Games with Partial Observations	Ahmed Said Donmez et.al.	2409.02663	null
2024-09-04	A Survey on Emergent Language	Jannik Peters et.al.	2409.02645	null
2024-09-04	Evaluating Environments Using Exploratory Agents	Bobby Khaleque et.al.	2409.02632	null
2024-09-04	Advancing Cyber Incident Timeline Analysis Through Rule Based AI and Large Language Models	Fatma Yasmine Loumachi et.al.	2409.02572	null
2024-09-04	Vision-Language Navigation with Continual Learning	Zhiyuan Li et.al.	2409.02561	null
2024-09-05	A Sequential Decision-Making Model for Perimeter Identification	Ayal Taitler et.al.	2409.02549	null
2024-09-04	Astrochemistry on Galactic scales	L. Colzi et.al.	2409.02537	null
2024-09-04	Cog-GA: A Large Language Models-based Generative Agent for Vision-Language Navigation in Continuous Environments	Zhiyuan Li et.al.	2409.02522	null
2024-09-04	Continual Diffuser (CoD): Mastering Continual Offline Reinforcement Learning with Experience Rehearsal	Jifeng Hu et.al.	2409.02512	link
2024-09-04	Occlusion-Based Cooperative Transport for Concave Objects with a Swarm of Miniature Mobile Robots	Sanjuksha Nirgude et.al.	2409.02436	null
2024-09-04	Context-Aware Agent-based Model for Smart Long Distance Transport System	Muhammad Raees et.al.	2409.02434	null
2024-09-04	Building Math Agents with Multi-Turn Iterative Preference Learning	Wei Xiong et.al.	2409.02392	null
2024-09-04	Multi-modal Situated Reasoning in 3D Scenes	Xiongkun Linghu et.al.	2409.02389	null
2024-09-04	Neighbourhood conditions for network stability with link uncertainty	Simone Mariano et.al.	2409.02350	null
2024-09-03	Kinesthetic Teaching in Robotics: a Mixed Reality Approach	Simone Macci`o et.al.	2409.02305	null
2024-09-03	Multi-Agent Reinforcement Learning for Joint Police Patrol and Dispatch	Matthew Repasky et.al.	2409.02246	null
2024-09-02	AutoEncoder Convolutional Neural Network for Pneumonia Detection	Michael Nosa-Omoruyi et.al.	2409.02142	null
2024-09-01	TrajWeaver: Trajectory Recovery with State Propagation Diffusion Model	Jinming Wang et.al.	2409.02124	null
2024-09-05	Noise-free comparison of stochastic agent-based simulations using common random numbers	Daniel J. Klein et.al.	2409.02086	null
2024-09-03	A Modern Take on Visual Relationship Reasoning for Grasp Planning	Paolo Rabino et.al.	2409.02035	null
2024-09-03	Optimal allocations with capacity constrained verification	Albin Erlanson et.al.	2409.02031	null
2024-09-03	Planning to avoid ambiguous states through Gaussian approximations to non-linear sensors in active inference agents	Wouter M. Kouw et.al.	2409.01974	null
2024-09-03	Snapshot: Towards Application-centered Models for Pedestrian Trajectory Prediction in Urban Traffic Environments	Nico Uhlemann et.al.	2409.01971	null
2024-09-03	Achieving Maximin Share and EFX/EF1 Guarantees Simultaneously	Hannaneh Akrami et.al.	2409.01963	null
2024-09-03	Learning Resilient Formation Control of Drones with Graph Attention Network	Jiaping Xiao et.al.	2409.01953	null
2024-09-03	From Grounding to Planning: Benchmarking Bottlenecks in Web Agents	Segev Shlomov et.al.	2409.01927	null
2024-09-03	Focus Agent: LLM-Powered Virtual Focus Group	Taiyu Zhang et.al.	2409.01907	null
2024-09-03	What are the Essential Factors in Crafting Effective Long Context Multi-Hop Instruction Datasets? Insights and Best Practices	Zhi Chen et.al.	2409.01893	link
2024-09-03	AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction	Yuchen Shi et.al.	2409.01854	link
2024-09-03	Segmenting Object Affordances: Reproducibility and Sensitivity to Scale	Tommaso Apicella et.al.	2409.01814	link
2024-09-03	Empirical evidence of Large Language Model's influence on human spoken communication	Hiromu Yakura et.al.	2409.01754	null
2024-09-03	4D-CAT: Synthesis of 4D Coronary Artery Trees from Systole and Diastole	Daosong Hu et.al.	2409.01725	null
2024-09-03	VProChart: Answering Chart Question through Visual Perception Alignment Agent and Programmatic Solution Reasoning	Muye Huang et.al.	2409.01667	null
2024-09-03	T1-contrast Enhanced MRI Generation from Multi-parametric MRI for Glioma Patients with Latent Tumor Conditioning	Zach Eidex et.al.	2409.01622	null
2024-09-03	A Time-Intensity Aware Pipeline for Generating Late-Stage Breast DCE-MRI using Generative Adversarial Models	Ruben D. Fonnegra et.al.	2409.01596	null
2024-09-03	Convergence of the Heterogeneous Deffuant-Weisbuch Model: A Complete Proof and Some Extensions	Ge Chen et.al.	2409.01593	null
2024-09-03	An Implementation of Werewolf Agent That does not Truly Trust LLMs	Takehiro Sato et.al.	2409.01575	null
2024-09-03	Purification-Agnostic Proxy Learning for Agentic Copyright Watermarking against Adversarial Evidence Forgery	Erjin Bao et.al.	2409.01541	null
2024-09-03	Bridging the Gap Between Central and Local Decision-Making: The Efficacy of Collaborative Equilibria in Altruistic Congestion Games	Bryce L Ferguson et.al.	2409.01525	null
2024-09-02	The Compressor-Retriever Architecture for Language Model OS	Yuan Yang et.al.	2409.01495	link
2024-09-02	Watermarking of Quantum Circuits	Rupshali Roy et.al.	2409.01484	null
2024-09-02	Irreversible investment under weighted discounting: effects of decreasing impatience	Pengyu Wei et.al.	2409.01478	null
2024-09-02	Real-Time Recurrent Learning using Trace Units in Reinforcement Learning	Esraa Elelimy et.al.	2409.01449	null
2024-09-02	Performance-Aware Self-Configurable Multi-Agent Networks: A Distributed Submodular Approach for Simultaneous Coordination and Network Design	Zirui Xu et.al.	2409.01411	link
2024-09-02	GenAgent: Build Collaborative AI Systems with Automated Workflow Generation -- Case Studies on ComfyUI	Xiangyuan Xue et.al.	2409.01392	null
2024-09-02	Modeling contagious disease spreading	Dipak Patra et.al.	2409.01103	null
2024-09-02	Two-Timescale Synchronization and Migration for Digital Twin Networks: A Multi-Agent Deep Reinforcement Learning Approach	Wenshuai Liu et.al.	2409.01092	null
2024-09-02	Learning in Hybrid Active Inference Models	Poppy Collis et.al.	2409.01066	null
2024-09-02	Multiagent Reinforcement Learning Enhanced Decision-making of Crew Agents During Floor Construction Process	Bin Yang et.al.	2409.01060	null
2024-09-02	Accelerated Multi-objective Task Learning using Modified Q-learning Algorithm	Varun Prakash Rajamohan et.al.	2409.01046	null
2024-09-02	Federated Deep Reinforcement Learning-Based Intelligent Channel Access in Dense Wi-Fi Deployments	Xinyang Du et.al.	2409.01004	null
2024-09-02	Evolution of Social Norms in LLM Agents using Natural Language	Ilya Horiguchi et.al.	2409.00993	null
2024-09-02	Co-Learning: Code Learning for Multi-Agent Reinforcement Collaborative Framework with Conversational Natural Language Interfaces	Jiapeng Yu et.al.	2409.00985	link
2024-09-02	Semantically Controllable Augmentations for Generalizable Robot Learning	Zoey Chen et.al.	2409.00951	null
2024-09-02	Distributed Optimization under Edge Agreement with Application in Battery Network Management	Zehui Lu et.al.	2409.00936	null
2024-09-02	ToolACE: Winning the Points of LLM Function Calling	Weiwen Liu et.al.	2409.00920	null
2024-09-04	MarsCode Agent: AI-native Automated Bug Fixing	Yizhou Liu et.al.	2409.00899	null
2024-09-02	Whole-Body Control Through Narrow Gaps From Pixels To Action	Tianyue Wu et.al.	2409.00895	null
2024-09-01	Self-evolving Agents with reflective and memory-augmented abilities	Xuechen Liang et.al.	2409.00872	null
2024-09-01	JaxLife: An Open-Ended Agentic Simulator	Chris Lu et.al.	2409.00853	link
2024-09-01	Satisficing Equilibrium	Bary S. R. Pradelski et.al.	2409.00832	null
2024-09-01	Digital Homunculi: Reimagining Democracy Research with Generative Agents	Petr Specian et.al.	2409.00826	null
2024-09-01	Accelerating Hybrid Agent-Based Models and Fuzzy Cognitive Maps: How to Combine Agents who Think Alike?	Philippe J. Giabbanelli et.al.	2409.00824	null
2024-09-01	Cooperative Path Planning with Asynchronous Multiagent Reinforcement Learning	Jiaming Yin et.al.	2409.00754	null
2024-09-01	Simulation of Social Media-Driven Bubble Formation in Financial Markets using an Agent-Based Model with Hierarchical Influence Network	Gonzalo Bohorquez et.al.	2409.00742	link
2024-09-01	Fair Reciprocal Recommendation in Matching Markets	Yoji Tomita et.al.	2409.00720	link
2024-09-04	Multi-Agent Reinforcement Learning from Human Feedback: Data Coverage and Algorithmic Techniques	Natalia Zhang et.al.	2409.00717	null
2024-09-01	Universal Finite-State and Self-Stabilizing Computation in Anonymous Dynamic Networks	Giuseppe A. Di Luna et.al.	2409.00688	null
2024-09-01	A Learnable Agent Collaboration Network Framework for Personalized Multimodal AI Search Engine	Yunxiao Shi et.al.	2409.00636	null
2024-09-01	Roundabout Dilemma Zone Data Mining and Forecasting with Trajectory Prediction and Graph Neural Networks	Manthan Chelenahalli Satish et.al.	2409.00622	null
2024-09-01	TinyAgent: Function Calling at the Edge	Lutfi Eren Erdogan et.al.	2409.00608	null
2024-09-01	Average-case optimization analysis for distributed consensus algorithms on regular graphs	Nhat Trung Nguyen et.al.	2409.00605	null
2024-09-04	GenAI-powered Multi-Agent Paradigm for Smart Urban Mobility: Opportunities and Challenges for Integrating Large Language Models (LLMs) and Retrieval-Augmented Generation (RAG) with Intelligent Transportation Systems	Haowen Xu et.al.	2409.00494	null
2024-08-31	Online Learning of Interaction Dynamics with Dual Model Predictive Control for Multi-Agent Systems Using Gaussian Processes	T. M. J. T. Baltussen et.al.	2409.00432	null
2024-08-31	Chatting Up Attachment: Using LLMs to Predict Adult Bonds	Paulo Soares et.al.	2409.00347	null
2024-08-29	PrivacyLens: Evaluating Privacy Norm Awareness of Language Models in Action	Yijia Shao et.al.	2409.00138	link
2024-08-29	HoneyComb: A Flexible LLM-Based Agent System for Materials Science	Huan Zhang et.al.	2409.00135	null
2024-08-29	MAPF-GPT: Imitation Learning for Multi-Agent Pathfinding at Scale	Anton Andreychuk et.al.	2409.00134	link
2024-08-27	Evaluating the Impact of Multiple DER Aggregators on Wholesale Energy Markets: A Hybrid Mean Field Approach	Jun He et.al.	2409.00107	null
2024-08-27	Collective Predictive Coding as Model of Science: Formalizing Scientific Activities Towards Generative Science	Tadahiro Taniguchi et.al.	2409.00102	null
2024-08-27	Modelisation a base d'Agent Augmentes par LLM pour les Simulations Sociales: Defis et Opportunites	Önder Gürcan et.al.	2409.00100	null
2024-08-24	Towards Human-Level Understanding of Complex Process Engineering Schematics: A Pedagogical, Introspective Multi-Agent Framework for Open-Domain Question Answering	Sagar Srinivas Sakhinana et.al.	2409.00082	null
2024-08-30	Robust Technology Regulation	Andrew Koh et.al.	2408.17398	null
2024-08-30	Traffic expertise meets residual RL: Knowledge-informed model-based residual reinforcement learning for CAV trajectory control	Zihao Sheng et.al.	2408.17380	link
2024-08-30	EMPOWER: Embodied Multi-role Open-vocabulary Planning with Online Grounding and Execution	Francesco Argenziano et.al.	2408.17379	null
2024-08-30	Non-reciprocal spin-glass transition and aging	Giulia Garcia Lorenzana et.al.	2408.17360	null
2024-08-30	Why do elites extend property rights: unlocking investment and the switch to public goods	Alastair Langtry et.al.	2408.17335	null
2024-08-30	All You Need is Group Actions: Advancing Robust Autonomous Planning	Vincenzo Basco et.al.	2408.17295	null
2024-08-30	Predicting the Impact of Generative AI Using an Agent-Based Model	Joao Tiago Aparicio et.al.	2408.17268	null
2024-08-30	Using Quantum Solved Deep Boltzmann Machines to Increase the Data Efficiency of RL Agents	Daniel Kent et.al.	2408.17240	null
2024-08-30	Asynchronous Distributed Learning with Quantized Finite-Time Coordination	Nicola Bastianello et.al.	2408.17156	null
2024-08-30	Efficient Camera Exposure Control for Visual Odometry via Deep Reinforcement Learning	Shuyang Zhang et.al.	2408.17005	link
2024-08-30	Characterizing User Platforms for Video Streaming in Broadband Networks	Yifan Wang et.al.	2408.16995	link
2024-08-30	Tool-Assisted Agent on SQL Inspection and Refinement in Real-World Scenarios	Zhongyuan Wang et.al.	2408.16991	null
2024-08-30	Beyond Preferences in AI Alignment	Tan Zhi-Xuan et.al.	2408.16984	null
2024-08-30	The Sample-Communication Complexity Trade-off in Federated Q-Learning	Sudeep Salgia et.al.	2408.16981	null
2024-08-30	Discovery of False Data Injection Schemes on Frequency Controllers with Reinforcement Learning	Romesh Prasad et.al.	2408.16958	null
2024-08-29	Robotic warehousing operations: a learn-then-optimize approach to large-scale neighborhood search	Cynthia Barnhart et.al.	2408.16890	null
2024-08-29	Learning Multi-agent Multi-machine Tending by Mobile Robots	Abdalwhab Abdalwhab et.al.	2408.16875	null
2024-08-29	A framework for training and benchmarking algorithms that schedule robot tasks	Wojciech Dudek et.al.	2408.16844	null
2024-08-29	AdapShare: An RL-Based Dynamic Spectrum Sharing Solution for O-RAN	Sneihil Gopal et.al.	2408.16842	null
2024-08-29	A Bibliometric Analysis of Trust in Conversational Agents over the Past Fifteen Years	Meltem Aksoy et.al.	2408.16837	null
2024-08-29	Maelstrom Networks	Matthew Evanusa et.al.	2408.16632	null
2024-08-29	On the data-sparsity of the solution of Riccati equations with applications to feedback control	Stefano Massei et.al.	2408.16569	null
2024-08-29	CooTest: An Automated Testing Approach for V2X Communication Systems	An Guo et.al.	2408.16470	null
2024-08-29	Consensus Planning with Primal, Dual, and Proximal Agents	Alvaro Maggiar et.al.	2408.16462	null
2024-08-29	3D Topological Modeling and Multi-Agent Movement Simulation for Viral Infection Risk Analysis	Wassim Jabi et.al.	2408.16417	null
2024-09-04	Efficient Multi-agent Navigation with Lightweight DRL Policy	Xingrong Diao et.al.	2408.16370	null
2024-08-29	Guided Reasoning: A Non-Technical Introduction	Gregor Betz et.al.	2408.16331	link
2024-08-29	Autocorrelation properties of temporal networks governed by dynamic node variables	Harrison Hartle et.al.	2408.16270	null
2024-08-29	Action potential dynamics on heterogenous neural networks: from kinetic to macroscopic equations	Marzia Bisi et.al.	2408.16214	null
2024-08-28	DECAF: a Discrete-Event based Collaborative Human-Robot Framework for Furniture Assembly	Giulio Giacomuzzo et.al.	2408.16125	null
2024-08-28	EPO: Hierarchical LLM Agents with Environment Preference Optimization	Qi Zhao et.al.	2408.16090	null
2024-08-28	Logic-Enhanced Language Model Agents for Trustworthy Social Simulations	Agnieszka Mensfelt et.al.	2408.16081	link
2024-08-28	Hitting the Gym: Reinforcement Learning Control of Exercise-Strengthened Biohybrid Robots in Simulation	Saul Schaffer et.al.	2408.16069	null
2024-08-28	An Extremely Data-efficient and Generative LLM-based Reinforcement Learning Agent for Recommenders	Shuang Feng et.al.	2408.16032	null
2024-08-28	Thoughtseeds: Evolutionary Priors, Nested Markov Blankets, and the Emergence of Embodied Cognition	Prakash Chandra Kavi et.al.	2408.15982	null
2024-08-28	WebPilot: A Versatile and Autonomous Multi-Agent System for Web Task Execution with Strategic Exploration	Yao Zhang et.al.	2408.15978	null
2024-08-28	BattleAgentBench: A Benchmark for Evaluating Cooperation and Competition Capabilities of Language Models in Multi-Agent Systems	Wei Wang et.al.	2408.15971	null
2024-08-28	Atari-GPT: Investigating the Capabilities of Multimodal Large Language Models as Low-Level Policies for Atari Games	Nicholas R. Waytowich et.al.	2408.15950	null
2024-08-28	Auxiliary Input in Training: Incorporating Catheter Features into Deep Learning Models for ECG-Free Dynamic Coronary Roadmapping	Yikang Liu et.al.	2408.15947	null
2024-09-02	Persuasion Games using Large Language Models	Ganesh Prasath Ramani et.al.	2408.15879	null
2024-08-28	Retrieval-Augmented Instruction Tuning for Automated Process Engineering Calculations : A Tool-Chaining Problem-Solving Framework with Attributable Reflection	Sagar Srinivas Sakhinana et.al.	2408.15866	null
2024-08-28	FlowAct: A Proactive Multimodal Human-robot Interaction System with Continuous Flow of Perception and Modular Action Sub-systems	Timothée Dhaussy et.al.	2408.15864	null
2024-08-28	Interactive Agents: Simulating Counselor-Client Psychological Counseling via Role-Playing LLM-to-LLM Interactions	Huachuan Qiu et.al.	2408.15787	link
2024-09-05	LogicGame: Benchmarking Rule-Based Reasoning Abilities of Large Language Models	Jiayi Gui et.al.	2408.15778	null
2024-08-28	A Survey on Evaluation of Multimodal Large Language Models	Jiaxing Huang et.al.	2408.15769	null
2024-08-28	Evaluating and Comparing Crowd Simulations: Perspectives from a Crowd Authoring Tool	Gabriel Fonseca Silva et.al.	2408.15762	null
2024-09-01	Reinforcement Learning for Adaptive Traffic Signal Control: Turn-Based and Time-Based Approaches to Reduce Congestion	Muhammad Tahir Rafique et.al.	2408.15751	null
2024-08-28	Different Facets for Different Experts: A Framework for Streamlining The Integration of Qualitative Insights into ABM Development	Vivek Nallur et.al.	2408.15725	null
2024-08-28	Skills Regularized Task Decomposition for Multi-task Offline Reinforcement Learning	Minjong Yoo et.al.	2408.15593	null
2024-08-28	TrafficGamer: Reliable and Flexible Traffic Simulation for Safety-Critical Scenarios with Game-Theoretic Oracles	Guanren Qiao et.al.	2408.15538	link
2024-08-28	Towards Fully Autonomous Research Powered by LLMs: Case Study on Simulations	Zhihan Liu et.al.	2408.15512	link
2024-08-28	AeroVerse: UAV-Agent Benchmark Suite for Simulating, Pre-training, Finetuning, and Evaluating Aerospace Embodied World Models	Fanglong Yao et.al.	2408.15511	null
2024-08-28	Infinite-Horizon Optimal Wireless Control Over Shared State-Dependent Fading Channels for IIoT Systems	Shuling Wang et.al.	2408.15492	null
2024-08-27	Graph Attention Inference of Network Topology in Multi-Agent Systems	Akshay Kolli et.al.	2408.15449	null
2024-08-27	Fast and Modular Autonomy Software for Autonomous Racing Vehicles	Andrew Saba et.al.	2408.15425	null
2024-09-04	Simultaneous Training of First- and Second-Order Optimizers in Population-Based Reinforcement Learning	Felix Pfeiffer et.al.	2408.15421	null
2024-08-27	On Stateful Value Factorization in Multi-Agent Reinforcement Learning	Enrico Marchesini et.al.	2408.15381	null
2024-08-27	A Multi-Agent Reinforcement Learning Scheme for SFC Placement in Edge Computing Networks	Congzhou Li et.al.	2408.15337	null
2024-08-27	Artificially intelligent Maxwell's demon for optimal control of open quantum systems	Paolo Andrea Erdman et.al.	2408.15328	null
2024-08-27	TourSynbio: A Multi-Modal Large Model and Agent Framework to Bridge Text and Protein Sequences for Protein Engineering	Yiqing Shen et.al.	2408.15299	link
2024-08-27	Into the Unknown Unknowns: Engaged Human Learning through Participation in Language Model Agent Conversations	Yucheng Jiang et.al.	2408.15232	null
2024-08-27	Exploiting Approximate Symmetry for Efficient Multi-Agent Reinforcement Learning	Batuhan Yardim et.al.	2408.15173	null
2024-08-27	Applications in CityLearn Gym Environment for Multi-Objective Control Benchmarking in Grid-Interactive Buildings and Districts	Kingsley Nweye et.al.	2408.15170	null
2024-08-27	Delay as Payoff in MAB	Ofir Schlisselberg et.al.	2408.15158	null
2024-08-27	muPRL: A Mutation Testing Pipeline for Deep Reinforcement Learning based on Real Faults	Deepak-George Thomas et.al.	2408.15150	link
2024-08-29	No Regrets: Investigating and Improving Regret Approximations for Curriculum Discovery	Alexander Rutherford et.al.	2408.15099	link
2024-08-23	Flexible categorization using formal concept analysis and Dempster-Shafer theory	Marcel Boersma et.al.	2408.15012	null
2024-08-27	AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems	Chi-Min Chan et.al.	2408.14972	link
2024-08-27	The Asymptotic Cost of Complexity	Martin W Cripps et.al.	2408.14949	null
2024-08-27	Decentralized Unlabeled Multi-agent Pathfinding Via Target And Priority Swapping (With Supplementary)	Stepan Dergachev et.al.	2408.14948	link
2024-08-27	Learning Robust Reward Machines from Noisy Labels	Roko Parac et.al.	2408.14871	link
2024-08-27	Diffusion Models Are Real-Time Game Engines	Dani Valevski et.al.	2408.14837	null
2024-08-27	Sequential-Scanning Dual-Energy CT Imaging Using High Temporal Resolution Image Reconstruction and Error-Compensated Material Basis Image Generation	Qiaoxin Li et.al.	2408.14754	null
2024-08-27	Sub-Riemannian Geometry, Mixing, and the Holonomy of Optimal Mass Transport	Mahmoud Abdelgalil et.al.	2408.14707	null
2024-08-26	Model-Based Reinforcement Learning for Control of Strongly-Disturbed Unsteady Aerodynamic Flows	Zhecheng Liu et.al.	2408.14685	null
2024-08-26	Emergent Language in Open-Ended Environments	Cornelius Wolff et.al.	2408.14649	null
2024-08-26	Biased Dueling Bandits with Stochastic Delayed Feedback	Bongsoo Yi et.al.	2408.14603	null
2024-08-26	On Centralized Critics in Multi-Agent Reinforcement Learning	Xueguang Lyu et.al.	2408.14597	link
2024-08-26	Multi-Agent Path Finding with Real Robot Dynamics and Interdependent Tasks for Automated Warehouses	Vassilissa Lehoux-Lebacque et.al.	2408.14527	null
2024-08-26	A Survey on Reinforcement Learning Applications in SLAM	Mohammad Dehghani Tezerjani et.al.	2408.14518	null
2024-08-24	Artificial intelligence for science: The easy and hard problems	Ruairidh M. Battleday et.al.	2408.14508	null
2024-08-23	Knowledge Graph Modeling-Driven Large Language Model Operating System (LLM OS) for Task Automation in Process Engineering Problem-Solving	Sakhinana Sagar Srinivas et.al.	2408.14494	null
2024-08-18	Agentic Retrieval-Augmented Generation for Time Series Analysis	Chidaksh Ravuru et.al.	2408.14484	null
2024-08-26	Employing Artificial Intelligence to Steer Exascale Workflows with Colmena	Logan Ward et.al.	2408.14434	null
2024-08-26	SWE-bench-java: A GitHub Issue Resolving Benchmark for Java	Daoguang Zan et.al.	2408.14354	link
2024-09-03	Foundation Models for Music: A Survey	Yinghao Ma et.al.	2408.14340	link
2024-08-26	Equivariant Reinforcement Learning under Partial Observability	Hai Nguyen et.al.	2408.14336	null
2024-08-26	LLM-3D Print: Large Language Models To Monitor and Control 3D Printing	Yayati Jadhav et.al.	2408.14307	null
2024-08-26	Fact Probability Vector Based Goal Recognition	Nils Wilken et.al.	2408.14224	link
2024-08-26	Robot Navigation with Entity-Based Collision Avoidance using Deep Reinforcement Learning	Yury Kolomeytsev et.al.	2408.14183	null
2024-08-26	"Hi. I'm Molly, Your Virtual Interviewer!" -- Exploring the Impact of Race and Gender in AI-powered Virtual Interview Experiences	Shreyan Biswas et.al.	2408.14159	null
2024-08-26	Investigating the effect of Mental Models in User Interaction with an Adaptive Dialog Agent	Lindsey Vanderlyn et.al.	2408.14154	null
2024-09-02	MLR-Copilot: Autonomous Machine Learning Research based on Large Language Models Agents	Ruochen Li et.al.	2408.14033	link
2024-08-26	Optimizing TD3 for 7-DOF Robotic Arm Grasping: Overcoming Suboptimality with Exploration-Enhanced Contrastive Learning	Wen-Han Hsieh et.al.	2408.14009	null
2024-08-26	Decentralized Federated Learning with Model Caching on Mobile Agents	Xiaoyu Wang et.al.	2408.14001	null
2024-08-26	Quantitative Representation of Scenario Difficulty for Autonomous Driving Based on Adversarial Policy Search	Shuo Yang et.al.	2408.14000	null
2024-08-26	AgentMove: Predicting Human Mobility Anywhere Using Large Language Model based Agentic Framework	Jie Feng et.al.	2408.13986	link
2024-08-25	CoT Rerailer: Enhancing the Reliability of Large Language Models in Complex Reasoning Tasks through Error Detection and Correction	Guangya Wan et.al.	2408.13940	null
2024-08-25	Safe Policy Exploration Improvement via Subgoals	Brian Angulo et.al.	2408.13881	null
2024-08-25	Flexible game-playing AI with AlphaViT: adapting to multiple games and board sizes	Kazuhisa Fujita et.al.	2408.13871	null
2024-08-25	Informativeness and Trust in Bayesian Persuasion	Reema Deori et.al.	2408.13822	null
2024-08-25	Optical Inversion Using Plasmonic Contrast Agents	Xinlin Cao et.al.	2408.13793	null
2024-08-25	Demo: Generative Open xG Network Simulation with Multi-Agent LLM and ns-3 (GenOnet)	Farhad Rezazadeh et.al.	2408.13781	null
2024-08-25	MASQ: Multi-Agent Reinforcement Learning for Single Quadruped Robot Locomotion	Qi Liu et.al.	2408.13759	null
2024-08-25	Localization and Expansion: A Decoupled Framework for Point Cloud Few-shot Semantic Segmentation	Zhaoyang Li et.al.	2408.13752	null
2024-08-25	Multi-Agent Target Assignment and Path Finding for Intelligent Warehouse: A Cooperative Multi-Agent Deep Reinforcement Learning Perspective	Qi Liu et.al.	2408.13750	null
2024-08-25	Count-based Novelty Exploration in Classical Planning	Giacomo Rosa et.al.	2408.13719	null
2024-08-24	How to guide a present-biased agent through prescribed tasks?	Tatiana Belova et.al.	2408.13675	null
2024-08-24	Temporal Elections: Welfare, Strategyproofness, and Proportionality	Edith Elkind et.al.	2408.13637	null
2024-08-24	DeepVoting: Learning Voting Rules with Tailored Embeddings	Leonardo Matone et.al.	2408.13630	null
2024-08-24	Reaching New Heights in Multi-Agent Collective Construction	Martin Rameš et.al.	2408.13615	null
2024-08-24	Hybrid Training for Enhanced Multi-task Generalization in Multi-agent Reinforcement Learning	Mingliang Zhang et.al.	2408.13567	null
2024-08-27	Control-Informed Reinforcement Learning for Chemical Processes	Maximilian Bloor et.al.	2408.13566	link
2024-08-24	IQA-EVAL: Automatic Evaluation of Human-Model Interactive Question Answering	Ruosen Li et.al.	2408.13545	null
2024-08-24	Unleashing Collaborative Computing for Adaptive Video Streaming with Multi-objective Optimization in Satellite Terrestrial Networks	Zhishu Shen et.al.	2408.13512	null
2024-08-23	Optimizing Collaboration of LLM based Agents for Finite Element Analysis	Chuan Tian et.al.	2408.13406	null
2024-08-23	DrugAgent: Explainable Drug Repurposing Agent with Large Language Model-based Reasoning	Yoshitaka Inoue et.al.	2408.13378	null
2024-08-23	Generative Blockchain: Transforming Blockchain from Transaction Recording to Transaction Generation through Proof-of-Merit	Haozhao Zhang et.al.	2408.13367	null
2024-08-23	Reconciling Different Theories of Learning with an Agent-based Model of Procedural Learning	Sina Rismanchian et.al.	2408.13364	null
2024-08-23	Oscillatory and Excitable Dynamics in an Opinion Model with Group Opinions	Corbit R. Sampson et.al.	2408.13336	link
2024-08-23	Mastering the Digital Art of War: Developing Intelligent Combat Simulation Agents for Wargaming Using Hierarchical Reinforcement Learning	Scotty Black et.al.	2408.13333	null
2024-08-23	Localized Observation Abstraction Using Piecewise Linear Spatial Decay for Reinforcement Learning in Combat Simulations	Scotty Black et.al.	2408.13328	null
2024-08-23	Large Language Models for Zero Touch Network Configuration Management	Oscar G. Lira et.al.	2408.13298	null
2024-08-23	The Ultimate Guide to Fine-Tuning LLMs from Basics to Breakthroughs: An Exhaustive Review of Technologies, Research, Best Practices, Applied Research Challenges and Opportunities	Venkatesh Balavadhani Parthasarathy et.al.	2408.13296	null
2024-08-23	Optimally Solving Simultaneous-Move Dec-POMDPs: The Sequential Central Planning Approach	Johan Peralez et.al.	2408.13139	null
2024-08-18	An Introduction to Cognidynamics	Marco Gori et.al.	2408.13112	null
2024-08-23	Diffusion-based Episodes Augmentation for Offline Multi-Agent Reinforcement Learning	Jihwan Oh et.al.	2408.13092	null
2024-09-01	Controllable Financial Market Generation with Diffusion Guided Meta Agent	Yu-Hao Huang et.al.	2408.12991	null
2024-08-26	Zeoformer: Coarse-Grained Periodic Graph Transformer for OSDA-Zeolite Affinity Prediction	Xiangxiang Shen et.al.	2408.12984	null
2024-08-23	Informational Embodiment: Computational role of information structure in codes and robots	Alexandre Pitti et.al.	2408.12950	null
2024-08-23	Complete Graph Identification in Population Protocols	Haruki Kanaya et.al.	2408.12862	null
2024-08-23	Online Fair Division with Contextual Bandits	Arun Verma et.al.	2408.12845	null
2024-08-23	LIMP: Large Language Model Enhanced Intent-aware Mobility Prediction	Songwei Li et.al.	2408.12832	link
2024-08-23	From Mobilisation to Radicalisation: Probing the Persistence and Radicalisation of Social Movements Using an Agent-Based Model	Emma F. Thomas et.al.	2408.12795	null
2024-08-23	Environment-Centric Active Inference	Kanako Esaki et.al.	2408.12777	null
2024-08-27	Intelligent OPC Engineer Assistant for Semiconductor Manufacturing	Guojin Chen et.al.	2408.12775	null
2024-08-22	Free-breathing 3D cardiac extracellular volume (ECV) mapping using a linear tangent space alignment (LTSA) model	Wonil Lee et.al.	2408.12706	null
2024-09-01	Can LLMs Understand Social Norms in Autonomous Driving Games?	Boxuan Wang et.al.	2408.12680	null
2024-08-22	Integrating an agent-based behavioral model in microtransit forecasting and revenue management	Xiyuan Ren et.al.	2408.12577	null
2024-08-25	MuMA-ToM: Multi-modal Multi-Agent Theory of Mind	Haojun Shi et.al.	2408.12574	link
2024-08-22	PCGRL+: Scaling, Control and Generalization in Reinforcement Learning Level Generators	Sam Earle et.al.	2408.12525	null
2024-08-22	Stochastic Online Correlated Selection	Ziyun Chen et.al.	2408.12524	null
2024-08-22	Weighted Envy-Freeness in House Allocation	Sijia Dai et.al.	2408.12523	null
2024-08-22	MEDCO: Medical Education Copilots Based on A Multi-Agent Framework	Hao Wei et.al.	2408.12496	null
2024-08-22	Multi Agent Framework for Collective Intelligence Research	Alexandru Dochian et.al.	2408.12391	link
2024-08-22	Recursive Distributed Collaborative Aided Inertial Navigation	Roland Jung et.al.	2408.12360	link
2024-09-04	Graph Retrieval Augmented Trustworthiness Reasoning	Ying Zhu et.al.	2408.12333	link
2024-08-22	Can Artificial Intelligence Embody Moral Values?	Torben Swoboda et.al.	2408.12250	null
2024-08-22	Time Optimal Distance- $k$ -Dispersion on Dynamic Ring	Brati Mondal et.al.	2408.12220	null
2024-08-22	MDD-5k: A New Diagnostic Conversation Dataset for Mental Disorders Synthesized via Neuro-Symbolic LLM Agents	Congchi Yin et.al.	2408.12142	link
2024-08-22	An evidence-accumulating drift-diffusion model of competing information spread on networks	Julien Corsin et.al.	2408.12127	null
2024-08-22	Emotion-Agent: Unsupervised Deep Reinforcement Learning with Distribution-Prototype Reward for Continuous Emotional EEG Analysis	Zhihao Zhou et.al.	2408.12121	null
2024-08-22	Balancing Act: Prioritization Strategies for LLM-Designed Restless Bandit Rewards	Shresth Verma et.al.	2408.12112	null
2024-08-22	Distributed Noncoherent Joint Transmission Based on Multi-Agent Reinforcement Learning for Dense Small Cell MISO Systems	Shaozhuang Bai et.al.	2408.12067	null
2024-08-21	Empirical Equilibria in Agent-based Economic systems with Learning agents	Kshama Dwarakanath et.al.	2408.12038	null
2024-08-21	Reasoning and Tools for Human-Level Forecasting	Elvis Hsieh et.al.	2408.12036	null
2024-08-21	Understanding Epistemic Language with a Bayesian Theory of Mind	Lance Ying et.al.	2408.12022	null
2024-08-21	Controlling nonergodicity in quantum many-body systems by reinforcement learning	Li-Li Ye et.al.	2408.11989	link
2024-08-21	Advances in Preference-based Reinforcement Learning: A Review	Youssef Abdelkareem et.al.	2408.11943	null
2024-08-21	Distributed alternating gradient descent for convex semi-infinite programs over a network	Ashwin Aravind et.al.	2408.11937	null
2024-08-21	Spline tie-decay temporal networks	Chanon Thongprayoon et.al.	2408.11913	null
2024-08-21	Efficient Exploration and Discriminative World Model Learning with an Object-Centric Abstraction	Anthony GX-Chen et.al.	2408.11816	null
2024-08-21	EmbodiedSAM: Online Segment Any 3D Thing in Real Time	Xiuwei Xu et.al.	2408.11811	null
2024-08-21	Story3D-Agent: Exploring 3D Storytelling Visualization with Large Language Models	Yuzhou Huang et.al.	2408.11801	null
2024-08-21	Leveraging Chemistry Foundation Models to Facilitate Structure Focused Retrieval Augmented Generation in Multi-Agent Workflows for Catalyst and Materials Design	Nathaniel H. Park et.al.	2408.11793	null
2024-08-21	DreamFactory: Pioneering Multi-Scene Long Video Generation with a Multi-Agent Framework	Zhifei Xie et.al.	2408.11788	null
2024-08-21	Leveraging Fine-Tuned Retrieval-Augmented Generation with Long-Context Support: For 3GPP Standards	Omar Erak et.al.	2408.11775	link
2024-08-21	Deviations from the Nash equilibrium and emergence of tacit collusion in a two-player optimal execution game with reinforcement learning	Fabrizio Lillo et.al.	2408.11773	null
2024-08-21	VIRIS: Simulating indoor airborne transmission combining architectural design and people movement	Yidan Xue et.al.	2408.11772	link
2024-08-23	Consensus over Clustered Networks Using Intermittent and Asynchronous Output Feedback	Federico M. Zegers et.al.	2408.11752	null
2024-08-21	Bayesian Optimization Framework for Efficient Fleet Design in Autonomous Multi-Robot Exploration	David Molina Concha et.al.	2408.11751	null
2024-08-21	Open-Ended 3D Point Cloud Instance Segmentation	Phuc D. A. Nguyen et.al.	2408.11747	null
2024-08-21	Less is more: AI Decision-Making using Dynamic Deep Neural Networks for Short-Term Stock Index Prediction	CJ Finnegan et.al.	2408.11740	null
2024-08-22	LLM4VV: Exploring LLM-as-a-Judge for Validation and Verification Testsuites	Zachariah Sollenberger et.al.	2408.11729	null
2024-08-21	Networked Communication for Mean-Field Games with Function Approximation and Empirical Mean-Field Estimation	Patrick Benjamin et.al.	2408.11607	null
2024-08-21	Optimizing QoS in HD Map Updates: Cross-Layer Multi-Agent with Hierarchical and Independent Learning	Jeffrey Redondo et.al.	2408.11605	null
2024-08-21	Cause-Aware Empathetic Response Generation via Chain-of-Thought Fine-Tuning	Xinhao Chen et.al.	2408.11599	null
2024-08-21	Drama Engine: A Framework for Narrative Agents	Martin Pichlmair et.al.	2408.11574	null
2024-08-21	AutoDirector: Online Auto-scheduling Agents for Multi-sensory Composition	Minheng Ni et.al.	2408.11564	null
2024-08-21	Semi-supervised 3D Semantic Scene Completion with 2D Vision Foundation Model Guidance	Duc-Hai Pham et.al.	2408.11559	null
2024-08-21	Fixation of leadership in non-Markovian growth processes	Tejas Iyer et.al.	2408.11516	null
2024-08-21	Verifying Approximate Equilibrium in Auctions	Fabian R. Pieroth et.al.	2408.11445	null
2024-08-21	Subgoal-based Hierarchical Reinforcement Learning for Multi-Agent Collaboration	Cheng Xu et.al.	2408.11416	link
2024-08-21	Towards "Differential AI Psychology" and in-context Value-driven Statement Alignment with Moral Foundations Theory	Simon Münker et.al.	2408.11415	null
2024-08-21	Swarm Intelligence in Geo-Localization: A Multi-Agent Large Vision-Language Model Collaborative Framework	Xiao Han et.al.	2408.11312	null
2024-08-20	CooPre: Cooperative Pretraining for V2X Cooperative Perception	Seth Z. Zhao et.al.	2408.11241	null
2024-08-20	Optimization of Multi-Agent Flying Sidekick Traveling Salesman Problem over Road Networks	Ruixiao Yang et.al.	2408.11187	null
2024-08-20	Autonomous Negotiation Using Comparison-Based Gradient Estimation	Surya Murthy et.al.	2408.11186	link
2024-08-20	Range-based Multi-Robot Integrity Monitoring Against Cyberattacks and Faults: An Anchor-Free Approach	Vishnu Vijay et.al.	2408.11155	null
2024-08-20	Accelerating Goal-Conditioned RL Algorithms and Research	Michał Bortkiewicz et.al.	2408.11052	link
2024-08-20	FLAME: Learning to Navigate with Multimodal LLM in Urban Environments	Yunzhe Xu et.al.	2408.11051	link
2024-08-23	MagicDec: Breaking the Latency-Throughput Tradeoff for Long Context Generation with Speculative Decoding	Jian Chen et.al.	2408.11049	link
2024-08-20	Athena: Safe Autonomous Agents with Verbal Contrastive Learning	Tanmana Sadhu et.al.	2408.11021	null
2024-08-20	The Evolution of Reinforcement Learning in Quantitative Finance	Nikolaos Pippas et.al.	2408.10932	null
2024-08-20	All Robots in One: A New Standard and Unified Dataset for Versatile, General-Purpose Embodied Agents	Zhiqiang Wang et.al.	2408.10899	null
2024-08-23	DBHP: Trajectory Imputation in Multi-Agent Sports Using Derivative-Based Hybrid Prediction	Hanjun Choi et.al.	2408.10878	null
2024-08-20	More Options for Prelabor Rupture of Membranes, A Bayesian Analysis	Ashley Klein et.al.	2408.10876	null
2024-08-20	Multi-agent Multi-armed Bandits with Stochastic Sharable Arm Capacities	Hong Xie et.al.	2408.10865	null
2024-08-20	Knowledge Sharing and Transfer via Centralized Reward Agent for Multi-Task Reinforcement Learning	Haozhe Ma et.al.	2408.10858	link
2024-08-20	Learning Randomized Algorithms with Transformers	Johannes von Oswald et.al.	2408.10818	null
2024-08-20	Multi-Agent Based Simulation for Decentralized Electric Vehicle Charging Strategies and their Impacts	Kristoffer Christensen et.al.	2408.10790	null
2024-08-20	Multi-agent based modeling for investigating excess heat utilization from electrolyzer production to district heating network	Kristoffer Christensen et.al.	2408.10783	null
2024-08-20	Multi-Agent Based Simulation for Investigating Centralized Charging Strategies and their Impact on Electric Vehicle Home Charging Ecosystem	Kristoffer Christensen et.al.	2408.10773	null
2024-08-20	PhishAgent: A Robust Multimodal Agent for Phishing Webpage Detection	Tri Cao et.al.	2408.10738	null
2024-08-20	Investigating Context Effects in Similarity Judgements in Large Language Models	Sagar Uprety et.al.	2408.10711	null
2024-08-20	Genesis: Towards the Automation of Systems Biology Research	Ievgeniia A. Tiukova et.al.	2408.10689	null
2024-08-20	Neural Exploratory Landscape Analysis	Zeyuan Ma et.al.	2408.10672	null
2024-08-20	Incorporating a 'ladder of trust' into dynamic Allocation of Function in Human-Autonomous Agent Collectives	Chris Baber et.al.	2408.10654	null
2024-08-20	Variations on distributed belief	John Lindqvist et.al.	2408.10637	null
2024-08-20	Strategist: Learning Strategic Skills by LLMs via Bi-Level Tree Search	Jonathan Light et.al.	2408.10635	null
2024-08-21	MUSES: 3D-Controllable Image Generation via Multi-Modal Agent Collaboration	Yanbo Ding et.al.	2408.10605	null
2024-08-20	Fast Collective Evasion in Self-Localized Swarms of Unmanned Aerial Vehicles	Filip Novák et.al.	2408.10596	null
2024-08-20	Synchronization behind Learning in Periodic Zero-Sum Games Triggers Divergence from Nash equilibrium	Yuma Fujimoto et.al.	2408.10595	null
2024-08-20	Bidirectional Intent Communication: A Role for Large Foundation Models	Tim Schreiter et.al.	2408.10589	null
2024-08-20	DEGAS: Detailed Expressions on Full-Body Gaussian Avatars	Zhijing Shao et.al.	2408.10588	null
2024-08-20	Hokoff: Real Game Dataset from Honor of Kings and its Offline Reinforcement Learning Benchmarks	Yun Qu et.al.	2408.10556	link
2024-08-20	Semi-on-Demand Off-Peak Transit Services with Shared Autonomous Vehicles -- Service Planning, Simulation, and Analysis in Munich, Germany	Max T. M. Ng et.al.	2408.10547	null
2024-08-20	Subspace Prototype Guidance for Mitigating Class Imbalance in Point Cloud Semantic Segmentation	Jiawei Han et.al.	2408.10537	link
2024-08-20	Approximate Estimation of High-dimension Execution Skill for Dynamic Agents in Continuous Domains	Delma Nieves-Rivera et.al.	2408.10512	null
2024-08-20	Evaluation Framework for AI-driven Molecular Design of Multi-target Drugs: Brain Diseases as a Case Study	Arthur Cerveira et.al.	2408.10482	link
2024-08-24	IDEA:Enhancing the Rule Learning Ability of Language Agents through Induction, Deduction, and Abduction	Kaiyu He et.al.	2408.10455	null
2024-08-19	Kubrick: Multimodal Agent Collaborations for Synthetic Video Generation	Liu He et.al.	2408.10453	null
2024-08-19	Tax Credits and Household Behavior: The Roles of Myopic Decision-Making and Liquidity in a Simulated Economy	Jialin Dong et.al.	2408.10391	null
2024-08-19	Narrowing the Gap between Vision and Action in Navigation	Yue Zhang et.al.	2408.10388	link
2024-08-19	Competing Social Contagions with Opinion Dependent Infectivity	Corbit R. Sampson et.al.	2408.10373	link
2024-08-19	Toward Fair and Strategyproof Tournament Rules for Tournaments with Partially Transferable Utilities	David Pennock et.al.	2408.10346	null
2024-08-17	Why and How do Complex Systems Self-Organize at All? Average Action Efficiency as a Predictor, Measure, Driver, and Mechanism of Self-Organization	Matthew J Brouillet et.al.	2408.10278	null
2024-08-19	Don't Get Stuck: A Deadlock Recovery Approach	Francesca Baldini et.al.	2408.10167	null
2024-08-19	Learning Precise Affordances from Egocentric Videos for Robotic Manipulation	Gen Li et.al.	2408.10123	null
2024-08-19	Enhancing Reinforcement Learning Through Guided Search	Jérôme Arjonilla et.al.	2408.10113	null
2024-08-19	No Screening is More Efficient with Multiple Objects	Shunya Noda et.al.	2408.10077	null
2024-08-19	Synthesis of Reward Machines for Multi-Agent Equilibrium Design (Full Version)	Muhammad Najib et.al.	2408.10074	null
2024-08-19	Near-Optimal Mechanisms for Resource Allocation Without Monetary Transfers	Moise Blanchard et.al.	2408.10066	null
2024-08-19	The Practimum-Optimum Algorithm for Manufacturing Scheduling: A Paradigm Shift Leading to Breakthroughs in Scale and Performance	Moshe BenBassat et.al.	2408.10040	null
2024-08-19	The Expressive Power of Uniform Population Protocols with Logarithmic Space	Philipp Czerner et.al.	2408.10027	null
2024-08-19	Adaptive BESS and Grid Setpoints Optimization: A Model-Free Framework for Efficient Battery Management under Dynamic Tariff Pricing	Alaa Selim et.al.	2408.09989	null
2024-08-19	The Exploration-Exploitation Dilemma Revisited: An Entropy Perspective	Renye Yan et.al.	2408.09974	null
2024-08-20	MegaAgent: A Practical Framework for Autonomous Cooperation in Large-Scale LLM Agent Systems	Qian Wang et.al.	2408.09955	null
2024-08-19	Boltzmann approach to collective motion via non-local visual interaction	Susumu Ito et.al.	2408.09917	null
2024-08-19	Multi-layer diffusion model of photovoltaic installations	Tomasz Weron et.al.	2408.09904	null
2024-08-19	Demystifying Reinforcement Learning in Production Scheduling via Explainable AI	Daniel Fischer et.al.	2408.09841	null
2024-08-19	Mitigating the Stability-Plasticity Dilemma in Adaptive Train Scheduling with Curriculum-Driven Continual DQN Expansion	Achref Jaziri et.al.	2408.09838	null
2024-08-20	World Models Increase Autonomy in Reinforcement Learning	Zhao Yang et.al.	2408.09807	null
2024-08-19	Anim-Director: A Large Multimodal Model Powered Agent for Controllable Animation Video Generation	Yunxin Li et.al.	2408.09787	link
2024-08-19	GoNoGo: An Efficient LLM-based Multi-Agent System for Streamlining Automotive Software Release Decision-Making	Arsham Gholamzadeh Khoee et.al.	2408.09785	null
2024-08-19	Targeted Drug Delivery: Algorithmic Methods for Collecting a Swarm of Particles with Uniform External Forces	Aaron T. Becker et.al.	2408.09729	null
2024-08-19	Algorithmic Contract Design with Reinforcement Learning Agents	David Molina Concha et.al.	2408.09686	null
2024-08-19	Multi-Agent Reinforcement Learning for Autonomous Driving: A Survey	Ruiqi Zhang et.al.	2408.09675	link
2024-08-20	BLADE: Benchmarking Language Model Agents for Data-Driven Science	Ken Gu et.al.	2408.09667	link
2024-08-19	Linear-Quadratic Mean-Field Game for Stochastic Systems with Partial Observation	Min Li et.al.	2408.09652	null
2024-08-18	Prescribed-time Convergent Distributed Multiobjective Optimization with Dynamic Event-triggered Communication	Tengyang Gong et.al.	2408.09602	null
2024-08-21	Löb-Safe Logics for Reflective Agents	Seth Ahrenbach et.al.	2408.09590	null
2024-08-18	HiAgent: Hierarchical Working Memory Management for Solving Long-Horizon Agent Tasks with Large Language Model	Mengkang Hu et.al.	2408.09559	link
2024-08-18	Enhancing Population-based Search with Active Inference	Nassim Dehouche et.al.	2408.09548	null
2024-08-18	A Logic for Policy Based Resource Exchanges in Multiagent Systems	Lorenzo Ceragioli et.al.	2408.09516	null
2024-08-18	Beyond Local Views: Global State Inference with Diffusion Models for Cooperative Multi-Agent Reinforcement Learning	Zhiwei Xu et.al.	2408.09501	null
2024-08-18	Ancestral Reinforcement Learning: Unifying Zeroth-Order Optimization and Genetic Algorithms for Reinforcement Learning	So Nakashima et.al.	2408.09493	null
2024-08-18	HySem: A context length optimized LLM pipeline for unstructured tabular extraction	Narayanan PP et.al.	2408.09434	null
2024-08-18	Value-Enriched Population Synthesis: Integrating a Motivational Layer	Alba Aguilera et.al.	2408.09407	null
2024-08-18	Optimal stopping and divestment timing under scenario ambiguity and learning	Andrea Mazzon et.al.	2408.09349	null
2024-08-17	How to Make an Action Better	Marilyn Pease et.al.	2408.09294	null
2024-08-17	GSLAMOT: A Tracklet and Query Graph-based Simultaneous Locating, Mapping, and Multiple Object Tracking System	Shuo Wang et.al.	2408.09191	null
2024-08-17	Generative Agent-Based Models for Complex Systems Research: a review	Yikang Lu et.al.	2408.09175	null
2024-08-17	Worst- and Average-Case Robustness of Stable Matchings: (Counting) Complexity and Experiments	Kimon Boehmer et.al.	2408.09160	null
2024-08-17	Training Verifiably Robust Agents Using Set-Based Reinforcement Learning	Manuel Wendl et.al.	2408.09112	null
2024-08-17	Me want cookie! Towards automated and transparent data governance on the Web	Jesse Wright et.al.	2408.09071	null
2024-08-16	On the Completeness of Conflict-Based Search: Temporally-Relative Duplicate Pruning	Thayne T Walker et.al.	2408.09028	null
2024-08-16	Visual Agents as Fast and Slow Thinkers	Guangyan Sun et.al.	2408.08862	link
2024-08-16	CPS-TaskForge: Generating Collaborative Problem Solving Environments for Diverse Communication Tasks	Nikita Haduong et.al.	2408.08853	null
2024-08-16	A Novel Quantum Algorithm for Efficient Attractor Search in Gene Regulatory Networks	Mirko Rossini et.al.	2408.08814	link
2024-08-16	CAT: Caution Aware Transfer in Reinforcement Learning via Distributional Risk	Mohamad Fares El Hajj Chehade et.al.	2408.08812	null
2024-08-16	EmoDynamiX: Emotional Support Dialogue Strategy Prediction by Modelling MiXed Emotions and Discourse Dynamics	Chenwei Wan et.al.	2408.08782	link
2024-08-16	Beyond Proportional Individual Guarantees for Binary Perpetual Voting	Yotam Gafni et.al.	2408.08767	null
2024-08-16	Rethinking Generative Semantic Communication for Multi-User Systems with Multi-Modal LLM	Wanting Yang et.al.	2408.08765	null
2024-08-16	SYMPOL: Symbolic Tree-Based On-Policy Reinforcement Learning	Sascha Marton et.al.	2408.08761	link
2024-08-16	Weighted Envy-free Allocation with Subsidy	Haris Aziz et.al.	2408.08711	null
2024-08-16	Explore-then-Commit Algorithms for Decentralized Two-Sided Matching Markets	Tejas Pagare et.al.	2408.08690	null
2024-08-24	The Fellowship of the LLMs: Multi-Agent Workflows for Synthetic Preference Optimization Dataset Generation	Samee Arif et.al.	2408.08688	link
2024-08-16	Neural Reward Machines	Elena Umili et.al.	2408.08677	link
2024-08-16	Fine-tuning LLMs for Autonomous Spacecraft Control: A Case Study Using Kerbal Space Program	Alejandro Carrasco et.al.	2408.08676	link
2024-08-16	An End-to-End Model for Photo-Sharing Multi-modal Dialogue Generation	Peiming Guo et.al.	2408.08650	null
2024-08-16	A survey on secure decentralized optimization and learning	Changxin Liu et.al.	2408.08628	null
2024-08-16	DeepREST: Automated Test Case Generation for REST APIs Exploiting Deep Reinforcement Learning	Davide Corradini et.al.	2408.08594	null

(back to top)

Name		Name	Last commit message	Last commit date
Latest commit History 50 Commits
.github		.github
assets		assets
docs		docs
.gitignore		.gitignore
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
LICENSE		LICENSE
README.md		README.md
config.yaml		config.yaml
daily_arxiv.py		daily_arxiv.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Updated on 2024.09.20

Agents

About

Releases

Packages

Languages

License

Lyz103/LLM-Agent-Paper-daily

Folders and files

Latest commit

History

Repository files navigation

Updated on 2024.09.20

Agents

About

Topics

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages