- Enjoy yourself :D
- Tags: RL, IL, meta-learning, HRL, policy-based, value-based, model-based, model-free, on-policy, off-policy, etc.
- [ICRA 2018] 1710.01813 - Neural Task Programming: Learning to Generalize Across Hierarchical Tasks
- [AAAI 2018] 1710.02298 - Rainbow: Combining Improvements in Deep Reinforcement Learning
- [ICLR 2018] 1710.03641 - Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments
- [ICLR 2018] 1710.09767 - Meta Learning Shared Hierarchies
- [NIPS 2017] 1711.03817 - Learning with Options that Terminate Off-Policy
- 1711.06025 - Learning to Compare: Relation Network for Few-Shot Learning
- [NIPS 2017] 1711.10314 - Crossmodal Attentive Skill Learner
- 1802.01557 - One-Shot Imitation from Observing Humans via Domain-Adaptive Meta-Learning
- 1802.04821 - Evolved Policy Gradients
- 1803.02999 - On First-Order Meta-Learning Algorithms
- [ICLR 2017] Training Agent for First-Person Shooter Game with Actor-Critic Curriculum Learning
- [AAAI 2017] An Efficient Approach to Model-Based Hierarchical Reinforcement Learning
- [ICLR 2018] Reinforcement Learning From Imperfect Demonstrations
- [ICML 2017] 1703.02702 - Robust Adversarial Reinforcement Learning
- [ICML 2017] 1706.05064 - Zero-Shot Task Generalization with Multi-Task Deep Reinforcement Learning
- 1708.05866 - A Brief Survey of Deep Reinforcement Learning
- [NIPS 2017] 1710.03592 - Meta Inverse Reinforcement Learning via Maximum Reward Sharing for Human Motion Analysis
- 1802.03596 - Deep Meta-Learning: Learning to Learn in the Concept Space
- [ICLR 2018] 1802.09081 - Temporal Difference Models: Model-Free Deep RL for Model-Based Control
- 1802.10567 - Learning by Playing - Solving Sparse Reward Tasks from Scratch
- [ICLR 2018] 1803.00933 - Distributed Prioritized Experience Replay
- [ICLR 2018] Extending Robust Adversarial Reinforcement Learning Considering Adaptation and Diversity
- [ICLR 2018] Learning to Teach
- [ICLR 2018] Learning an Embedding Space for Transferable Robot Skills
- 1706.09529 - Learning to Learn: Meta-Critic Networks for Sample Efficient Learning
- 1710.03463 - Learning to Generalize: Meta-Learning for Domain Generalization
- 1712.00948 - Hierarchical Actor-Critic
- [NIPS 2017] 1712.08266 - Federated Control with Hierarchical Multi-Agent Deep Reinforcement Learning
- [ICLR 2018] 1801.08930 - Recasting Gradient-Based Meta-Learning as Hierarchical Bayes
- 1802.07245 - Meta-Reinforcement Learning of Structured Exploration Strategies
- 1802.09564 - Reinforcement and Imitation Learning for Diverse Visuomotor Skills
- [ICLR 2018] Zero-Shot Visual Imitation
- 1506.01497 - Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
- 1512.03385 - Deep Residual Learning for Image Recognition
- 1608.06993 - Densely Connected Convolutional Networks
- 1609.07769 - Deep Joint Rain Detection and Removal from a Single Image
- 1611.10012 - Speed/Accuracy Trade-Offs for Modern Convolutional Object Detectors
- 1612.08242 - YOLO9000: Better, Faster, Stronger
- 1703.06870 - Mask R-CNN
- 1704.05548 - Annotating Object Instances with a Polygon-RNN
- 1707.01629 - Dual Path Networks
- 1707.06168 - Channel Pruning for Accelerating Very Deep Neural Networks
- 1708.01241 - DSOD: Learning Deeply Supervised Object Detectors from Scratch
- GAN_series
- Removing Rain from Single Images via a Deep Detail Network
- Robust Hand Detection in Vehicles