Prehensile and Non-Prehensile Robotic Pick-and-Place of Objects in Clutter Using Deep Reinforcement Learning

Open Access

29 January 2023

journal article
research article
Published by MDPI AG in Sensors

Vol. 23 (3), 1513
https://doi.org/10.3390/s23031513

Abstract

In this study, we develop a framework for an intelligent and self-supervised industrial pick-and-place operation for cluttered environments. Our target is to have the agent learn to perform prehensile and non-prehensile robotic manipulations to improve the efficiency and throughput of the pick-and-place task. To achieve this target, we specify the problem as a Markov decision process (MDP) and deploy a deep reinforcement learning (RL) temporal difference model-free algorithm known as the deep Q-network (DQN). We consider three actions in our MDP; one is ‘grasping’ from the prehensile manipulation category and the other two are ‘left-slide’ and ‘right-slide’ from the non-prehensile manipulation category. Our DQN is composed of three fully convolutional networks (FCN) based on the memory-efficient architecture of DenseNet-121 which are trained together without causing any bottleneck situations. Each FCN corresponds to each discrete action and outputs a pixel-wise map of affordances for the relevant action. Rewards are allocated after every forward pass and backpropagation is carried out for weight tuning in the corresponding FCN. In this manner, non-prehensile manipulations are learnt which can, in turn, lead to possible successful prehensile manipulations in the near future and vice versa, thus increasing the efficiency and throughput of the pick-and-place task. The Results section shows performance comparisons of our approach to a baseline deep learning approach and a ResNet architecture-based approach, along with very promising test results at varying clutter densities across a range of complex scenario test cases.

Keywords

Funding Information

Science Foundation Ireland (SFI 16/RC/3918)

This publication has 43 references indexed in Scilit:

Real-time grasp detection using convolutional neural networks
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2015
Deep learning for detecting robotic grasps
The International Journal of Robotics Research, 2015
Human-level control through deep reinforcement learning
Nature, 2015
A Planning Framework for Non-Prehensile Manipulation under Clutter and Uncertainty
Autonomous Robots, 2012
From caging to grasping
The International Journal of Robotics Research, 2012
Virtual Robot Experimentation Platform V-REP: A Versatile 3D Robot Simulator
Lecture Notes in Computer Science, 2010
Planar sliding with dry friction Part 1. Limit surface and moment function
Wear, 1991
Learning representations by back-propagating errors
Nature, 1986
Mechanics and Planning of Manipulator Pushing Operations
The International Journal of Robotics Research, 1986
Robust Estimation of a Location Parameter
The Annals of Mathematical Statistics, 1964

Cited by 5 articles