Can Robots Learn Complicated Tasks from Few Demonstrations?

Written By : Market Trends

Published:29th Nov, 2020 at 8:05 AM

The new system addresses the existing challenges of reinforcement learning in robots.

Researchers at the University of Southern California (USC) have developed a system that can perform complicated tasks with only a few demonstrations. The research paper titled, "Learning from Demonstration Using Signal Temporal Logic", was presented at the Conference of Robot Learning last week.

The research paper cites that the system addresses the existing limitations of reinforcement learning in robots such as imperfections, safety, and interpretability. Researchers have used Signal Temporal logic-so that robot can learn tasks such as driving a car and cooking the food from only a few demonstrations.

Aniruddh Puranic, a PhD student in computer science at the USC Viterbi School of Engineering and the lead author of the research says, "Many machine learning and reinforcement learning systems require large amounts of data and hundreds of demonstrations—you need a human to demonstrate over and over again, which is not feasible,"

Limitations of the existing Learning from Demonstration (LfD) model

The existing learning-from-demonstration-model in robots involves reinforcement learning. Over the years existing LfD has enabled researchers to draw insights on specifications of the robot. However, there are certain limitations to this model:

The control policy of the existing LfD model is unsafe and lacks robustness.
Not all demonstrations are equal. While some demonstrations can indicate desired results, others need human-assistance to deliver quality results.
Lack of required metrics to evaluate the quality of demonstrations or tasks adds to the third challenge of LfD.
Since demonstrations do not have specific safety conditions for the robot, the demonstrations get optimized to achieve a particular objective.

Features of the new system

Introduced by Arthur Prior under the name Tense Logic in 1960, the Signal Temporal Logic is a mathematical language that enables robotic reasoning about current and future outcomes.

Researchers have earlier used STL in cyber-physical system applications such as robots and self-driving cars, to evaluate the logic and reasoning during an emergency. The existing STL system delivers positive outcomes for evaluating the temporal behaviour in robots such as the multi-dimensional signal consisting of the robot's position, joint angles, angular velocities, and linear velocity, amongst others.

It also evaluates and automatically rank demonstrations based on their fitness of the specific task performed, and enhance the control policy in robots. This system also estimates the quality of demonstrations so that robots cannot learn from undesired examples.

The researchers states that "The key insight of this work is that the use of even partial STL specifications can help in a mechanism to automatically evaluate and rank demonstrations, leading to learning robust control policies and inferring rewards to be used in a model-free RL setting."

Unlike the traditional model, this system enables the robot to learn from both its success and failure, with only a few demonstrations. Through this system, the robots can learn and perform the task without requiring logic and can draw out its conclusion about the performance.

The researchers conclude that this approach will provide new directions for safety and interpretability of robot control policies and verification of model-free learning methods. It is also well suited for applications where the maps are known beforehand but there exist dynamic obstacles in the map, such as for robots in household and warehouse environments, space exploration rovers, etc.

Join our WhatsApp Channel to get the latest news, exclusives and videos on WhatsApp

_____________

Disclaimer: Analytics Insight does not provide financial advice or guidance on cryptocurrencies and stocks. Also note that the cryptocurrencies mentioned/listed on the website could potentially be risky, i.e. designed to induce you to invest financial resources that may be lost forever and not be recoverable once investments are made. This article is provided for informational purposes and does not constitute investment advice. You are responsible for conducting your own research (DYOR) before making any investments. Read more about the financial risks involved here.

Can Robots Learn Complicated Tasks from Few Demonstrations?

The new system addresses the existing challenges of reinforcement learning in robots.

Limitations of the existing Learning from Demonstration (LfD) model

Features of the new system

Also Read

Solana (SOL) Tests Critical $80 Support, But Taurox (TAUX)Deliver 50% Returns to Investors in Less Than a Month

21Shares Dogecoin (DOGE) ETF Launches on Nasdaq As Analysts Predict Taurox (TAUX) Will Reach $1 Faster

Early Birds See 34,900% Growth as BlockDAG Hits $0.35 on CMC! Zcash Price Battles $200 Support & ETH Price Shows Mixed Signs

Why Altcoins Follow the Same Cycle Pattern in Crypto Markets

Crypto Scams are Rising: Learn How to Protect Your Funds