
(video) International Conference on Machine Learning (ICML) 2020 Workshop: Feedback in Imitation Learning

In most AI domains, like image search, “off-the-shelf” supervised learning methods are successful in training machine learning models. However, these methods fall short in training robots, like self-driving cars, because a self-driving car, the “learner,” relies on a continuous stream of time-series data to make decisions and take actions in the physical world. Those actions change the learner’s future inputs by altering not only the vehicle’s own location and velocity, but also how other vehicles in the scene respond.

Through this lens, off-the-shelf supervised machine learning methods aren’t enough for learning decision-making, because “mistakes” often have unintended consequences that feed back into further “mistaken” actions. For example, if a self-driving vehicle mistakenly decides to conduct a lane change, it will start to move laterally. The vehicle’s new position, partway into the other lane, can then reinforce the decision to continue the lane change, even though it was originally undesirable. This problem, sometimes called the “feedback effect,” has been well documented in the AI and robotics community.
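To make the feedback effect concrete, here is a minimal, hypothetical sketch in Python (not from the talk): a one-dimensional lane-keeping toy in which a cloned policy’s small supervised-learning error pushes the vehicle into states the expert never visited, where the policy performs worse still.

```python
# Hypothetical 1-D lane-keeping toy. State x is the lateral offset
# from lane center; the expert always corrects toward 0, so its
# training data only covers states near 0.

def expert_policy(x):
    return -0.5 * x                  # always steers back toward center

def cloned_policy(x):
    if abs(x) < 0.2:                 # near the training distribution:
        return -0.5 * x + 0.15       # small supervised-learning error
    return 0.2 * x                   # off-distribution: badly wrong

def rollout(policy, steps=30):
    x, trace = 0.0, []
    for _ in range(steps):
        x = x + policy(x)            # the action changes the next input
        trace.append(x)
    return trace

print("expert final offset:", rollout(expert_policy)[-1])  # stays at 0.0
print("clone final offset: ", rollout(cloned_policy)[-1])  # drifts far away
# The clone's small error feeds back: it drifts into states the expert
# never demonstrated, where its predictions are even worse.
```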

Recent literature in the field (de Haan et al., 2019) proposes that the feedback effect stems from “causal confusion” via a “causal confound” (Pearl et al., 2016). The implication is that the causal structure is obscured by correlated inputs to the learning algorithm, i.e., the learner is biased by over-indexing on correlated features. However, in this talk, lead engineers from Aurora’s Motion Planning team, Arun Venkatraman and Sanjiban Choudhury, observe that the cited examples do not exhibit causal confounding in the statistical sense. Instead, they posit that the observed issues are fundamentally a result of distributional shift in the features caused by feedback.
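Reusing the toy above, a hedged illustration of their point: the failure shows up directly as covariate shift between the feature distribution the learner was trained on (expert rollouts) and the one it induces at test time (its own closed-loop rollouts), with no hidden confounder needed to explain it. The KL estimate below is one hypothetical way to quantify that gap.

```python
import numpy as np

def feature_histogram(states, bins=20, rng=(-2.0, 2.0)):
    hist, _ = np.histogram(states, bins=bins, range=rng, density=True)
    return hist + 1e-8               # smooth so the KL term is defined

def kl_divergence(p, q):
    p, q = p / p.sum(), q / q.sum()
    return float(np.sum(p * np.log(p / q)))

train_states = rollout(expert_policy)   # what supervised training sees
test_states = rollout(cloned_policy)    # what the deployed learner sees
print("KL(train || rollout):",
      kl_divergence(feature_histogram(train_states),
                    feature_histogram(test_states)))
# A large value signals covariate shift induced by feedback, not a
# statistical confound between correlated input features.
```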

Arun and Sanjiban introduce ALICE, an algorithmic framework that leverages a simulation engine to measure and counter this feedback effect. They go on to show preliminary results of ALICE on a prototypical controls problem and discuss the spectrum of feedback problems, and the difficulty of solving them, across a variety of practical setups.
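The talk covers ALICE’s specifics; as background only, the sketch below shows the general shape of interactive imitation learning with a simulator, in the spirit of DAgger (Ross et al., 2011) rather than ALICE itself: roll out the current learner to expose the state distribution it induces, relabel those states with expert actions, and retrain.

```python
# Generic interactive imitation-learning loop (a DAgger-style sketch,
# NOT ALICE itself). `simulator`, `expert`, and `fit` are hypothetical
# stand-ins: a simulator with reset() / step(action) -> state, an
# expert that labels states with actions, and a supervised learner.

def interactive_imitation(simulator, expert, fit, rounds=5, horizon=100):
    dataset = []                                # (state, expert action) pairs
    policy = expert                             # round 0 rolls out the expert
    for _ in range(rounds):
        state = simulator.reset()
        for _ in range(horizon):
            dataset.append((state, expert(state)))  # expert relabels the state
            state = simulator.step(policy(state))   # follow the learner, so the
                                                    # data matches its distribution
        policy = fit(dataset)                   # retrain on the aggregated data
    return policy
```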
