Obsidian/Zettelkasten/Literature Notes/Notes on Papers/Runtime Safety Assurance Using Reinforcement Learning.md

1.0 KiB

First Pass

Category: Simulation paper that talks about using a recovery controller in combination with a nominal controller that is learned. The boundary to swtich between the two is a learned boundary.

Context: Drones

Correctness: Not very. They do really well in the intro and methodology, but shit hits the fan when it comes to the results. They don't explain things well, and also don't really establish why their learned boundary between nominal and recovery controllers should be successful.

Contributions: The biggest contributions these guys make is demonstrating feasibility of their reinforcement learned switching from the recovery controller and the nominal controller.

Clarity: Well written until the whole thing came apart at the end.

Second Pass

I read this a second time but I don't think it's worth it.

Their main contribution is trying to use RL to learn when to switch to a recovery controller.

Third Pass

Recreation Notes:

Hidden Findings:

Weak Points? Strong Points?