This commit is contained in:
Dane Sabo 2025-08-08 13:56:35 -04:00
parent 1544243638
commit b23db588d5
3 changed files with 47 additions and 20 deletions

View File

@ -0,0 +1,3 @@
# Notes on [[thesis-ideas-2025-07-30]]

View File

@ -0,0 +1,11 @@
---
---
# 2025-08-06
Today I have not gotten too much done, but I did get here
early today. Today, I'll sell the truck and work on Sam's
car. I saw Patrick today too, which was nice. Robert's stuff
is all gone.
My main focus for today is to finish the thesis ideas and
tune up the writing. That'll be enough.

View File

@ -15,32 +15,45 @@ ___________________________________________________________
## **Integrating Shielding into Nuclear Power Control**
### Goal:
The goal of this research is to develop machine learning
control algorithms for nuclear power applications with
strict safety guarantees.
The goal of this research is develop machine learning
enabled control algorithims for nuclear power applications
that incoporate shielding: a formal guarantee of adherence
to system specifications without augmenting the machine
learning process.
### Outcomes:
If this research is successful, I will have accomplished the
following:
For this research to be successful, I will accomplish the
following:
1. Develop controller shielding methods for nuclear power
contexts
1. Identify key controllers in a nuclear power context with
the most benefit from using an ML-based controller
2. Provide concrete safety guarantees for autonomous control
of a nuclear asset.
2. Translate regulatory and system level requirements into a
formal specification to synthesize a controller 'shield'.
This shield monitors the ML controller and intervenes
whenever a requirement is predicted to be violated.
3. ??? <!TODO!>
3. Evaluate performance of the ML controller with attached
shield, while assessing the amount of shield useage for
different operating scenarios (power up, shut down, regular
load following)
### Impact:
Machine learning based systems have been shown to be more
efficient than typical PID based controllers, and are able to
learn more complex objective functions than a typical controller
can. The problem with these controllers though is that they are
often unexplainable. This is not acceptable for high assurance
applications, where slight perturbations on inputs can yield
wildly different outputs. Shielding can solve this problem,
helping ensure safety of ML based controllers while not limiting
their development or construction.
Machine learning controllers can outperform PID and
rule-based controllers by adapting to nonlinear dynamics,
optimizing over multi-objective cost functions, and changing
plant conditions. But, these ML controllers are often
*unexplainable*, meaning that their global behavior is not
easily understood.This unexplainability prevents ML based
controllers from being used in high-assurance usecases such
as nuclear power. Shielding can address this issue, by
providing a formal runtime assurance, allieviating the
burden of explainability away from the machine learning
algorithm. This work would further bring regulatory
requiremnts into the formal design of control systems and
help bridge the gap between high assurance systems and the
start of the art in control.
### Relevant Papers
[[safe-reinforcement-learning-via-shielding]]
@ -118,7 +131,7 @@ ___________________________________________________________
### Goals:
If this research is successful, we will be able to generate
autonomous controller shields that provably adhere to specifications
written with temporal logic automatically.
written with temporal logic.
### Outcomes:
- Create an intermediary shield that mediates signals between an