2025-08-05 11:35:01 -04:00

8.4 KiB

Thesis Ideas 2025-07-30

Following our group meeting from Monday, July 28th, Dan suggested I write down 6 ideas, and from them we shall figure out a possible topic idea that I can really start working on.

I used ChatGPT to do some of the heavy lifting based on the papers I've been reading, and leveraged the 'deep research' feature. Here are some of my favorite ideas, broken down into goals, outcomes, impact, and related papers.


Integrating Shielding into Nuclear Power Control

Goal:

The goal of this research is to develop machine learning control algorithms for nuclear power applications with strict safety guarantees.

Outcomes:

If this research is successful, I will have accomplished the following:

  1. Develop controller shielding methods for nuclear power contexts

  2. Provide concrete safety guarantees for autonomous control of a nuclear asset.

  3. ???

Impact:

Machine learning based systems have been shown to be more efficient than typical PID based controllers, and are able to learn more complex objective functions than a typical controller can. The problem with these controllers though is that they are often unexplainable. This is not acceptable for high assurance applications, where slight perturbations on inputs can yield wildly different outputs. Shielding can solve this problem, helping ensure safety of ML based controllers while not limiting their development or construction.

Relevant Papers

safe-reinforcement-learning-via-shielding evaluating-robustness-of-neural-networks-with-mixed-integer-programming


Formally Verified Neural Network Control of Control Rod System

Goals:

The goal of this research is to use formal methods to ensure that a neural network based control rod controller will never violate safety guarantees of a reactor trip system. To do this, a satisfiability modulo theory method will be applied to exhaustively search the network for potential failure modes.

Outcomes:

If this research is successful, I will have accomplished the following:

  • Build a neural network controller for real time control of a control rod system.

  • Formalize safety guarantees of shutdown margin in a satisfiability modulo theory embedding

  • Formally verify that the neural network based controller will not violate any shutdown margin restrictions

Impact:

SMT solvers and MILP formulations have been applied to neural networks to ensure that the network is resilient to input perturbations. I think we can expand this to more general considerations of the state space, especially when there are a relatively small number of states such as in power contexts. The benefit of this system is that we would get closer to saying neural network based systems can be high assurance for physical systems.

reluplex-an-efficient-smt-solver-for-verifying-deep-neural-networks evaluating-robustness-of-neural-networks-with-mixed-integer-programming formal-verification-of-neural-network-controlled-autonomous-systems


Temporal Logic Specifications for Autonomous Controller Synthesis

Goals:

The goal of this program is to use temporal logic specifications to procedurally generate autonomous supervisory controllers for a reactor system.

Outcomes:

If this research is successful, I will have accomplished the following:

  • Captured high level safety and operating requirements in a temporal logic language such as TLA+ or FRET

  • Synthesize a supervisory controller from the temporal logic specification that can be implemented on a real control system with minimal user effort.

  • Verify the supervisory controller generated adheres to safety specifications using exhaustive model checking.

Impact:


Formally Verified Runtime Monitoring and Fallback

Goals:

If this research is successful, we will be able to generate autonomous controller shields that provably adhere to specifications written with temporal logic automatically.

Outcomes:

  • Create an intermediary shield that mediates signals between an optimal control system and the physical plant (MODBUS)?

  • Translate specifications in a language like TLA+ into an executable program

  • Provide proof artifacts that automatically generated shield components will not allow an arbitrary controller to reach an unsafe state.

Impact:

Shielding is one of the preeminent ways to do safe machine learning controllers. Instead of putting the proof burden on the machine learning component, shielding creates a safe boundary in the state space where a safety controller will step in if the machine learning controller endangers the system. This technology solves a critical problem with high assurance systems: high assurance systems have critical safety requirements that make scrutiny on autonomous systems safety intense. Shielding can provide a safety barrier for the controller, allowing the architecture of the control laws to be amenable to more efficient machine learning based methods. Finally, utilizing an automatic translation from a temporal logic formulation of a speculation will allow the engineers of these systems to quickly and clearly implement a shield, without all of the cumbersome derivation.

on-using-real-time-reachability-for-the-safety-assurance-of-machine-learning-controllers enhancing-cyber-physical-system-dependability-via-synthesis-challenges-and-future-directions safe-reinforcement-learning-via-shielding


Data-Driven Fault Detection Using High-Assurance Digital Twins

(8)

Goals:

The goal of this research is to use machine learning to identify system faults of a reactor control system during runtime. A digital twin will be compared to measurements from a real plant to identify issues such as coolant losses, sensor and actuator failures, or component degredation so that safety strategic decisions about the plant can be made autonomously.

Outcomes:

For this research to be successful, I will accomplish the following:

  • Create a simulation suite for the Small Modular Advanced High Temperature Reactor (SmAHTR) to simulate fault conditions of sensors, actuators, and component degradation.

  • Develop a physics informed neural network (PINN) approach to evaluate physics discrepancies in measured signals and to estimate physically relevant parameters to determine real system divergence from the nominal plant.

  • Realize a proof of concept autonomous controller than can react to fault conditions by switching to different control modes rather than only responding with reactor shutdown.

Impact:

The nuclear energy industry's largest expense is operations and maintenance (O&M). These costs include typical reactor repair and refueling, the labor involved to complete such maintenance, and finally the labor involved in operating the reactor itself. Currently the largest of these O&M expenses is the labor and part cost used in maintenance, while large nuclear reactor facilities require a modest reactor operator budget per megawatt of energy produced. The advent of small modular reactors (SMRs) and microreactors (MRs) will change these economics significantly.

As SMRs and MRs become more common, the cost of repair and maintenance should reduce dramatically as nuclear power componnets will become modular, replaceable parts instead of the bespoke reactor designs currently operating. Operator wages, however, can be expected to increase without introducing greater controller autonomy. SMRs and MRs are much smaller output designs per reactor core, and if they are required to employ the same size reactor operator team as a conventional large reactor, will suffer from much larger operator expense per megawatt. Greater controller autonomy can solve this problem by unloading some reactor control responsibilities from the operator, and therein reduce labor consumption.

<# TO DO #> Finally reactor safety can be improved by greater autonomy yada yada find some reasons to back this up.


Verified Adaptive Control

(10)

Goals:

Outcomes:

Impact: