FusionCore v0
Physics-Aware Predictive Maintenance Pipeline
Explore the full RUL pipeline from zero-leakage preprocessing and physics-aware feature engineering through baseline modelling, explainability, and operational impact.
FusionCore v0 is a completed, five-phase predictive maintenance pipeline for Remaining Useful Life estimation of commercial turbofan engines. It operates on the NASA C-MAPSS corpus across all four subsets, pooled into a unified dataset designated FD00u, comprising 709 run-to-failure engine trajectories and approximately 160,000 flight cycles.
The programme objective is PHM-grade RUL prediction under aerospace safety constraints: not merely minimising symmetric error metrics such as RMSE and MAE, but satisfying the NASA asymmetric safety scoring function, which penalises late predictions far more strongly than early ones.
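For reference, the standard C-MAPSS/PHM08 form of this scoring function, written for per-engine error $d_i = \widehat{\mathrm{RUL}}_i - \mathrm{RUL}_i$, is:

$$
S = \sum_i s_i, \qquad
s_i =
\begin{cases}
e^{-d_i/13} - 1, & d_i < 0 \ \text{(early prediction)} \\
e^{\,d_i/10} - 1, & d_i \ge 0 \ \text{(late prediction)}
\end{cases}
$$

The smaller denominator on the late branch makes overestimating remaining life exponentially more expensive than underestimating it.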
Commercial turbofan engines degrade continuously due to HPC erosion, thermal fatigue, and compound fan-blade degradation. Traditional time-based maintenance is conservative and inefficient. Condition-based maintenance, enabled by robust RUL estimation, defers intervention until the engine’s actual degradation state warrants action.
The core technical challenge is separating flight-regime-driven variance from genuine degradation signal. Without regime normalisation, a model trained on multi-regime data learns altitude, not wear.
| Phase | Title | Key Output |
|---|---|---|
| 1 | EDA & Physics Grounding | Active sensor set, variance audit, ACF/PACF, lifecycle trajectories |
| 2 | Zero-Leakage Normalisation | FD00u unified manifold, K-Means regime clustering (K = 6), regime Z-score pipeline |
| 3 | Physics-Aware Feature Engineering | 91-feature vector, kinematic expansion, virtual sensors, fatigue indices, survival analysis |
| 4 | XGBoost Baseline + SOTA Benchmarking | XGBoost baseline, TFT and N-HiTS training, DeepAR disqualification, forensic audit |
| 5 | Model Evaluation & Sign-Off | Four-quadrant aerospace evaluation, financial impact analysis |
| Metric | Value |
|---|---|
| RMSE | 14.85 cycles |
| MAE | 10.34 cycles |
| NASA Asymmetric Score | 4,336.3 |
| Critical-Band F2 (β = 2) | 0.9339 |
| Projected net saving (100-aircraft fleet) | +$13,818,171 per annum |
The neural comparators TFT and N-HiTS both returned a NASA Score of 10,061,188.1, a ratio of 2,320:1 relative to XGBoost, and are therefore classified as operationally non-viable in their v0 configurations. DeepAR was formally disqualified on architectural grounds before evaluation. All six Phase 5 programme gates pass.
| Subset | Fault Mode | Operating Regimes | Train Engines | Test Engines | Total Cycles | Median Life | Max Life |
|---|---|---|---|---|---|---|---|
| FD001 | HPC degradation (single) | 1 | 100 | 100 | 20,631 | 199 | 362 |
| FD002 | HPC degradation (single) | 6 | 260 | 259 | 53,759 | 199 | 378 |
| FD003 | HPC + Fan compound | 1 | 100 | 100 | 24,720 | 220 | 525 |
| FD004 | HPC + Fan compound | 6 | 249 | 248 | 61,249 | 234 | 543 |
| FD00u | Combined | — | 709 | 707 | 160,359 | — | — |
The most consequential Phase 1 finding is regime-conditional sensor behaviour: sensors such as s1 and s5 are effectively dead in single-regime subsets but become active in multi-regime subsets because they encode altitude-driven variance rather than degradation directly.
The rule is simple: if σ² > τ the sensor is active; if σ² ≤ τ the sensor is classified as dead. A minimal screen is sketched after the classification table below.
| Sensor | Physical Quantity | Dead in FD001 | Active in FD002 | Classification |
|---|---|---|---|---|
| s1 | T2 — Fan Inlet Temperature | ✓ | ✓ | Regime-conditional |
| s5 | P2 — Fan Inlet Pressure | ✓ | ✓ | Regime-conditional |
| s6 | P15 — Bypass-Duct Total Pressure | ✓ | ✓ | Regime-conditional |
| s10 | epr — Engine Pressure Ratio | ✓ | ✓ | Regime-conditional |
| s16 | farB — Burner Fuel-Air Ratio | ✓ | ✗ | Globally dead |
| s18 | Nf_dmd — Demanded Fan Speed | ✓ | ✗ | Globally dead |
| s19 | PCNfR_dmd — Demanded Corrected Fan Speed | ✓ | ✗ | Globally dead |
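A minimal sketch of the variance screen, assuming per-subset training DataFrames (fd001_train, fd002_train) and an illustrative threshold τ rather than the project's actual value:

```python
import pandas as pd

SENSORS = [f"s{i}" for i in range(1, 22)]
TAU = 1e-6  # illustrative variance threshold τ, not the project's value

def sensor_status(df: pd.DataFrame, tau: float = TAU) -> pd.Series:
    """Label each sensor 'active' if its variance exceeds τ, else 'dead'."""
    return df[SENSORS].var().apply(lambda v: "active" if v > tau else "dead")

# A sensor dead in FD001 but active in FD002 is regime-conditional;
# a sensor dead in both is globally dead.
status = pd.concat({"FD001": sensor_status(fd001_train),
                    "FD002": sensor_status(fd002_train)}, axis=1)
```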
ACF analysis on s4 (EGT) remains significant through lag 17, while PACF exhibits a dominant lag-1 spike. That directly justifies first-order kinematic velocity features in Phase 3.
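The lag analysis is reproducible with statsmodels on a single engine's s4 trace (engine_df is an assumed per-engine DataFrame):

```python
from statsmodels.tsa.stattools import acf, pacf

x = engine_df["s4"].to_numpy()                   # one engine's full lifecycle trace
acf_vals, acf_ci = acf(x, nlags=30, alpha=0.05)  # CI bands expose the lag-17 cutoff
pacf_vals = pacf(x, nlags=30)                    # dominant spike expected at lag 1
```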
Multi-regime data must be normalised by operating regime, but the regime labels are absent. K-Means is therefore fitted on operational settings only, with K = 6, matching the documented physical regime count.
This is the pipeline’s zero-leakage constraint: validation and test statistics are never used when fitting normalisation parameters.
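A minimal sketch of the step, assuming pandas DataFrames train, val and test with illustrative column names; the regime model and per-regime statistics are fitted on training engines only and then frozen:

```python
import pandas as pd
from sklearn.cluster import KMeans

SETTINGS = ["setting_1", "setting_2", "setting_3"]
SENSORS = [f"s{i}" for i in range(1, 22)]

# Fit K-Means on TRAINING operational settings only (K = 6 regimes).
km = KMeans(n_clusters=6, n_init=10, random_state=42).fit(train[SETTINGS])
train["regime"] = km.labels_

# Per-regime mean/std learned on train, then frozen. Dead sensors are
# assumed dropped beforehand; the guard avoids division by zero anyway.
mu = train.groupby("regime")[SENSORS].mean()
sd = train.groupby("regime")[SENSORS].std().replace(0.0, 1.0)

def regime_zscore(df: pd.DataFrame) -> pd.DataFrame:
    """Assign regimes with the frozen model, then z-score per regime."""
    out = df.copy()
    out["regime"] = km.predict(out[SETTINGS])    # assign only, never refit
    out[SENSORS] = ((out[SENSORS].values - mu.loc[out["regime"]].values)
                    / sd.loc[out["regime"]].values)
    return out

val_z, test_z = regime_zscore(val), regime_zscore(test)
```

The resulting z-scores then carry the SPC interpretation below.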
| Z-Score Range | Physical Interpretation | SPC Zone |
|---|---|---|
| |z| < 2 | Healthy variance band | Inner control zone |
| |z| ≥ 2 | Warning-limit breach | Warning zone |
| |z| ≥ 3 | Control-limit breach | Functional failure threshold |
| Partition | Engines | Rows | Purpose |
|---|---|---|---|
| Internal Training | 567 | 129,331 | Model fitting and preprocessing-parameter estimation |
| Internal Validation | 142 | 31,028 | Model selection and tuning |
| NASA Official Test | 707 | 104,897 | Final Iron Wall evaluation |
| Step | Component | Features Added | Cumulative Total |
|---|---|---|---|
| Core Manifold | 3 settings + 21 sensors | 24 | 24 |
| Kinematic Expansion | Δx_t, rolling mean, rolling std on 17 active sensors | 51 | 75 |
| Virtual Sensors | CPR, E_thermal, EGT drift | 3 | 78 |
| Cumulative Fatigue Indices | Miner’s Rule proxies | 3 | 81 |
| Gap Resolution Features | Additional resolved features | 10 | 91 |
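A sketch of the kinematic expansion step: three derived channels per active sensor (17 × 3 = 51 features), grouped by engine so no window crosses an engine boundary. Column names and the window length are illustrative:

```python
import pandas as pd

def kinematic_expand(df: pd.DataFrame, active_sensors: list[str],
                     window: int = 10) -> pd.DataFrame:
    """Add Δx_t, rolling mean and rolling std for each active sensor."""
    out = df.sort_values(["unit_id", "cycle"]).copy()
    g = out.groupby("unit_id")
    for s in active_sensors:
        out[f"{s}_vel"] = g[s].diff()            # first-order velocity Δx_t
        out[f"{s}_rmean"] = g[s].transform(
            lambda x: x.rolling(window, min_periods=1).mean())
        out[f"{s}_rstd"] = g[s].transform(
            lambda x: x.rolling(window, min_periods=1).std())
    return out
```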
The C-index reaches 0.6398, indicating moderate discrimination and motivating future validation against continuous-flight N-CMAPSS data.
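The concordance check itself is a one-liner with lifelines; variable names here are assumptions:

```python
from lifelines.utils import concordance_index

# Higher hazard should rank earlier failures first, hence the negation.
c_index = concordance_index(event_times=engine_life_cycles,
                            predicted_scores=-cox_risk_scores)
```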
| Physical Sensor | Corrected Sensor | Physical Relationship | Pearson r | VIF |
|---|---|---|---|---|
| s8 — Nf | s13 — NRf | NRf = Nf / √θ | 0.9447 | 9.3 |
| s9 — Nc | s14 — NRc | NRc = Nc / √θ | 0.9464 | 9.6 |
XGBoost is configured with 1,000 estimators, learning rate 0.05, max depth 6, subsample 0.8, colsample_bytree 0.8, and early stopping patience of 50 rounds.
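Restated as a runnable sketch (matrix names are assumed; the hyperparameters are the documented ones):

```python
from xgboost import XGBRegressor

model = XGBRegressor(
    n_estimators=1000,
    learning_rate=0.05,
    max_depth=6,
    subsample=0.8,
    colsample_bytree=0.8,
    early_stopping_rounds=50,   # patience of 50 rounds
    random_state=42,
)
model.fit(X_train, y_train, eval_set=[(X_val, y_val)], verbose=False)
```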
| Metric | Training Set | Validation Set |
|---|---|---|
| RMSE | 9.68 | 14.99 |
| MAE | 6.47 | 9.96 |
| NASA Score | 213,795.6 | 176,641.2 |
The baseline survives both the Target Null Test and the Physical Perturbation Reactivity Test, indicating the model is learning genuine degradation physics rather than spurious correlations.
DeepAR is disqualified because it cannot ingest the exogenous 91-feature manifold properly and would violate zero-leakage constraints by conditioning on past target values at inference time.
| Model | Validation RMSE | Test RMSE | Validation NASA | Test NASA |
|---|---|---|---|---|
| TFT | 57.88 | 63.40 | 443,438.3 | 10,061,188.1 |
| N-HiTS | 57.88 | 63.40 | 443,438.3 | 10,061,188.1 |
The identical scores to four decimal places strongly suggest a checkpoint-loading collision and are therefore flagged as a blocking item for FusionCore v1.
The official NASA test set is accessed only in Phase 5, with all Phase 2 and Phase 3 artefacts replayed in strict forward-only mode. No retraining is permitted after test contact.
| Model | RMSE | MAE | NASA Score | Critical-Band F2 | Net Save | Status |
|---|---|---|---|---|---|---|
| XGBoost | 14.85 | 10.34 | 4,336.3 | 0.9339 | +$13,818,171 | RECOMMENDED ★ |
| TFT | 63.40 | 48.71 | 10,061,188.1 | 0.0532 | −$1,631,210 | Not viable (v0) |
| N-HiTS | 63.40 | 48.71 | 10,061,188.1 | 0.0532 | −$1,631,210 | Collision unresolved |
| DeepAR | — | — | — | — | — | DISQUALIFIED |
XGBoost correctly classifies 147 of 157 critical-band engines with zero Healthy-to-Critical misclassifications. By contrast, the neural comparators collapse toward ceiling-saturated predictions and miss the critical band almost entirely.
Under baseline MRO assumptions, deploying FusionCore v0 with its XGBoost recommendation yields a projected annual saving of +$13,818,171 for a 100-aircraft fleet.
| Gap ID | Priority | Description | Resolution Path |
|---|---|---|---|
| G-V1-01 (G5) | HIGH | True Yeo-Johnson transformation required for CPR and E_thermal | Fit on X_train only and produce dedicated neural matrices |
| G-V1-02 (G9) | MANDATORY | s16 ablation via SHAP | Residual analysis to confirm whether s16 contributes signal or noise |
| G-V1-03 (G3) | MANDATORY | Terminal class imbalance | Evaluate subset-stratified sampling if SHAP residuals show terminal bias |
FusionCore v1 inherits the Phase 3 FD00u parquet outputs directly, using the identical 80:20 engine-boundary split to preserve strict comparability with FusionCore v0.
FusionCore v1 is the PiNet programme: Predictive In-orbital Network, a compact physics-guided temporal neural architecture for commercial turbofan Remaining Useful Life estimation. It takes the validated 91-feature FD00u manifold produced by FusionCore v0 Phase 3 unchanged, then learns over 30-cycle engine windows using a long-memory Temporal Convolutional Network and a small physics-token MLP.
The output is operational rather than academic: one head predicts scalar RUL, while a second head classifies the engine state into Healthy, Warning, or Critical. The aim is to preserve the physics fidelity of the v0 XGBoost baseline while adding a trainable temporal backbone that can learn degradation trajectory structure directly.
| Area | FusionCore v1 Scope |
|---|---|
| Dataset | NASA C-MAPSS only: FD001, FD002, FD003, and FD004 pooled as FD00u. |
| Input | FusionCore v0 Phase 3 91-feature parquet output, with neural-only Yeo-Johnson treatment for high-skew virtual sensors. |
| Architecture | Two-branch deterministic PiNet backbone: 30-cycle TCN plus physics-token MLP, fused into a shared embedding. |
| Comparator | FusionCore v0 Phase 4 XGBoost carried forward unchanged: RMSE 14.85, NASA Score 4,336.3, Critical-band F2 0.9339. |
| Out of scope | N-CMAPSS deployment, real-time fleet dashboards, and full probabilistic deployment infrastructure. |
PiNet does not discard the engineering work from v0. It uses the same physics-aware manifold and then gives the most interpretable gas-path quantities their own forward-graph location. That matters because a TCN can see all 91 features, but it is not guaranteed to preserve compressor pressure ratio, isentropic efficiency, cumulative fatigue, Cox hazard, and regime-weight information in a way an engineer can audit.
The v1 roadmap is therefore deliberately conservative: fixed hyperparameters from PHM literature, no informal tuning against the NASA test set, a single official test-set touch at programme close, and one clearly pre-declared advanced ablation using triplet metric learning with Shannon regularisation.
| Metric | XGBoost v0 Reference | PiNet v1 Aim |
|---|---|---|
| RMSE | 14.85 cycles | Beat or match without weakening safety performance |
| NASA Asymmetric Score | 4,336.3 | Priority metric when RMSE and operational risk disagree |
| Critical-band F2 | 0.9339 | Maintain or improve safety-weighted terminal detection |
| Critical recall | 0.9363 | Strong pass if PiNet recall is at least the XGBoost recall |
A model that improves headline RMSE but loses the Critical band is not treated as operationally superior. The roadmap is explicit: safety-sensitive recall and NASA asymmetric cost govern the final judgement.
FusionCore v1 introduces PiNet, the Predictive In-orbital Network. The programme is built around three principles: simplification, comparability, and falsifiability. It deliberately avoids a sprawling experimental surface so that any performance difference against the v0 XGBoost reference can be explained rather than waved away.
| Directive | Implementation in v1 |
|---|---|
| Consume the v0 evidence base | Use the FusionCore v0 Phase 3 91-feature FD00u parquet output unchanged as the fixed input manifold. |
| Build the PiNet backbone | Pair a long-memory TCN over 30-cycle windows with a small physics-token MLP over selected gas-path and degradation scalars. |
| Train one primary model | Optimise a two-term loss combining NASA asymmetric RUL cost with class-weighted cross-entropy for Healthy, Warning, and Critical risk bands. |
| Evaluate once | Touch the NASA C-MAPSS official test set exactly once at programme close, reporting RMSE, NASA Score, MAE, and Critical precision/recall/F2. |
| Preserve a novelty hook | Run one pre-declared compute-bounded ablation with triplet metric learning and Shannon regularisation. |
The dataset is deliberately narrow: NASA C-MAPSS, subsets FD001-FD004, pooled as FD00u. N-CMAPSS is not included in v1. This keeps the benchmark comparable with the published PHM literature and with the v0 XGBoost result.
| Partition | Engines | Rows | Role | When Touched |
|---|---|---|---|---|
| Internal Train | 567 (~80%) | 129,331 | Loss optimisation and weight updates | Every batch of every epoch |
| Internal Validation | 142 (~20%) | 31,028 | Convergence monitoring and epoch selection | Once per epoch |
| NASA Official Test | 707 | 104,897 | Headline benchmark and literature comparability | Exactly once at programme close |
All inherited preprocessing artefacts remain read-only. The only v1-specific transformation is the Yeo-Johnson treatment for high-skew virtual sensors in the neural input matrix, fitted on Internal Train only and then frozen for validation and test.
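A minimal sketch of that single transformation, with illustrative column names for the high-skew virtual sensors:

```python
from sklearn.preprocessing import PowerTransformer

HIGH_SKEW = ["cpr", "e_thermal"]   # illustrative high-skew virtual sensors

pt = PowerTransformer(method="yeo-johnson", standardize=True)
train_nn[HIGH_SKEW] = pt.fit_transform(train_nn[HIGH_SKEW])  # fit on train only
val_nn[HIGH_SKEW] = pt.transform(val_nn[HIGH_SKEW])          # frozen replay
test_nn[HIGH_SKEW] = pt.transform(test_nn[HIGH_SKEW])
```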
PiNet is a two-branch backbone. The TCN branch learns temporal structure from 30-cycle windows of the 91-feature matrix. The physics branch processes a curated set of physically interpretable scalars: active gas-path sensors, virtual thermodynamic features, cumulative fatigue indices, Cox hazard score, inverse-frequency regime weight, and fault-family context.
| Component | Input | Mechanism | Roadmap Rationale |
|---|---|---|---|
| TCN branch | 30-cycle windows of the Yeo-Johnson-transformed 91-feature matrix | Dilated causal 1D convolutions with residual connections | Causality prevents look-ahead; dilation covers the full window without excessive parameters. |
| Physics-token branch | Approximately 11 physically meaningful scalars per cycle | Two-layer MLP with ReLU and BatchNorm | Keeps gas-path thermodynamic state auditable instead of burying it in the TCN latent space. |
| Fusion embedding | Last-timestep TCN state, mean-pooled TCN state, and physics embedding | Two-layer fusion MLP with LayerNorm | Combines current condition, distributed degradation evidence, and explicit physics state. |
| Output heads | Shared embedding d_e = 128 | RUL regression head plus three-class softmax risk head | Produces both scalar RUL and operational banding for maintenance decisions. |
The physics branch is not duplicated modelling for its own sake. It exists because feature presence is not the same as representation preservation: a TCN can mix physically interpretable values with 80-plus other channels, while a dedicated branch keeps gas-path interpretation visible and cheap to audit.
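A compact PyTorch sketch of the two-branch shape described in the table, using the fixed v1 hyperparameters (W = 30, K = 3, dilations {1, 2, 4, 8, 16}, 64 hidden channels, d_e = 128). Residual connections are omitted for brevity, and all module names are illustrative rather than the project's actual code:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class CausalConv1d(nn.Module):
    """1D convolution with left-only padding so no future cycles leak in."""
    def __init__(self, c_in, c_out, k, dilation):
        super().__init__()
        self.pad = (k - 1) * dilation
        self.conv = nn.Conv1d(c_in, c_out, k, dilation=dilation)

    def forward(self, x):                        # x: (B, C, T)
        return self.conv(F.pad(x, (self.pad, 0)))

class PiNet(nn.Module):
    def __init__(self, n_features=91, n_phys=11, hidden=64, d_e=128):
        super().__init__()
        layers, c_in = [], n_features
        for d in (1, 2, 4, 8, 16):               # doubling dilations: RF 63 > W 30
            layers += [CausalConv1d(c_in, hidden, 3, d), nn.ReLU()]
            c_in = hidden
        self.tcn = nn.Sequential(*layers)
        self.phys = nn.Sequential(               # shallow physics-token MLP
            nn.Linear(n_phys, 64), nn.BatchNorm1d(64), nn.ReLU(),
            nn.Linear(64, 64), nn.ReLU())
        self.fuse = nn.Sequential(               # last step + mean pool + physics
            nn.Linear(hidden * 2 + 64, d_e), nn.LayerNorm(d_e), nn.ReLU(),
            nn.Linear(d_e, d_e), nn.ReLU())
        self.rul_head = nn.Linear(d_e, 1)        # scalar RUL regression
        self.risk_head = nn.Linear(d_e, 3)       # Healthy / Warning / Critical

    def forward(self, window, phys_tokens):      # window: (B, 91, 30)
        h = self.tcn(window)                     # (B, 64, 30)
        z = torch.cat([h[:, :, -1], h.mean(dim=2),
                       self.phys(phys_tokens)], dim=1)
        e = self.fuse(z)
        return self.rul_head(e).squeeze(-1), self.risk_head(e)
```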
The Stage A primary model uses one composite loss. It combines NASA asymmetric RUL cost with class-weighted cross-entropy, and both terms are observation-weighted by the inverse-frequency regime weight.
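A sketch of that composite objective, assuming the PHM08 exponents given earlier; the 0.5 mixing weight and tensor names are illustrative:

```python
import torch
import torch.nn.functional as F

def nasa_asymmetric(rul_pred, rul_true):
    """Differentiable per-observation NASA cost (PHM08 exponents)."""
    d = (rul_pred - rul_true).clamp(-100, 100)   # clamp for numerical stability
    return torch.where(d >= 0, torch.exp(d / 10.0), torch.exp(-d / 13.0)) - 1.0

def stage_a_loss(rul_pred, rul_true, risk_logits, risk_true,
                 regime_w, class_w, lam=0.5):
    rul_term = nasa_asymmetric(rul_pred, rul_true)
    ce_term = F.cross_entropy(risk_logits, risk_true,
                              weight=class_w, reduction="none")
    # Both terms observation-weighted by inverse-frequency regime weight.
    return (regime_w * (rul_term + lam * ce_term)).mean()
```

The risk bands the cross-entropy term sees are defined below.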
| Risk Band | RUL Range | Natural Fraction | Training Batch Share |
|---|---|---|---|
| Healthy | RUL ≥ 80 | ≈ 60% | 40-50% |
| Warning | 30 ≤ RUL < 80 | ≈ 27% | 25-30% |
| Critical | RUL < 30 | ≈ 13.2% | 25-30% |
Validation and test distributions are never resampled. Batch balancing is only a training stabilisation mechanism to stop the terminal region from being drowned by Healthy-cycle windows.
There is no informal hyperparameter search in v1. The roadmap fixes the primary configuration before Stage A training starts. Any deviation becomes an explicit ablation, separated from the headline benchmark.
| Hyperparameter | Fixed Value | Source / Rationale |
|---|---|---|
| Embedding dim d_e | 128 | Li et al. C-MAPSS trade-off |
| TCN layers L | 5 | ERF = 63 cycles > W = 30 |
| TCN kernel K | 3 | Bai et al. TCN standard |
| Dilations | {1, 2, 4, 8, 16} | Doubling pattern for long memory |
| Hidden channels | 64 | PHM literature precedent |
| Window length W | 30 cycles | Heimes precedent plus v0 autocorrelation lag 17 margin |
| Physics MLP | 2 layers, BatchNorm, ReLU | Shallow by design because tokens are already meaningful |
| Dropout | 0.15 | Regularisation on a 16 GB Apple Silicon MPS budget |
| Batch size | 64 | Apple Silicon headroom |
| Learning rate | 1 × 10⁻³ | Adam default and PHM precedent |
| Optimiser | AdamW | Standard neural PHM practice |
| Epochs | 80, early stopping patience 15 | Empirical from prior SOTA work |
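The ERF entry follows from the standard dilated-stack receptive-field formula, assuming one causal convolution per level as tabled:

$$
\mathrm{RF} = 1 + (K - 1)\sum_l d_l = 1 + 2\,(1 + 2 + 4 + 8 + 16) = 63 > W = 30.
$$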
The sole head-to-head comparator is the v0 XGBoost result carried forward unchanged. DeepAR, TFT, and PatchTST are not treated as meaningful v1 comparators because their v0 NASA Scores were several thousand-fold worse than XGBoost and therefore outside the operational viability band.
| Metric | XGBoost v0 Carry-Forward | PiNet v1 Decision Logic |
|---|---|---|
| RMSE | 14.85 cycles | Report per subset and pooled; improvement is valuable only if safety metrics hold. |
| NASA Score | 4,336.3 | Primary operational cost metric when RMSE and risk conflict. |
| Critical-band F2 | 0.9339 | Safety-weighted measure because recall is twice as important as precision. |
| Critical recall | 0.9363 | Strong pass if PiNet recall − XGBoost recall ≥ 0. |
| Outcome | Condition | Programme Decision |
|---|---|---|
| Strong Pass | PiNet Critical recall − XGBoost Critical recall ≥ 0 | PiNet declared operationally superior. |
| Pass with Advisory | −0.02 ≤ PiNet Critical recall − XGBoost Critical recall < 0 | PiNet declared comparable and flagged as a watch item for the probabilistic extension. |
| Fail | PiNet Critical recall − XGBoost Critical recall < −0.02 | PiNet is not adopted as a replacement; failure analysis is required in Phase v1.6. |
The phase plan runs from data verification through window construction, PiNet implementation, primary training, single-touch NASA test evaluation, diagnostics and SHAP audit, then the advanced ablation. Every phase has numeric assertions and supporting visuals; no phase starts until the preceding gates pass.
The advanced ablation preserves the MSc-dissertation novelty hook without contaminating the headline result. It adds semi-hard triplet metric learning and a Shannon entropy term to the Stage A objective, but only on a representative compute-bounded subset first.
Ablation rollout to the full FD00u Internal Train is authorised only if the sample run produces a tighter held-out RUL distribution, improved Critical-class F2, or stronger same-stage versus different-stage embedding separation under a Kolmogorov-Smirnov gate. Null results are reported as null results rather than hidden.
The roadmap also defines a future probabilistic extension: conformal RUL prediction intervals and temperature-scaled risk-band probabilities over the frozen PiNet backbone, without retraining the deterministic core.
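As a sketch under those assumptions, split-conformal intervals need only a calibration pass over the frozen backbone (pinet_pred and the data splits are placeholders):

```python
import numpy as np

alpha = 0.1                                      # 90% target coverage
res = np.abs(y_cal - pinet_pred(X_cal))          # calibration residuals
n = len(res)
q = np.quantile(res, min(1.0, np.ceil((n + 1) * (1 - alpha)) / n))
lo, hi = pinet_pred(X_test) - q, pinet_pred(X_test) + q  # symmetric interval
```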
The roadmap keeps a compact leakage register active for v1. The most important controls are engine-boundary disjointness, frozen preprocessing artefacts, within-engine windowing, validation used only for early stopping, and a logged one-time NASA test-set evaluation.
| Risk | Category | Mitigation |
|---|---|---|
| Engine appearing in multiple partitions | Hard leakage | GroupShuffleSplit by (subset_origin, unit_id) with programmatic non-intersection assertion. |
| Validation or test contaminating fitted statistics | Hard leakage | All inherited artefacts are read-only; v1 Yeo-Johnson parameters are fitted on Internal Train only. |
| Windowing across engine boundaries | Hard leakage if unchecked | Composite-key enforcement in the windowing function and per-batch assertion. |
| NASA test accessed before close | Hard leakage | Test set touched exactly once at Phase v1.4; no post-evaluation iteration. |
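The first control in the register reduces to a few lines; df is the assumed pooled frame:

```python
from sklearn.model_selection import GroupShuffleSplit

# Engine identity keyed on (subset_origin, unit_id) so FD002 unit 7 and
# FD004 unit 7 are distinct groups.
groups = df["subset_origin"].astype(str) + "_" + df["unit_id"].astype(str)
gss = GroupShuffleSplit(n_splits=1, test_size=0.2, random_state=42)
train_idx, val_idx = next(gss.split(df, groups=groups))

# Hard-leakage assertion: no engine may appear in both partitions.
assert set(groups.iloc[train_idx]).isdisjoint(groups.iloc[val_idx])
```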
The thesis addresses a compound challenge at the intersection of food security, climate science, and embedded machine learning: automated plant-disease detection in field environments where expert knowledge is scarce, labelled image data is limited, and the deployment platform imposes hard computational constraints.
The research goal is to develop a lightweight deep neural network pipeline capable of determining whether a plant is healthy or unhealthy from a small number of field images, with enough efficiency to run inference on resource-constrained IoT hardware. In the thesis, this capability profile is framed under precision agriculture.
The investigation operates on the PlantVillage dataset, publicly available via Kaggle and Mohanty’s GitHub repository, comprising 20,638 images across 15 plant-disease categories. For the binary classification objective, all 15 categories are collapsed into two classes.
| Class | Frequency | Proportion |
|---|---|---|
| Healthy | 3,221 | 15.61% |
| Unhealthy | 17,417 | 84.39% |
| Total | 20,638 | 100.00% |
A critical early finding is that the healthy class is the minority class, which inverts the usual disease-detection assumption. That imbalance has direct implications for loss design, threshold calibration, and recall prioritisation.
Stage 1 — Baseline Learning Model (BLM): MobileNetV2 configured as a standard binary classifier with random hyperparameter search. Its primary function is to establish three foundational hyperparameters that are inherited by Stage 2.
Stage 2 — Few-Shot Learning Model (FSLM): A Siamese Network with triplet-loss metric learning followed by a 2-way 5-shot episodic FSL evaluation with Bayesian hyperparameter optimisation. Its primary function is to demonstrate high-accuracy binary classification from minimal labelled examples.
| Stage | Model | Accuracy | Recall | MCC |
|---|---|---|---|---|
| Stage 1 (Test) | BLM — MobileNetV2 | 1.0000 | 1.0000 | 1.0000 |
| Stage 2 (Validation) | FSLM — 2-way 5-shot | 0.9969 | — | — |
| Stage 2 (Test) | FSLM — 2-way 5-shot | 0.9875 | 0.9877 | — |
| # | Category | Images |
|---|---|---|
| 1 | Pepper Bell Bacterial Spot | 997 |
| 2 | Pepper Bell Healthy | 1,478 |
| 3 | Potato Early Blight | 1,000 |
| 4 | Potato Late Blight | 1,000 |
| 5 | Potato Healthy | 152 |
| 6 | Tomato Bacterial Spot | 2,127 |
| 7 | Tomato Early Blight | 1,000 |
| 8 | Tomato Late Blight | 1,909 |
| 9 | Tomato Leaf Mold | 952 |
| 10 | Tomato Septoria Leaf Spot | 1,771 |
| 11 | Tomato Two-Spotted Spider Mite | 1,676 |
| 12 | Tomato Target Spot | 1,404 |
| 13 | Tomato Yellow Leaf Curl Virus | 3,209 |
| 14 | Tomato Mosaic Virus | 373 |
| 15 | Tomato Healthy | 1,591 |
| — | Total | 20,639 |
Two anomalous files were identified during the file-extension audit: almost all images are .jpg, with two exceptions in .png and .jpeg. After collapsing the 15 categories into a binary target, the class ratio becomes 5.41:1 in favour of the unhealthy class.
| Partition | Proportion | Role |
|---|---|---|
| Training (7) | 70% | Weight optimisation, random search, Siamese metric learning |
| Validation (2) | 20% | Hyperparameter tuning and episodic support-set pool |
| Test (1) | 10% | Final evaluation and episodic query set |
MobileNetV2 was selected because its depthwise separable convolution structure substantially reduces parameter count while preserving feature extraction capability. The width multiplier α = 0.5 halves the number of convolutional kernels at each layer.
A logistic classification threshold of 0.3, rather than 0.5, is used to improve recall on the unhealthy class because the cost of missing a diseased plant is materially higher than the cost of a false positive.
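A minimal Keras sketch of the Stage 1 configuration, combining the width multiplier, the Trial 2 hyperparameters, and the recall-biased threshold; input size and head layout are assumptions:

```python
import tensorflow as tf

base = tf.keras.applications.MobileNetV2(
    input_shape=(224, 224, 3), alpha=0.5,        # width multiplier α = 0.5
    include_top=False, weights="imagenet", pooling="avg")

model = tf.keras.Sequential([
    base,
    tf.keras.layers.Dropout(0.3),                # optimal dropout from Trial 2
    tf.keras.layers.Dense(1, activation="sigmoid"),
])
model.compile(optimizer=tf.keras.optimizers.Adam(1e-4),  # optimal learning rate
              loss="binary_crossentropy")

# Recall-biased decision rule: flag unhealthy at p >= 0.3 rather than 0.5.
is_unhealthy = model.predict(images) >= 0.3
```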
| Hyperparameter | Symbol | Search Space | Optimal (Trial 2) |
|---|---|---|---|
| Width Multiplier | α | {0.35, 0.5, 0.75, 1.0} | 0.5 |
| Learning Rate | η | {0.0001, 0.001, 0.01} | 0.0001 |
| Dropout Rate | P | {0.2, 0.3, 0.4, 0.5} | 0.3 |
The primary discriminator is the Matthews Correlation Coefficient (MCC), preferred over accuracy under class imbalance because it uses all four cells of the confusion matrix.
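For reference, MCC is computed from all four cells at once:

$$
\mathrm{MCC} = \frac{TP \cdot TN - FP \cdot FN}
{\sqrt{(TP + FP)(TP + FN)(TN + FP)(TN + FN)}}
$$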
| Dataset | Accuracy | Precision | Recall | F1-Score | MCC |
|---|---|---|---|---|---|
| Train | 0.9999 | 1.0000 | 0.9998 | 0.9999 | 0.9995 |
| Validation | 0.9985 | 0.9989 | 0.9994 | 0.9991 | 0.9945 |
| Test | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 1.0000 |
Sub-stage 2a repurposes the MobileNetV2 backbone inside a Siamese Network and trains it with triplet loss. This is metric learning, not episodic FSL. Sub-stage 2b freezes that embedding branch and performs a 2-way 5-shot episodic evaluation with Bayesian hyperparameter optimisation.
| Component | Parameters | Notes |
|---|---|---|
| Trainable (MobileNetV2 + Dense layers) | ~1,180,000 | Updated during triplet-loss training |
| Non-Trainable (ImageNet pretrained weights) | 18,544 | Frozen from Stage 1 |
| Total | ~1,200,000 | |
| Memory footprint | 4.57 MB | IoT-deployable |
The geometric constraint is d⁻ ≥ d⁺ + α_trip: the anchor-negative distance must exceed the anchor-positive distance by at least the margin α_trip. The observed triplet loss falls rapidly from 0.0027 at epoch 1 to approximately zero by epoch 2, indicating the pretrained backbone already provides a highly structured embedding space.
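A minimal sketch of the triplet objective behind that constraint, written in PyTorch for consistency with the earlier sketches; embeddings are assumed L2-normalised:

```python
import torch
import torch.nn.functional as F

def triplet_loss(anchor, positive, negative, alpha_trip=0.2):
    """Hinge on d⁺ (anchor-positive) vs d⁻ (anchor-negative) with margin."""
    d_pos = F.pairwise_distance(anchor, positive)   # same-class distance d⁺
    d_neg = F.pairwise_distance(anchor, negative)   # cross-class distance d⁻
    # Zero loss exactly when d⁻ ≥ d⁺ + α_trip holds for the triplet.
    return F.relu(d_pos - d_neg + alpha_trip).mean()
```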
t-SNE visualisation confirms clean healthy/unhealthy cluster separation, while the pairwise distance density plot shows that intra-class and inter-class distance distributions are only minimally overlapping.
Each episode is a miniature binary classification task with N = 2 classes and k = 5 labelled support examples per class. A Softmax classifier is then applied over the frozen embedding space.
The sign of λ_Shan determines whether entropy is penalised or rewarded, which is what makes the hyperparameter landscape non-trivial.
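A sketch of one episode's loss under these definitions. The prototype-distance softmax is an assumption about the classifier head; the entropy term shows how the sign of λ_Shan flips penalty into reward:

```python
import torch
import torch.nn.functional as F

def episode_logits(query_emb, support_emb, support_labels, n_way=2):
    # Class prototypes: mean of the k = 5 support embeddings per class.
    protos = torch.stack([support_emb[support_labels == c].mean(0)
                          for c in range(n_way)])
    return -torch.cdist(query_emb, protos)       # nearer prototype, higher logit

def episodic_loss(logits, query_labels, lam_shan=0.0):
    ce = F.cross_entropy(logits, query_labels)
    p = F.softmax(logits, dim=1)
    entropy = -(p * torch.log(p + 1e-9)).sum(1).mean()   # Shannon entropy H(p)
    # lam_shan > 0 penalises entropy; lam_shan < 0 rewards it; 0 disables it.
    return ce + lam_shan * entropy
```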
The Bayesian optimiser uses a Gaussian Process surrogate and Expected Improvement (EI) as the acquisition function.
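A minimal scikit-optimize sketch of that loop over the search space tabled below; run_episodes is a placeholder for the episodic validation routine, and the EI exploration weight maps to gp_minimize's xi argument:

```python
from skopt import gp_minimize
from skopt.space import Categorical

space = [
    Categorical([-10.0, -5.0, -1.0, 0.0, 0.01, 0.5], name="lam_shan"),
    Categorical([1.5, 2.5], name="xi_smooth"),
]

def objective(params):
    lam_shan, xi_smooth = params
    return -run_episodes(lam_shan, xi_smooth)    # minimise negative accuracy

# Gaussian Process surrogate with Expected Improvement acquisition.
result = gp_minimize(objective, space, n_calls=10,
                     acq_func="EI", xi=0.75, random_state=42)
```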
| Hyperparameter | Symbol | Search Space | Optimal (Trial 2) | Role |
|---|---|---|---|---|
| Shannon coefficient | λ_Shan | {−10.0, −5.0, −1.0, 0.0, 0.01, 0.5} | 0.0 | Regularisation weight |
| Smoothing parameter | ξ | {3/2, 5/2} | 5/2 (= 2.5) | Embedding-space sensitivity |
| EI exploration weight | ξ_EI | {0.0, 0.1, 0.5, 0.75, 1.0} | 0.75 | Exploration–exploitation balance |
| Trial | λ_Shan | ξ | ξ_EI | Validation Accuracy | Notes |
|---|---|---|---|---|---|
| 2 ★ | 0.0 | 2.5 | 0.75 | ~0.9969 | Optimal — Shannon term inactive; moderate exploration bias |
| 3 | −5.0 | — | — | ~0.9969 | Inverted regulariser; still highly performant |
| 10 | — | — | — | Worst trial | Demonstrates hyperparameter sensitivity |
| Partition | Accuracy | Recall | F1-Score |
|---|---|---|---|
| Validation | ~0.9969 | — | — |
| Test | 0.9875 | 0.9877 | — |
The 98.77% recall on the test set is the operationally critical result: it quantifies the fraction of genuinely diseased plants correctly identified, which is the central requirement for field deployment under limited expert supervision.
Controlled environment bias: PlantVillage is captured under studio conditions, so the reported performance is not expected to transfer directly to field conditions with illumination variation, occlusion, motion blur, and other sources of noise.
Class imbalance inversion: the minority class is healthy, not unhealthy. Real agricultural deployments may reverse or destabilise this prevalence pattern, implying a need for threshold recalibration or explicit class weighting.
2-way episodic constraint: the thesis addresses a binary problem only. Multi-class severity staging is proposed as a future extension using cosine-similarity Softmax, class-weighted triplet sampling, and zero-bias initialisation prior to L2 normalisation.
| Thesis Component | FusionCore Adaptation | Adaptation Notes |
|---|---|---|
| Siamese Network triplet loss | PiNet metric-learning regularisation via L_triplet | Adapted from binary image classification to multi-stage RUL regression using RUL-band proximity rather than class labels |
| Shannon regulariser (λ_Shan, ξ) | Embedding-collapse prevention in PiNet backbone pretraining | Used to counter collapse risk under terminal-class imbalance in C-MAPSS |
| Bayesian HPO (GP + EI) | Deferred from the FusionCore v1 headline experiment | The v1 PiNet roadmap uses fixed PHM-sourced hyperparameters; Bayesian tuning remains a future research stream rather than a test-set-facing control |
| 2-way 5-shot episodic FSL | True episodic FSL deferred to FusionCore v2 | N-CMAPSS provides the cross-dataset novelty required for a genuine episodic setting |
Technical focus developed through extensive independent study and a growing specialist library covering aerospace predictive maintenance, prognostics, deep learning, and time-series methods. My interest sits in building intelligent systems that can identify early signs of degradation, model temporal behaviour, and support decision-making in high-stakes engineering environments. I am especially drawn to Remaining Useful Life estimation, anomaly detection, and physics-aware machine learning approaches that connect strong technical performance with operational credibility.
Asset health, condition monitoring, and operational reliability.
Run-to-failure simulation, C-MAPSS datasets, RUL estimation, condition monitoring, and Industry 4.0 maintenance strategies.
Neural models for perception, sequence learning, and forecasting.
CNNs, RNNs, Temporal Fusion Transformers, DeepAR, N-HiTS, Siamese Networks, ResNet, MobileNet, and Inception architectures.
Temporal modelling of degradation, anomalies, and lifecycle trends.
Multi-horizon forecasting, degradation modelling, anomaly detection, and physics-informed feature engineering for temporal data.
Probabilistic reasoning, inference, and validation strategy.
Bayesian optimisation, density estimation, survival analysis, hyperparameter tuning, and cross-validation strategies.
Learning efficiently from limited labelled data.
Prototypical networks, Siamese architectures, domain adaptation, and learning from limited labelled data.
Building trust and transparency in high-stakes models.
SHAP values, model interpretability, t-SNE visualisation, and building trust in high-stakes ML systems.
LinkedIn article tiles sit here as a separate stream from the wider study themes, using the same hover-led tile language and outward link treatment used across Mission Control.
I got my Data Science Master’s in my early 50s… then discovered the “abundant job market” had quietly left the building. So instead of waiting for “experience” to magically appear, I’m building it the hard way: FusionCore, a real aerospace predictive maintenance project using NASA turbofan sensor data (C-MAPSS)—focused on time-series anomaly detection and Remaining Useful Life (RUL) prediction.
This is Post 1 of a series where I’ll share updates (weekly, where possible) as I follow a roadmap I’ve laid out in the article: pick a niche, build domain knowledge, ship a real project, get it reviewed by industry people, and make it easy for employers to assess the work. Not glamorous, occasionally chaotic… but at least it’s honest.
If this resonates, feel free to repost it for anyone else career-switching or job-hunting. And if you work in aerospace / predictive maintenance (or you’ve already broken into it), I’d love to connect—even a quick “here’s what I’d do differently” could save me weeks. Also: if you’re on a similar journey, tell me what you’re building (or what’s not working). Misery loves company… but progress loves receipts.
I built my data science project the fastest way to fool myself: grabbed the data, let AI sprint, shipped six notebooks… and called it “progress.” The problem? I still couldn’t explain what the numbers meant, which variables mattered to whom, or what risk hides inside “good model performance.”
So, I scrapped the notebook pile and rebuilt the work like an operating system: compress feedback loops, delete/simplify before automating, iterate fast, and engineer the project so it survives scrutiny without me in the room—think mission readiness review, but the payload is my own competence.
I’ve just shared a new article on how I set up my predictive maintenance project — and why I spent far longer building the roadmap than touching the model itself. Before getting to the glamorous machine learning bit, there was the small matter of understanding the physics, the risk, and the cost of being wrong. Turns out, in aerospace, “just winging it” is not a recognised methodology.
My latest article is about a result that genuinely surprised me.
I built FusionCore v0, a physics-aware predictive maintenance pipeline for turbofan engine Remaining Useful Life estimation, fully expecting the neural networks to lead.
They didn’t.
But the article is about far more than which model won.
It is about how to build AI that can speak to engineering, safety, operations, and finance at the same time. Using FusionCore v0 as the case study, I explain why the strongest result came from a model built on physics grounding, zero-leakage controls, and risk-aware evaluation, and why AI in aerospace has to be judged not just by prediction quality, but by how well it handles the operational consequences of being wrong.