FAU Erlangen-Nürnberg
The closing unit, in one line
What U14 is not.
What we built
Where we ended
Three honest gaps
The three gaps map to U14’s three knobs
By the end of 90 minutes, you can:
The naïve generative-model failure
Constraints are correctness
Composition-side
Structure-side
Mnemonic: Composition is what the formula says; structure is what the lattice says; thermodynamics is whether nature lets it exist.
Architectural prior — the constraint is built into the model
Hard projection / filter vs soft penalty
Composition simplex via softmax
Charge-balance head for ionic compounds
Latent-space projection
Discriminator / score-based filter
Constrained acquisition
\[x^* = \arg\max_{x \in \mathcal{F}} \alpha(x)\]
Cost-aware soft variant
Filter ordering matters: ranking 1000 candidates and then filtering to feasible ≠ filtering to feasible and then ranking. The two top-10 lists are different. Filter first.
Soft penalty
\[\mathcal{L}_{\text{total}} = \mathcal{L}_{\text{data}} + \lambda \mathcal{L}_{\text{phys}}\]
Hard projection
\[\hat{x} = \Pi_{\mathcal{F}}(x)\]
Setup
Without vs with the simplex head
Setup
Unconstrained vs simplex acquisition
Generalisable lesson: parameterise the constraint into the search space, do not impose it via post-hoc rescaling.
The PINN loss
\[\mathcal{L}_{\text{PINN}} = \mathcal{L}_{\text{data}} + \lambda_r \|\mathcal{N}[u_\theta]\|^2 + \lambda_b \|\mathcal{B}[u_\theta]\|^2\]
What PINN gives you
Forward problem
Inverse problem (where PINNs shine)
Microstructure homogenisation
Phase-field parameter inference
Where PINNs fit
Where PINNs do not fit
Closing rule: use a PINN where you have a PDE you trust and parameters you do not. Otherwise use a §B-constrained surrogate.
The setup that breaks naïve trust
The two operational OOD signals
Three sources of sim–exp gap
Operational consequence
The phenomenon
Why it happens
Operational rule: re-calibrate the surrogate on every newly-entered chemistry family, before using its uncertainty for screening.
The construction
The guarantee
\[\hat{C}_\alpha(x) = [\hat{f}(x) - q_\alpha,\ \hat{f}(x) + q_\alpha]\]
Per-family calibration
As an acquisition gate
Three usable OOD scores
Use as a refusal gate
The trap
Mitigation
The lesson: trust is a system property, not a model property. Combine signals.
The audit trail per decision
The materials-specific MFML W14 instantiation
Discovery is a decision problem, not a prediction problem
Six steps, repeated
Components, named
Interfaces, the painful part
Workflow engines (pick one)
BO drivers (pick one)
The 80/20 rule for autonomous labs: 80% of the work is orchestration; 20% is the surrogate. The community has good tools for both halves now — use them.
What A-Lab claimed
What A-Lab demonstrated
The follow-up critique (leeman2024challenges?)
What we learn
Photochemistry / catalysis
Energy materials, polymers, electrolytes
Recipe ambiguity
Hardware bottlenecks and sample-handling errors
Characterisation-pipeline failures
The operator-time bottleneck
Works (productive use)
Marginal / does not yet work
The honest 2026 verdict: autonomous labs are real research infrastructure, within their domain. They are not yet a general-purpose discovery engine.
What you need
What that buys you
FAIR principles, applied to ML
ML artefacts that need FAIR-ification
What a dataset card answers
Splits and distribution
Intended use vs out-of-scope
Trust artefacts
Recent shortcut-learning findings
Recommended 2026 practice
The minimum bundle
The story bundle
The reviewer’s checklist: can I reproduce the model from the artefacts? Can I reproduce the numbers? Can I trust the OOD claim? If any answer is “no,” the paper needs revision.
The 2024–2026 emergence
What they share
Cheap energy evaluation
Bottleneck shift
Three fidelities in 2026 MG
Routing the query
The right framing is not “DFT or experiment” — it is “spend each budget where it pays” (Murphy 2012).
Cost per query in 2026 (order of magnitude)
The economic logic
Methodological open problems
Infrastructural open problems
Maturity ladder, by component
Reading the field honestly
The arc, walked once more
The arc’s destination
Choose and Train
Plan and Close
If you can do all four end-to-end on a chemistry domain you care about, you are an MG practitioner. That is what this course taught.
The course textbooks
Beyond MG itself
Exam scope
The exam rubric

© Philipp Pelz - Materials Genomics