Materials Genomics
Unit 11: Latent Spaces of Materials (supplementary)

Prof. Dr. Philipp Pelz

FAU Erlangen-Nürnberg

FAU Logo IMN Logo CENEM Logo ERC Logo Eclipse Logo

Supplementary Reading

How to Use This Deck

This deck is supplementary reading, not a standalone lecture.

  • After the 2026-05-13 realignment, the Week 11 lecture (23.06.2026) merges representation learning and latent-space interpretation into a single 90-minute session delivered from Unit 10.
  • This deck is preserved as a deeper dive for students who want more detail on materials latent-space interpretation, anomaly detection, and the perovskite case study.
  • It is not lectured live. Treat it as extended reading after Unit 10 and before Unit 12 (generative models).
  • Cross-references in this deck to “next week’s clustering unit” reflect the old schedule and should be read as: clustering content is now folded into Unit 13 (uncertainty-aware discovery).

§A · MFML W9 Recap

01. Today’s Question

You already have an embedding \(z\). Now what?

  • MG Unit 10 trained the encoder.
  • MFML W9 derived PCA, t-SNE, UMAP, contrastive learning.
  • Today’s question: how do you read the resulting materials map without lying to yourself?

What this unit is not.

  • Not a derivation of dimensionality reduction — that is MFML W9 (Bishop 2006; Murphy 2012).
  • Not a re-introduction of autoencoders — that was MG U10 and ML-PC U5.
  • Today: deploy these tools as a materials discovery instrument.

02. Where We Are in the Triad

Recap — what we already have

  • MFML W9: PCA, t-SNE, UMAP, contrastive embeddings, foundation embeddings.
  • MG U10: materials-specific encoders — CGCNN (xie2018cgcnn?), MEGNet (chen2019megnet?), M3GNet (chen2022m3gnet?), contrastive crystal embeddings, foundation models for materials.
  • ML-PC U5: AE bottleneck \(z\) as a feature; reconstruction-error anomaly detection.

Today — Unit 11 in one line

  • Read the materials latent space as a composition–structure–property map.
  • Navigate it by interpolation and arithmetic.
  • Challenge it: probes, ablations, and the failure modes of trusting a UMAP picture.

03. PCA / t-SNE / UMAP at One Sentence Each

PCA

  • Linear projection along directions of maximum variance.
  • Axes are interpretable via loadings.
  • Preserves global variance; preserves no nonlinear structure (Bishop 2006).

t-SNE (vandermaaten2008tsne?)

  • Nonlinear; preserves local neighbourhoods.
  • Inter-cluster distances are not meaningful.
  • Hyperparameter perplexity controls neighbourhood size.

UMAP (mcinnes2018umap?)

  • Nonlinear; faster than t-SNE.
  • Preserves more global structure than t-SNE — but still distorts.
  • Hyperparameters n_neighbors, min_dist control granularity.

One-line decision rule. If you need axes you can name, use PCA. If you need clusters you can see, use t-SNE / UMAP — and then verify them.

§B · From Abstract to Materials Latent Spaces

04. What Changes When \(z\) Encodes a Material?

Generic latent space (MFML W9 framing)

  • \(z = \mathcal{E}(x)\) for some input \(x\) — could be pixels, words, audio.
  • Distances in \(z\) are learned distances; nothing is assumed about \(x\)’s physical meaning.
  • The space is “blank” until you decide what claim to make.

Materials latent space

  • \(z = \mathcal{E}(\text{composition}, \text{structure})\) from MG U10.
  • \(z\) now lives in chemistry / structure coordinates.
  • Distances are learned distances over chemistry — not Euclidean distances in atom positions (Sandfeld et al. 2024).

05. Periodic-Boundary Considerations

Why PBC matters for \(z\)

  • A crystal has no “first atom” — translations of the unit cell origin are physically equivalent.
  • \(z\) must be invariant under origin shift.
  • Periodic images of a single atom must not be counted as neighbours of itself.

Whose responsibility?

  • The encoder (U10) carries this responsibility.
  • The visualiser (today) inherits it: a PBC-broken encoder produces a PBC-broken map.
  • Diagnostic: re-embed the same crystal with shifted origin; if \(z\) moves, the encoder is broken.

06. Equivariance Baked Into \(\mathbb{L}\)

The equivariance promise

  • Rotation: rotated crystal \(\to\) same \(z\).
  • Permutation: relabelling atoms of the same species \(\to\) same \(z\).
  • Reflection / inversion: handled by encoder design.
  • Equivariant encoders make \(\mathbb{L}\) a quotient space — symmetry duplicates collapse.

What goes wrong without it

  • Rotated duplicates show up as a spurious axis in \(z\).
  • Permuted-atom duplicates form a fake cluster.
  • Visualisation cannot fix these; they were errors before projection.
  • Discipline: check that one input gives one \(z\), regardless of pose.

07. Composition vs Structure Latent Spaces

Composition-only embeddings

  • One \(z\) per formula (e.g., \(\text{BaTiO}_3\)).
  • All polymorphs collapse to the same point.
  • Useful when polymorphism is irrelevant; misleading otherwise.
  • Examples: Magpie-derived embeddings (ward2016magpie?), formula-only foundation models.

Structure-aware embeddings

  • One \(z\) per crystal — primitive cell + space group.
  • Polymorphs separate; defects modify \(z\).
  • The standard for the rest of this lecture.
  • Examples: CGCNN, M3GNet, SchNet (schutt2018schnet?) on full structures.

§C · Composition–Structure–Property Maps

08. Projecting MP onto 2D — The Workhorse View

The recipe

  1. Pull ~30 k–100 k MP entries (a chemistry slice or all of MP).
  2. Embed each: \(z_i = \mathcal{E}_\theta(\text{material}_i) \in \mathbb{R}^{128}\).
  3. Project to 2D: PCA and UMAP — both, not one.
  4. Colour by the property of interest.

Why both projections

  • PCA gives you axes you can defend.
  • UMAP gives you clusters you can see.
  • Reporting both prevents either-method bias.
  • This 2D scatter is the unit’s central object — the rest of §C reads it.

09. Colour by Formation Energy

What we see

  • Stable region (most negative \(E_f\)): a dense lobe in one corner.
  • High-\(E_f\) “frontier”: a sparser region at the periphery.
  • Smooth colour gradient between them — not random.

What this says

  • The encoder has organised chemistry along an axis correlated with thermodynamic stability — without ever being told a stability label.
  • Formation energy is implicit in pretraining tasks like reconstruction or property regression.
  • The gradient is the encoder’s organisation; the projection just makes it visible.

10. Colour by Band Gap

What we see

  • Metals (\(E_g = 0\)): one cluster.
  • Insulators (\(E_g > 3\) eV): another cluster.
  • Semiconductors: a band along the boundary between them.
  • The map separates electronic regimes.

Why this is non-trivial

  • Most pretraining tasks do not explicitly target band gap.
  • The separation arises from correlated features that the encoder did see (electron count, electronegativity contrast, coordination).
  • “Emergent” structure — but emergent because of physics, not magic.

11. Colour by Stability — Energy Above Hull

The colour scale

  • \(E_{\text{hull}} = 0\) eV/atom: on the convex hull. Synthesisable.
  • \(E_{\text{hull}} \in (0, 0.05]\): metastable; usually accessible.
  • \(E_{\text{hull}} > 0.1\): unlikely to be synthesisable as the listed structure.

Reading the stability map

  • Hull-stable region: a connected manifold, not isolated points.
  • Metastable shell around it: candidates for synthesis with effort.
  • Far frontier: useful as negatives, rarely as targets.
  • This map is the foundation for U13’s discovery loop.

12. What Clusters Reveal About Chemistry Families

Coarse separation

  • Oxides, sulfides, halides occupy distinct regions.
  • Intermetallics form their own peninsula.
  • Carbides and nitrides share a region — they share much chemistry.
  • The encoder spontaneously recovers conventional chemistry-class boundaries.

Per-element substructure

  • Within “Ti-bearing oxides”: further organisation by Ti coordination, oxidation state, and tilt pattern.
  • Within “rare-earth halides”: organisation by lanthanide contraction.
  • The map has internal structure all the way down.

13. Per-Prototype Substructure

Within a chemistry family

  • “ABO\(_3\) oxides” subdivides into perovskites, ilmenites, spinels (where \(\text{A}^{2+}\text{B}^{4+}\) allows).
  • “Garnets” form a distinct sub-cluster.
  • Layered vs three-dimensional structures separate.

Why prototypes work

  • A prototype is a recurring local-environment pattern — the very thing the encoder was trained to recognise (MG U6, U10).
  • The latent map’s prototype clusters are the learned analogue of structural-prototype databases.
  • Implication: the encoder has implicitly solved a polytope-classification task.

14. Case Study Preview — Perovskites in 2D

The ABO\(_3\) slice

  • ~10 k entries in MP that match ABO\(_3\) stoichiometry.
  • Cubic, tetragonal, orthorhombic, rhombohedral, hexagonal polymorphs.
  • Octahedral tilt patterns (Glazer notation) discrimminate sub-families.

What we will see in §G

  • Three main lobes: cubic, tilted, hexagonal.
  • A tetragonal–orthorhombic bridge populated by ferroelectrics.
  • A “formability frontier” along \(E_{\text{hull}}\) gradient.
  • Full slide-by-slide treatment in §G (slides 41–46).

15. Reading a Property Map — Checklist

Five questions before you trust a latent-space figure

  1. What encoder? Version + pretraining corpus.
  2. What projection? PCA vs UMAP vs t-SNE; hyperparameters; seed.
  3. What colour scale? Linear / log / clipped; what range?
  4. Are dense regions distinguishable from cherry-picked highlights? Show density.
  5. What linear-probe \(R^2\) for the property being plotted?

A figure that fails any of these questions is decoration.

  • Decoration may be illustrative; it is not evidence.
  • Evidence requires all five.
  • This checklist is the default requirement for any latent-space claim in U11–12 exercises.

16. The Pitfall of UMAP-as-Truth

Why UMAP layouts are not unique

  • UMAP loss has many local minima.
  • Different seeds \(\to\) different global layouts.
  • Different n_neighbors \(\to\) different cluster topologies.
  • Different min_dist \(\to\) different cluster compactness.

What is robust vs what is not

  • Robust: “these N materials cluster together” (with a sweep showing it survives).
  • Not robust: “this cluster is at the upper-left.”
  • Not robust: “cluster A is closer to cluster B than to cluster C.”
  • Robust claims survive a hyperparameter sweep.

17. PCA When You Need Accountable Axes

When PCA wins

  • A reviewer asks “what does the \(x\)-axis mean?”
  • You need to name the dominant direction of variation.
  • You want to compare loadings across multiple datasets.
  • The relationship is plausibly linear in the embedding (Bishop 2006; Murphy 2012).

What PCA cannot do

  • Show nonlinear clusters as separated.
  • Resolve manifold structure smaller than the global variance scale.
  • Tell you anything t-SNE / UMAP would tell you about local neighbourhoods.

Use PCA and UMAP — they answer different questions.

§D · Phase Discovery in Latent Space

18. Phase Discovery Without Labels

The setting

Why this works

  • A good encoder has already done most of the heavy lifting (§C §12–13).
  • Clustering on \(z\) is therefore far easier than clustering on raw features.
  • Cluster names are assigned post-hoc — by inspecting exemplar materials per cluster.

19. Outliers and Overlooked Materials

Outlier \(\neq\) noise

  • Materials in sparse regions of \(z\) are unlike anything else in the dataset.
  • For labelled data, that’s interesting.
  • For unlabelled data, it’s a signal that the encoder thinks this entry is unusual — worth investigation.

The outlier as a target

  • Sparse latent neighbourhood + low \(E_{\text{hull}}\) = under-explored stable corner.
  • A 2024 latent-space study of MP found ~200 such entries; ~30 had no published synthesis attempt despite favourable thermodynamics.
  • Outliers are not noise; they are the next paper.

20. Novelty Detection in Latent Coordinates

The novelty score

  • $(z) = $ distance to the \(k\)-th nearest neighbour in \(z\).
  • High score \(\to\) isolated in latent space.
  • Threshold: top-1% of training-set scores, or fixed quantile.

Caveats baked in

  • Novelty is relative to the corpus that built \(z\).
  • A “novel” material may just be a chemistry family the pretraining set under-covered.
  • Always cite the pretraining corpus alongside any novelty claim (Neuer et al. 2024).

21. Latent-Coordinate Novelty vs Reconstruction-Error Novelty

Two different anomalies

  • Latent novelty (slide 20): “this material is in an empty region of \(z\).”
  • Reconstruction novelty: “the autoencoder cannot redraw this material” — high \(\|x - \mathcal{A}(x)\|^2\) (Neuer et al. 2024).
  • They flag different failure modes.

When each fires

  • Latent novelty fires on unseen chemistry — the encoder placed it in an empty corner.
  • Reconstruction novelty fires on broken inputs — bad CIF, wrong stoichiometry, parse errors.
  • Use both; never confuse them.

22. Published Example — MoS\(_2\) Polymorphs

The system

  • MoS\(_2\) has at least three known polymorphs: 2H (semiconducting), 1T (metallic), 1T’ (semimetallic, distorted).
  • All three exist in MP / OQMD with full structures.

The latent map shows

  • Three distinct latent locations corresponding to the three polymorphs.
  • 1T’ lies between 2H and 1T — consistent with its description as a distorted 1T.
  • The encoder’s learned layout matches the textbook structural relationship.

23. Published Example — High-Entropy Alloy Clustering

The system

  • HEAs: 4–6 element alloys, near-equiatomic compositions.
  • Tens of thousands of compositions in computed databases.
  • Conventional clustering (by element pair, structure type) is brittle.

What the latent map shows

  • HEAs cluster by dominant element pair — not by labelled “HEA-class.”
  • Cantor-class (FeCoNiCrMn) clusters separately from refractory HEAs.
  • Within each cluster: organisation by lattice (FCC / BCC / HCP) and short-range order.

24. Complement to Supervised Regression

Two different jobs

  • Supervised regression: given \(x\), predict \(y\). Solved by U8–U10.
  • Latent discovery: given \(\{x_i\}\), propose new \(x\)’s of interest. Solved by today’s tools.

They are adjoint, not redundant

  • Regression interpolates within known \(x\).
  • Discovery proposes new \(x\) to evaluate.
  • A discovery loop alternates: regression predicts \(y\) on candidates; latent geometry suggests new candidates.
  • This is the loop U13 will close with uncertainty quantification.

25. Discovery as a Verb

The discovery loop in one slide

  1. Embed the corpus.
  2. Project + colour.
  3. Identify a sparse region of low \(E_{\text{hull}}\).
  4. Decode candidate compositions in that region.
  5. DFT-validate the top-\(k\).
  6. Synthesise the most promising 1–2.
  7. Fold results back; retrain.

The latent map is a first filter

  • Step 3 is what today’s lecture builds toward.
  • Steps 4–7 are the rest of the course (U12, U13, U14).
  • Without step 3, the rest of the loop has no proposal mechanism.

§E · Latent-Space Arithmetic and Interpolation

26. The word2vec Analogy

The famous example

\[\vec{\text{king}} - \vec{\text{man}} + \vec{\text{woman}} \approx \vec{\text{queen}}\]

  • Word embeddings support analogical arithmetic (mikolov2013word2vec?).
  • The vector “\(- \text{man} + \text{woman}\)” is, approximately, a “gender” direction.
  • Semantic relationships become vectors in \(\mathbb{L}\).

The materials analogue

\[z_{\text{BaTiO}_3} - z_{\text{Ba}} + z_{\text{Sr}} \approx z_{\text{SrTiO}_3}\]

  • For some encoders, this works — element substitution is a vector.
  • “Replace Ba by Sr” becomes a navigable direction.
  • The analogy is the conceptual seed for inverse design.

27. Composition-Substitution Arithmetic

The substitution vector

  • \(\vec{v}_{\text{Ba}\to\text{Sr}} = z_{\text{SrTiO}_3} - z_{\text{BaTiO}_3}\).
  • Apply to another perovskite parent: \(z_{\text{BaZrO}_3} + \vec{v}_{\text{Ba}\to\text{Sr}} \approx z_{\text{SrZrO}_3}\).
  • Approximately.

When it works, when it doesn’t

  • Works: same structural family, same oxidation states, well-pretrained encoder.
  • Fails: cross-family substitutions, oxidation-state changes, multi-modal chemistry (e.g., Mn\(^{2+}\) vs Mn\(^{4+}\)).
  • Always test: pick two known examples; compute the residual.

28. Smooth Interpolation Between Chemistries

The interpolation path

\[z(t) = (1 - t)\, z_A + t\, z_B \quad t \in [0, 1]\]

  • \(t = 0\): material \(A\).
  • \(t = 1\): material \(B\).
  • Intermediate \(t\): a learned path through chemistry.

Why latent interpolation beats raw

  • Raw atom-coordinate interpolation produces nonphysical overlaps.
  • Latent interpolation stays on the manifold the encoder learned.
  • Decoded intermediate structures are physically plausible — at least more often than raw (Sandfeld et al. 2024).

29. Smooth Property Gradients

The gradient direction

  • For property \(y\), find the direction \(\hat{\mathbf{g}}_y \in \mathbb{R}^L\) along which \(\partial y / \partial z\) is largest.
  • Linear probe (slide 39) gives the direction directly: \(\hat{\mathbf{g}}_y = \nabla_z (\hat{w}_y^\top z)\) where \(\hat{w}_y\) is the linear-probe weight.

The actionable axis

  • Moving \(z \to z + \alpha \hat{\mathbf{g}}_y\) increases predicted \(y\) by approximately \(\alpha \|\hat{w}_y\|\).
  • This is the most actionable axis for design.
  • Decode the new \(z\) to read off candidate compositions.

30. Targeted Property Modification

The design move

  1. Pick a starting material with \(z_0\).
  2. Pick a target \(\Delta y\) (e.g., increase band gap by 0.5 eV).
  3. Compute \(\alpha = \Delta y / \|\hat{w}_y\|\).
  4. Move: \(z_1 = z_0 + \alpha \hat{\mathbf{g}}_y\).
  5. Decode \(z_1\) to a candidate composition / structure.

Caveats baked in

  • The predicted \(\Delta y\) is linear; the true \(\Delta y\) may be smaller, especially for large moves.
  • The decoded structure must be DFT-validated.
  • The decoder is the bottleneck (slide 28).
  • This is the cheapest possible design move, not the best.

31. The \(\text{Ba}_{1-x}\text{Sr}_x\text{TiO}_3\) Trajectory

The series

  • \(x = 0\): BaTiO\(_3\) (tetragonal at room temperature).
  • \(x = 1\): SrTiO\(_3\) (cubic at room temperature).
  • Intermediate \(x\): continuous solid solution.
  • Known phase transitions at specific \(x\).

Map as a curve in \(z\)

  • Embed each \(x\); project; trace the curve.
  • Smooth segments: continuous solid solution.
  • Kinks in the curve: phase transitions detected by the encoder.
  • The latent path recovers known phase boundaries.

32. Why Arithmetic Is a Necessary Precondition for Inverse Design

Without usable arithmetic

  • “Move toward higher band gap” has no meaning.
  • Generative models cannot navigate \(z\).
  • Acquisition functions in U13 cannot define neighbourhoods.
  • The latent space is decorative, not actionable.

With usable arithmetic

  • Property gradients become design directions.
  • Generative models (U12) sample along directions.
  • Acquisition functions (U13) place experiments on top of \(z\).
  • The latent space is the substrate for the rest of the course.

33. Limits of Arithmetic

Where arithmetic fails

  • Cross-family substitutions (perovskite \(\to\) spinel) — chemistry is multi-modal.
  • Oxidation-state changes (Mn\(^{2+} \to\) Mn\(^{4+}\)) — encoder uses different sub-modes.
  • Long compositional paths — linear extrapolation breaks far from training.
  • Encoder regions that the corpus under-covered — pretraining bias.

The discipline

  • Test before relying: pick two known endpoints; sweep; decode; check.
  • Report the residual.
  • If the residual is large, do not extrapolate — use a more sophisticated decoder (U12) or constrain the path.

§F · Failure Modes of Latent-Space Interpretation

34. The t-SNE Distance Trap

The trap

  • “Cluster A and cluster B are closer than cluster A and cluster C” on a t-SNE plot.
  • People read this as a chemistry claim.
  • It is not.

Why it’s wrong

  • t-SNE preserves local neighbourhoods; inter-cluster distances are unconstrained.
  • The same data, with a different perplexity / seed, gives different inter-cluster distances.
  • Any inter-cluster distance claim must be restated in \(z\), not in 2D (Bishop 2006; Murphy 2012).

35. Pretraining-Data Bias

The trap

  • The latent space inherits the pretraining corpus.
  • An MP-pretrained encoder over-represents oxides and stable phases.
  • Materials in under-represented regions look “novel” — but it’s a corpus artefact.

Symptoms

  • Chalcogenides “isolated” in an MP-trained \(z\): probably under-coverage, not chemistry novelty.
  • “Outliers” that come from a different lab’s calculation conventions.
  • Apparent “discoveries” in chemistry families the encoder barely saw (Neuer et al. 2024).

36. The Narrative Fallacy

The trap

  • A 2D scatter with three blobs will be told as a three-phase story.
  • Even when the blobs are projection artefacts.
  • Humans are pattern-completion machines; we cannot help seeing structure.

The defence

  • Replicate: rerun with a different projection / seed / encoder.
  • If the story changes substantially, it was narrative, not signal.
  • If the story persists, it’s partial evidence — supplement with probes.

37. The “Axis Means X” Trap

The trap

  • \(z_1\) is correlated with band gap, therefore \(z_1\) encodes band gap.”
  • The encoder did not necessarily learn band gap; the projection may have produced the correlation.
  • \(z_1\) may correlate with band gap because both correlate with electron count.

The defence

  • A latent direction “encodes” \(y\) only if a linear probe gives high \(R^2\).
  • And the probe survives a control — predicting \(y\) from a different direction not claimed to encode \(y\) should give lower \(R^2\).
  • See slide 39 for the linear-probe protocol.

38. Ablations on the Encoder

The ablation idea

  • Retrain the encoder with one input modality removed.
  • Drop composition, keep structure: does the latent map change?
  • Drop structure, keep composition: does it change?
  • The change measures what each modality contributes.

Why ablations matter

  • They convert “the latent space encodes structure” from a claim into a measurement.
  • They isolate the contribution of each input modality.
  • They are expensive (full retraining) but the information is high.

39. Linear Probes — The Quantitative Check

The protocol

  1. Freeze encoder; extract \(z\) for the corpus.
  2. Train a linear regressor \(y = w^\top z + b\) for property \(y\).
  3. Report \(R^2\) on a held-out split.
  4. Compare to a baseline: linear regressor on Magpie features.

What the comparison says

  • \(R^2_{\text{probe}} \gg R^2_{\text{Magpie}}\): encoder learned beyond composition.
  • \(R^2_{\text{probe}} \approx R^2_{\text{Magpie}}\): encoder is doing essentially Magpie’s job.
  • \(R^2_{\text{probe}} \ll R^2_{\text{Magpie}}\): encoder is worse than a hand-crafted baseline — fix it.

40. The Reproducibility Checklist for Latent-Space Claims

Five items, none optional

  1. Encoder: version + pretraining corpus.
  2. Projection: method + hyperparameters + random seed.
  3. Colour scale: what range, linear / log, clipping.
  4. One ablation: input or training-data ablation.
  5. One linear probe: with chemistry-disjoint split + Magpie baseline.

Without all five

  • The claim is decoration, not evidence.
  • The figure may be illustrative; it is not a result.
  • The exercise this afternoon will require all five.
  • The exam will test whether you can identify each item in someone else’s paper.

§G · Worked Example — MP Perovskites in Latent Space

41. Setup — The ABO\(_3\) Slice

The corpus

  • Pull all entries with stoichiometry ABO\(_3\) from MP-2024.
  • ~10 k structures across alkaline-earth, transition-metal, and rare-earth A-sites.
  • B-site: Ti, Zr, Hf, Mn, Fe, Co, Ni, Sn, Pb, etc.

The encoder + projection

  • Encoder: M3GNet-MP-2024 foundation model, 128-D embedding.
  • Projection: PCA (2D) for accountable axes; UMAP for visual clusters.
  • Frozen encoder; no fine-tuning.

42. The Map — What We See

Three lobes

  • Cubic perovskites (Pm\(\bar{3}\)m): high-symmetry; one lobe.
  • Tilted / orthorhombic / tetragonal (Pnma, P4mm, etc.): the largest lobe.
  • Hexagonal / 2H polytypes (P6\(_3\)/mmc): a smaller, separated lobe.

Sub-features

  • A bridge region between cubic and tetragonal, populated by ferroelectric solid solutions.
  • A formability frontier: an \(E_{\text{hull}}\) gradient at the periphery.
  • Per-A-site sub-clusters within each lobe.

43. Claims the Map Supports

Supportable claims (with evidence)

  • “Octahedral tilting separates from the cubic structure along a learned direction.”
  • “Stable perovskites form a connected manifold in \(z\).”
  • “The bridge region between cubic and tetragonal is rich in ferroelectrics.”

Why these are supportable

  • Each is checkable by a linear probe on a relevant labelled subset.
  • Each is replicable across projection choices.
  • Each is consistent with prior literature.

44. Claims the Map Does Not Support

Unsupportable narratives

  • “This empty corner contains undiscovered superconductors.” (Empty corner = pretraining bias, not chemistry (Neuer et al. 2024).)
  • “Perovskite \(A\) is closer to perovskite \(B\) than to perovskite \(C\).” (UMAP distance, not \(z\) distance.)
  • “Axis 1 is the band-gap axis.” (Correlation, not encoding.)

The discipline

  • Stating what a map cannot claim is itself a contribution.
  • The over-claims of slide 44 appear in published papers.
  • Recognising them in others’ work — and avoiding them in yours — is the unit’s transferable skill.

45. From Map to Lab — The Discovery Move

The end-to-end loop

  1. Pick a property target (e.g., band gap 1.5–2.0 eV, \(E_{\text{hull}} < 50\) meV/atom).
  2. Find the property gradient direction \(\hat{\mathbf{g}}_y\) via linear probe.
  3. Move from a starting perovskite along \(\hat{\mathbf{g}}_y\).
  4. Decode candidate (composition + space-group hypothesis).
  5. DFT-validate top-\(k\).
  6. Synthesise top 1–2.

The closed loop

  • Steps 1–4: today.
  • Step 5: U4 (DFT) and U13 (uncertainty over candidates).
  • Step 6: experimental partners.
  • The loop closes by feeding new (x, y) pairs back to the encoder and the regression.

46. Reproducing the Figure

The exercise repo

  • notebook/perovskite_latent.ipynb — full case-study notebook.
  • ~30 lines of Python end-to-end.
  • Encoder: from_pretrained() one-liner.
  • Projection + colouring + probe + ablation: ten lines each.

Reproducibility is a requirement, not a bonus

  • Every figure on slides 41–45 is regenerated from the notebook.
  • Random seeds are fixed.
  • Hyperparameters are explicit.
  • Linear-probe baseline is included.
  • This is the minimum standard the exercise expects.

§H · Wrap-Up

47. When Latent Visualisation Helps, When It Misleads

Helps

  • Hypothesis generation: which chemistry families to target.
  • Phase discovery: novel polymorphs and overlooked materials.
  • Design-move proposal: starting points for inverse design.
  • Communicating a complicated chemistry story in one figure.

Misleads when

  • Treated as ground truth (“we discovered X”).
  • Distances reported uncritically (“A is closer to B than C”).
  • Narratives outpace probes (“axis 1 is band gap”).
  • Pretraining corpus ignored (“this material is novel”).

48. Forward — Unit 12 (Clustering vs Discovery)

What U12 does next

  • Partition the latent space into chemistry-family clusters.
  • K-means / GMM / HDBSCAN on \(z\).
  • Cluster validation: silhouette, BIC, persistence.
  • Cluster meaning: post-hoc inspection of exemplars.

Plus the bridge to generative

  • VAE / diffusion as decoders for \(z\) — the gap that today’s interpolation slides could not fill.
  • Inverse design: from \(z_{\text{target}}\) to a synthesisable structure.
  • This is U12’s second half.

49. Forward — Unit 13 (Uncertainty-Aware Discovery)

What U13 does next

  • Place a Gaussian process over \(z\): \(y(z) \sim \text{GP}(\mu, k)\).
  • Acquisition functions: UCB, EI, Thompson sampling.
  • The GP gives both prediction and uncertainty.
  • Acquisition picks the next experiment by trading off both.

Why the latent space matters

  • The GP kernel \(k(z, z')\) is a learned metric on chemistry — not Euclidean atom-distance.
  • Uncertainty over \(z\) is uncertainty over chemistry, properly calibrated.
  • The acquisition policy moves through \(z\), exactly as today’s design moves did.
  • Today’s foundation \(\to\) U13’s closed loop.

50. Exercise + Reading Assignment

Exercise (90 min, this afternoon)

  1. Pull a precomputed M3GNet embedding of an MP slice.
  2. Project to 2D with PCA, t-SNE (sweep perplexity), UMAP (sweep n_neighbors).
  3. Colour by formation energy and one of {band gap, density, space group}.
  4. Identify one chemistry family that clusters cleanly; one that does not. Explain.
  5. Run a linear probe for one property; compare to Magpie baseline.
  6. Document one failure of interpretation.

Reading for next week

  • Murphy (2012) Ch 12 (continuous latent variables) — sections 12.1–12.2 are sufficient.
  • Bishop (2006) §12.3 (probabilistic latent variable models).
  • Sandfeld et al. (2024), Ch 19 case studies on AE / latent-space materials applications.
  • McClarren (2021) Ch 8 (autoencoders) for an engineering-style refresher.
  • Optional: Neuer et al. (2024) §5.5 for anomaly detection in latent spaces.

Next week (Unit 12): clustering and generative use of \(z\).

The single sentence to leave with: the materials latent space is a map, not a fact — read it carefully, navigate it deliberately, and challenge it always.

Bishop, Christopher M. 2006. Pattern Recognition and Machine Learning. Springer.
McClarren, Ryan G. 2021. Machine Learning for Engineers: Using Data to Solve Problems for Physical Systems. Springer.
Murphy, Kevin P. 2012. Machine Learning: A Probabilistic Perspective. MIT Press.
Neuer, Michael et al. 2024. Machine Learning for Engineers: Introduction to Physics-Informed, Explainable Learning Methods for AI in Engineering Applications. Springer Nature.
Sandfeld, Stefan et al. 2024. Materials Data Science. Springer.

Continue