Evaluation Metrics

Understand the 24 metrics used for model evaluation across 4 categories

Filter by Category

Showing 24 of 24 metrics

NMI

Normalized Mutual Information

Measures the mutual information between predicted clusters and true labels, normalized by the entropy of the two distributions

Range: [0, 1]Higher better

ARI

Adjusted Rand Index

Similarity between predicted and true clusters adjusted for chance

Range: [-1, 1]Higher better

ASW

Silhouette Score (Average)

Average silhouette coefficient measuring cluster cohesion and separation

Range: [-1, 1]Higher better

DAV

Davies-Bouldin Index

Average similarity between each cluster and its most similar neighboring cluster

Range: [0, ∞]Lower better

CAL

Calinski-Harabasz Index

Ratio of between-cluster to within-cluster dispersion

Range: [0, ∞]Higher better

COR

Correlation-Based Distance

Measure based on Pearson correlation of latent representations

Range: [0, 1]Higher better

UMAP_Dist

UMAP Distance Correlation

Spearman correlation between latent space and UMAP-reduced pairwise distance matrices

Range: [-1, 1]Higher better

UMAP_Q_local

UMAP Local Structure Quality

Average coranking quality for local neighborhoods in UMAP space

Range: [0, 1]Higher better

UMAP_Q_global

UMAP Global Structure Quality

Average coranking quality for global relationships in UMAP space

Range: [0, 1]Higher better

UMAP_Overall

UMAP Overall Embedding Quality

Comprehensive UMAP quality combining distance correlation, local and global preservation

Range: [0, 1]Higher better

tSNE_Dist

t-SNE Distance Correlation

Spearman correlation between latent space and t-SNE-reduced pairwise distance matrices

Range: [-1, 1]Higher better

tSNE_Q_local

t-SNE Local Structure Quality

Average coranking quality for local neighborhoods in t-SNE space

Range: [0, 1]Higher better

tSNE_Q_global

t-SNE Global Structure Quality

Average coranking quality for global relationships in t-SNE space

Range: [0, 1]Higher better

tSNE_Overall

t-SNE Overall Embedding Quality

Comprehensive t-SNE quality combining distance correlation, local and global preservation

Range: [0, 1]Higher better

Manifold_Dim

Manifold Dimensionality Efficiency

Multi-method dimensionality efficiency score combining variance thresholds, Kaiser criterion, elbow detection, and spectral decay

Range: [0, 1]Higher better

Spectral_Decay

Spectral Decay Rate

Rate of eigenvalue decay indicating information concentration in leading dimensions

Range: [0, 1]Higher better

Part_Ratio

Participation Ratio Score

Effective dimensionality measure from eigenvalue distribution (trajectory: lower is better, steady-state: higher is better)

Range: [0, 1]Higher better

Anisotropy

Anisotropy Score

Multi-method anisotropy combining log-ellipticity, condition numbers, ratio variance, entropy, dominance, and effective dimensionality

Range: [0, 1]Higher better

Traj_Dir

Trajectory Directionality

Dominance of primary developmental axis relative to other directions

Range: [0, 1]Higher better

Noise_Resil

Noise Resilience

Signal-to-noise ratio based on leading vs. trailing PCA components

Range: [0, 1]Higher better

Core_Quality

Core Latent Space Quality

Fundamental quality score combining manifold, spectral, participation, and anisotropy metrics

Range: [0, 1]Higher better

Overall_LSE

Overall Intrinsic Quality

Comprehensive latent space quality with data-type-aware weighting

Range: [0, 1]Higher better

Train_Time

Training Time

Total time to train the model on the dataset

Range: [0, ∞]Lower better

Inference_Time

Inference Time

Time to embed all cells through the trained encoder

Range: [0, ∞]Lower better

Metric Categories

C Clustering & Cell Type Discovery

Supervised metrics comparing predicted clusters to ground truth labels

6 metrics

E Embedding Quality (UMAP & t-SNE)

Visualization quality via coranking analysis (4 metrics × 2 methods)

8 metrics

L Intrinsic Latent Space (LSE)

Unsupervised geometric, spectral, and topological properties

8 metrics

R Computational Efficiency

Training and inference performance

2 metrics