Evaluation Metrics

Understand the 24 metrics used for model evaluation across 4 categories

NMI (Normalized Mutual Information): Mutual information between predicted clusters and true labels, normalized by the entropies of the two label distributions.

Range: [0, 1]. Higher is better.

ARI (Adjusted Rand Index): Similarity between predicted and true clusters, adjusted for chance agreement.

Range: [-1, 1]. Higher is better.
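
The two label-agreement scores above can be sketched in plain Python. This is a minimal illustration, not the evaluation code behind this page: it assumes NMI is normalized by the arithmetic mean of the two entropies (other normalizations exist), and the function names are hypothetical.

```python
from collections import Counter
from math import comb, log


def entropy(labels):
    """Shannon entropy (natural log) of a label sequence."""
    n = len(labels)
    return -sum((c / n) * log(c / n) for c in Counter(labels).values())


def nmi(true, pred):
    """Mutual information divided by the mean of the two entropies (assumed normalization)."""
    n = len(true)
    joint = Counter(zip(true, pred))
    ct, cp = Counter(true), Counter(pred)
    mi = sum((c / n) * log(c * n / (ct[t] * cp[p])) for (t, p), c in joint.items())
    denom = (entropy(true) + entropy(pred)) / 2
    return mi / denom if denom > 0 else 1.0


def ari(true, pred):
    """Rand index over sample pairs, corrected for the agreement expected by chance."""
    n = len(true)
    sum_joint = sum(comb(c, 2) for c in Counter(zip(true, pred)).values())
    sum_true = sum(comb(c, 2) for c in Counter(true).values())
    sum_pred = sum(comb(c, 2) for c in Counter(pred).values())
    expected = sum_true * sum_pred / comb(n, 2)
    max_index = (sum_true + sum_pred) / 2
    if max_index == expected:  # degenerate case, e.g. one cluster on both sides
        return 1.0
    return (sum_joint - expected) / (max_index - expected)


# Same partition with swapped cluster names: both scores ignore the naming.
same_nmi = nmi([0, 0, 1, 1], [1, 1, 0, 0])
same_ari = ari([0, 0, 1, 1], [1, 1, 0, 0])
# A partition that cuts across the true labels scores below chance on ARI.
crossed_ari = ari([0, 0, 1, 1], [0, 1, 0, 1])
```

Both scores depend only on how samples are grouped, so relabeling the clusters leaves them unchanged.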

ASW (Average Silhouette Width): Mean silhouette coefficient, measuring cluster cohesion and separation.

Range: [-1, 1]. Higher is better.

DAV (Davies-Bouldin index): Average similarity between each cluster and its most similar neighboring cluster.

Range: [0, ∞). Lower is better.

CAL (Calinski-Harabasz index): Ratio of between-cluster to within-cluster dispersion.

Range: [0, ∞). Higher is better.
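
The three internal validity scores above (silhouette, Davies-Bouldin, Calinski-Harabasz) need only the points and their cluster assignments. The sketch below follows the textbook definitions with Euclidean distance and NumPy; it assumes at least two clusters and is not necessarily how this page computes them. The function name `cluster_validity` is hypothetical.

```python
import numpy as np


def cluster_validity(X, labels):
    """Return (silhouette, davies_bouldin, calinski_harabasz) for a labeled point set."""
    X = np.asarray(X, dtype=float)
    labels = np.asarray(labels)
    ids = np.unique(labels)
    n, k = len(X), len(ids)
    cents = np.array([X[labels == c].mean(axis=0) for c in ids])
    sizes = np.array([(labels == c).sum() for c in ids])

    # Calinski-Harabasz: between-cluster over within-cluster dispersion.
    between = (sizes * ((cents - X.mean(axis=0)) ** 2).sum(axis=1)).sum()
    within = sum(((X[labels == c] - cents[i]) ** 2).sum() for i, c in enumerate(ids))
    cal = (between / (k - 1)) / (within / (n - k))

    # Davies-Bouldin: each cluster's worst (scatter_i + scatter_j) / centroid distance.
    scatter = np.array(
        [np.linalg.norm(X[labels == c] - cents[i], axis=1).mean() for i, c in enumerate(ids)]
    )
    cent_dist = np.linalg.norm(cents[:, None] - cents[None, :], axis=2)
    np.fill_diagonal(cent_dist, np.inf)  # a cluster is never compared with itself
    dav = ((scatter[:, None] + scatter[None, :]) / cent_dist).max(axis=1).mean()

    # Silhouette: (b - a) / max(a, b) per point; singleton clusters score 0.
    D = np.linalg.norm(X[:, None] - X[None, :], axis=2)
    sil = np.zeros(n)
    for i in range(n):
        own = labels == labels[i]
        if own.sum() == 1:
            continue
        a = D[i, own].sum() / (own.sum() - 1)
        b = min(D[i, labels == c].mean() for c in ids if c != labels[i])
        sil[i] = (b - a) / max(a, b)
    return float(sil.mean()), float(dav), float(cal)


# Two tight, well-separated pairs of points.
asw, dav, cal = cluster_validity([[0, 0], [0, 1], [10, 0], [10, 1]], [0, 0, 1, 1])
```

On this toy input the clusters are compact and far apart, so the silhouette is close to 1, Davies-Bouldin is small, and Calinski-Harabasz is large.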

COR: Measure based on the Pearson correlation of latent representations.

Range: [0, 1]. Higher is better.

UMAP_Dist: Spearman correlation between the pairwise distance matrices of the latent space and of its UMAP embedding.

Range: [-1, 1]. Higher is better.
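
A distance-correlation score of this kind can be sketched with NumPy alone. The snippet assumes Euclidean distances and uses a stand-in array for the 2-D embedding; in practice the embedding would come from a UMAP or t-SNE run. The helper names are hypothetical, and the same function applies unchanged to the t-SNE variant.

```python
import numpy as np


def condensed_distances(points):
    """Euclidean distances for every unordered pair of rows."""
    pts = np.asarray(points, dtype=float)
    dist = np.sqrt(((pts[:, None, :] - pts[None, :, :]) ** 2).sum(axis=2))
    return dist[np.triu_indices(len(pts), k=1)]


def average_ranks(values):
    """Ranks starting at 0, with tied values sharing their mean rank."""
    order = np.argsort(values, kind="stable")
    ranks = np.empty(len(values))
    ranks[order] = np.arange(len(values))
    _, inverse, counts = np.unique(values, return_inverse=True, return_counts=True)
    mean_rank = np.bincount(inverse, weights=ranks) / counts
    return mean_rank[inverse]


def distance_spearman(latent, embedding):
    """Spearman correlation = Pearson correlation of the ranked distances."""
    r_latent = average_ranks(condensed_distances(latent))
    r_embed = average_ranks(condensed_distances(embedding))
    return float(np.corrcoef(r_latent, r_embed)[0, 1])


rng = np.random.default_rng(0)
latent = rng.normal(size=(30, 2))
scaled = 2.0 * latent  # stand-in "embedding": uniform scaling keeps the distance order
rho_scaled = distance_spearman(latent, scaled)
rho_random = distance_spearman(latent, rng.normal(size=(30, 2)))
```

Because the score is rank-based, it rewards embeddings that keep the ordering of distances, not their absolute scale.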

UMAP_Q_local: Average coranking quality for local neighborhoods in UMAP space.

Range: [0, 1]. Higher is better.

UMAP_Q_global: Average coranking quality for global relationships in UMAP space.

Range: [0, 1]. Higher is better.
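
One common way to obtain local and global coranking scores is the Q_NX curve from the coranking framework of Lee and Verleysen: Q_NX(K) is the share of K-nearest neighbors that are preserved, and the neighborhood size K_max that maximizes Q_NX(K) - K/(N-1) separates "local" from "global". Whether this page uses exactly that split is an assumption; the sketch below (hypothetical function names, Euclidean distance) only illustrates the idea.

```python
import numpy as np


def rank_matrix(points):
    """rank[i, j] = position of j among the neighbors of i (self is rank 0)."""
    pts = np.asarray(points, dtype=float)
    dist = np.linalg.norm(pts[:, None, :] - pts[None, :, :], axis=2)
    np.fill_diagonal(dist, -1.0)  # forces each point to rank itself first
    return dist.argsort(axis=1, kind="stable").argsort(axis=1, kind="stable")


def coranking_quality(high, low):
    """Return (q_local, q_global, k_max) from the coranking matrix of two spaces."""
    r_high, r_low = rank_matrix(high), rank_matrix(low)
    n = r_high.shape[0]
    off_diag = ~np.eye(n, dtype=bool)
    corank = np.zeros((n - 1, n - 1))
    # corank[k, l] counts pairs with rank k+1 in `high` and rank l+1 in `low`.
    np.add.at(corank, (r_high[off_diag] - 1, r_low[off_diag] - 1), 1)

    ks = np.arange(1, n)
    # Sum of the top-left K x K block for every K, via a 2-D cumulative sum.
    preserved = np.diag(corank.cumsum(axis=0).cumsum(axis=1))
    q_nx = preserved / (ks * n)
    lcmc = q_nx - ks / (n - 1)  # gain over a random embedding
    k_max = int(np.argmax(lcmc)) + 1
    q_local = float(q_nx[:k_max].mean())       # K = 1 .. k_max
    q_global = float(q_nx[k_max - 1:].mean())  # K = k_max .. N-1
    return q_local, q_global, k_max


rng = np.random.default_rng(1)
latent = rng.normal(size=(40, 5))
perfect = coranking_quality(latent, 2.0 * latent)  # neighbor ranks unchanged
lossy = coranking_quality(latent, latent[:, :2])   # drop three latent dimensions
```

An embedding that preserves every neighbor rank scores 1 on both measures; discarding dimensions typically lowers them.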

UMAP_Overall: Composite UMAP quality score combining distance correlation with local and global preservation.

Range: [0, 1]. Higher is better.

tSNE_Dist: Spearman correlation between the pairwise distance matrices of the latent space and of its t-SNE embedding.

Range: [-1, 1]. Higher is better.

tSNE_Q_local: Average coranking quality for local neighborhoods in t-SNE space.

Range: [0, 1]. Higher is better.

tSNE_Q_global: Average coranking quality for global relationships in t-SNE space.

Range: [0, 1]. Higher is better.

tSNE_Overall: Composite t-SNE quality score combining distance correlation with local and global preservation.

Range: [0, 1]. Higher is better.

Manifold_Dim: Multi-method dimensionality efficiency score combining variance thresholds, the Kaiser criterion, elbow detection, and spectral decay.

Range: [0, 1]. Higher is better.

Spectral_Decay: Rate of eigenvalue decay, indicating how strongly information is concentrated in the leading dimensions.

Range: [0, 1]. Higher is better.

Part_Ratio: Effective dimensionality measure derived from the eigenvalue distribution.

Range: [0, 1]. Higher is better for steady-state data; lower is better for trajectory data.
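
The standard participation ratio of a covariance spectrum is PR = (Σλ)² / Σλ², which runs from 1 (all variance on one axis) to d (variance spread evenly over d axes). The sketch below divides by d to land in (0, 1]; that normalization is an assumption, since this page does not state how its score is scaled, and the function name is hypothetical.

```python
import numpy as np


def participation_ratio(latent, normalize=True):
    """Participation ratio of the covariance eigenvalues of an (n_cells, n_dims) matrix."""
    Z = np.asarray(latent, dtype=float)
    cov = np.atleast_2d(np.cov(Z, rowvar=False))
    eig = np.clip(np.linalg.eigvalsh(cov), 0.0, None)  # drop tiny negative round-off
    pr = eig.sum() ** 2 / (eig ** 2).sum()
    return float(pr / len(eig)) if normalize else float(pr)


# Variance split evenly over both axes versus variance on a single line.
isotropic = participation_ratio([[1, 0], [-1, 0], [0, 1], [0, -1]])
collinear = participation_ratio([[1, 1], [-1, -1], [2, 2], [-2, -2]])
```

A latent space dominated by one direction, as expected for a single trajectory, gives a low value, which is consistent with the direction of "better" depending on the data type.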

Anisotropy: Multi-method anisotropy score combining log-ellipticity, condition numbers, ratio variance, entropy, dominance, and effective dimensionality.

Range: [0, 1]. Higher is better.

Traj_Dir: Dominance of the primary developmental axis relative to other directions.

Range: [0, 1]. Higher is better.

Noise_Resil: Signal-to-noise ratio based on the leading versus trailing PCA components.

Range: [0, 1]. Higher is better.

Core_Quality: Fundamental quality score combining the manifold, spectral, participation, and anisotropy metrics.

Range: [0, 1]. Higher is better.

Overall_LSE: Composite latent space quality score with data-type-aware weighting.

Range: [0, 1]. Higher is better.

Train_Time: Total time to train the model on the dataset.

Range: [0, ∞). Lower is better.

Inference_Time: Time to embed all cells through the trained encoder.

Range: [0, ∞). Lower is better.
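
Wall-clock measurements like the two above can be taken with a small wrapper around the standard library's monotonic timer. This is a generic sketch: `timed` and `fake_encode` are hypothetical names standing in for whatever training or encoding call is being measured, and the unit here is seconds.

```python
import time


def timed(fn, *args, **kwargs):
    """Run fn and return (result, elapsed_seconds) using a monotonic clock."""
    start = time.perf_counter()
    result = fn(*args, **kwargs)
    return result, time.perf_counter() - start


def fake_encode(cells):
    """Placeholder for an encoder forward pass over all cells."""
    return [2 * c for c in cells]


embedding, inference_seconds = timed(fake_encode, [1, 2, 3])
```

`time.perf_counter` is preferred over `time.time` for intervals because it is not affected by system clock adjustments.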

Metric Categories

C Clustering & Cell Type Discovery

Agreement between predicted clusters and ground-truth labels (NMI, ARI), alongside internal cluster-validity and correlation measures (ASW, DAV, CAL, COR)

6 metrics

E Embedding Quality (UMAP & t-SNE)

Visualization quality via coranking analysis (4 metrics × 2 methods)

8 metrics

L Intrinsic Latent Space (LSE)

Unsupervised geometric, spectral, and topological properties

8 metrics

R Computational Efficiency

Training and inference performance

2 metrics

Metrics by Category

C Clustering & Cell Type Discovery: NMI, ARI, ASW, DAV, CAL, COR

E Embedding Quality (UMAP & t-SNE): UMAP_Dist, UMAP_Q_local, UMAP_Q_global, UMAP_Overall, tSNE_Dist, tSNE_Q_local, tSNE_Q_global, tSNE_Overall

L Intrinsic Latent Space (LSE): Manifold_Dim, Spectral_Decay, Part_Ratio, Anisotropy, Traj_Dir, Noise_Resil, Core_Quality, Overall_LSE

R Computational Efficiency: Train_Time, Inference_Time