training-dynamics

Here are 26 public repositories matching this topic...

google / neural-tangents

Fast and Easy Infinite Neural Networks in Python

kernel neural-networks gradient-descent bayesian-inference gaussian-processes bayesian-networks deep-networks gradient-flow jax infinite-networks training-dynamics neural-tangents kernel-computation

Updated Mar 1, 2024
Jupyter Notebook

CVL-UESTC / FR-INR

Star

CVPR 2024-Improved Implicit Neural Representation with Fourier Reparameterized Training

machine-learning deep-learning novel-view-synthesis training-dynamics implicit-neural-representation

Updated May 23, 2025
Python

CVL-UESTC / IGA-INR

Star

ICML2025-Inductive Gradient Adjustment for Spectral Bias in Implicit Neural Representations

machine-learning deep-learning gradient-descent optimization-methods training-dynamics neural-tangent-kernel implicit-neural-representation icml-2025

Updated May 31, 2025
Jupyter Notebook

jjbrophy47 / instance_based_interpretability

Star

Existing literature about training-data analysis.

interpretability explainable-ai xai training-dynamics instance-based influence-estimation data-maps

Updated Dec 17, 2021

mainlp / explaind

Star

A unified framework for attributing model components, data, and training dynamics to model behavior.

neural-networks interpretability explainability training-dynamics data-attribution model-attribution kernel-machines

Updated Sep 30, 2025
Python

liuyz0 / FOCUS

Star

Official repository for "FOCUS: First Order Concentrated Updating Scheme"

optimizer training-dynamics

Updated Jan 22, 2025
Jupyter Notebook

pulkitgopalani / tf-loss-plateau

Star

Code for "What Happens During the Loss Plateau? Understanding Abrupt Learning in Transformers" (NeurIPS 2025)

transformer interpretability training-dynamics abrupt-learning

Updated Oct 22, 2025
Python

marlo-z / reversal_curse_analysis

Star

Code for 'Towards a Theoretical Understanding of the 'Reversal Curse' via Training Dynamics'

transformers gpt-2 training-dynamics large-language-models

Updated Jan 17, 2025
Python

braingpt-lovelab / backwards

Star

Source code for <Probability Consistency in Large Language Models: Theoretical Foundations Meet Empirical Discrepancies>

language-learning tokenization training-dynamics scientific-discovery large-language-models

Updated Jun 9, 2025
Jupyter Notebook

dgcnz / relaxed-equivariance-dynamics

Star

Code for "Effect of equivariance on training dynamics"

deep-learning geometric-deep-learning equivariance training-dynamics

Updated Jan 10, 2025
Python

cosmaadrian / nli-stress-test

Star

Official repository for the EMNLP 2024 paper "How Hard is this Test Set? NLI Characterization by Exploiting Training Dynamics"

natural-language-inference roberta training-dynamics data-maps deberta-v3 area-under-margin

Updated May 20, 2025
Python

External LLM intelligence monitors & diagnoses MoE expert ecology during training — preventing routing collapse without auxiliary loss engineering. 16 Experts, 3 Tiers, Top-2 Gating, Claude-in-the-Loop.

deep-learning routing pytorch moe cifar100 mixture-of-experts training-dynamics claude-api ai-in-the-loop routing-collapse expert-collapse expert-ecology

Updated Jun 1, 2026
Python

Cmouzouni / three-phases-moe

Star

Code and data for: Three Phases of Expert Routing — How Load Balance Evolves During MoE Training

deep-learning transformer moe research-paper load-balancing mixture-of-experts training-dynamics phase-transitions mean-field-games llm-architecture sparse-models expert-routing

Updated Apr 5, 2026
Python

tiexinding / NPM-K-public

Star

Cross-Family Convergence of Neural Network Weight Skeletons. Companion to Zenodo paper (10.5281/zenodo.19652706).

transformers neural-networks pythia percolation weibull-distribution training-dynamics scaling-laws llm olmo

Updated Apr 20, 2026
TeX

RayoHQ / attention-binding-a11y

Star

TMLR 2026 | Mechanistic interpretability: attention-head binding (EB*) as a marker of concept emergence. 7 models, 5 architectures (Pythia 160M–2.8B, OLMo-1B, CRFM GPT-2, SmolLM3-3B, Qwen2.5-1.5B), 41 terms.

nlp accessibility transformers language-models pythia interpretability few-shot-learning training-dynamics attention-heads mechanistic-interpretability qwen tmlr olmo smollm3 tmlr-2026 concept-emergence crfm

Updated Jun 9, 2026
Python

pulkitgopalani / tf-matcomp

Star

Code for "Abrupt Learning in Transformers: A Case Study on Matrix Completion" (NeurIPS 2024)

transformer interpretability training-dynamics abrupt-learning

Updated Apr 17, 2025
Python

basyirin-dev / sigmaflow-pde

Star

σFlow-PDE: A drop-in H-Bar training engine that escapes the σ-trap in neural PDE solvers via live σ/δ/α ODE integration, autonomous phase curriculum, and auto-falsification.

pde-solver training-dynamics fno compositional-generalization ood-generalization deeponet neural-operators physics-ml reproducible-ml h-bar-framework

Updated May 16, 2026
Python

FynnGerding / SMI-training-dynamics

Star

Reimplementation of the Sliced Information Plane (SIP) framework from Wongso, Ghosh, and Motani (2025) for analyzing deep neural network training dynamics. The repo uses Sliced Mutual Information (SMI) to obtain scalable, finite dependence estimates in high‑dimensional, deterministic settings, and applies them to MNIST MLP experiments.

sip pytorch mutual-information smi training-dynamics

Updated Feb 12, 2026
Python

LaoZhongjie / rnn-chaos-dynamics

Star

A research project investigating how LSTM training dynamics relate to dynamical stability and order–chaos transitions through Finite-Time Lyapunov Exponent (FTLE) analysis.

machine-learning research deep-learning pytorch lstm rnn dynamical-systems ftle training-dynamics edge-of-chaos

Updated Mar 31, 2026
Python

zfifteen / noether-early-warning

Sponsor

Star

Atomic benchmark suite showing drift can act as an early warning before direct symmetry detection in gradual-breaking regimes, with reversal controls, finite-budget sensitivity tests, and exact alarm-time validation.

benchmarking machine-learning optimization pytorch scientific-computing neural-networks dynamical-systems symmetry-breaking change-detection drift-detection training-dynamics early-warning

Updated Apr 6, 2026
Python

Improve this page

Add a description, image, and links to the training-dynamics topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the training-dynamics topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

training-dynamics

Here are 26 public repositories matching this topic...

google / neural-tangents

CVL-UESTC / FR-INR

CVL-UESTC / IGA-INR

jjbrophy47 / instance_based_interpretability

mainlp / explaind

liuyz0 / FOCUS

pulkitgopalani / tf-loss-plateau

marlo-z / reversal_curse_analysis

braingpt-lovelab / backwards

dgcnz / relaxed-equivariance-dynamics

cosmaadrian / nli-stress-test

zqj323 / expert-ecology

Cmouzouni / three-phases-moe

tiexinding / NPM-K-public

RayoHQ / attention-binding-a11y

pulkitgopalani / tf-matcomp

basyirin-dev / sigmaflow-pde

FynnGerding / SMI-training-dynamics

LaoZhongjie / rnn-chaos-dynamics

zfifteen / noether-early-warning

Improve this page

Add this topic to your repo