Instructor of Mathematics
Miami University, Department of Mathematical and Physical Sciences. Teaching mathematics while building classroom-first AI tools and public mathematical reasoning artifacts.
I build mathematical reasoning systems, benchmark datasets, classroom AI interfaces, and reproducible research artifacts. My current work asks how much better fixed-weight models become when the runtime around them is engineered carefully: what they read, how memory is certified, how verification roles apply pressure, and how final answers are emitted as auditable objects.
This page is the canonical public profile for research collaboration, portfolio review, homepage requests, and professional context around Neohm Labs.
Creator and founder of Neohm Labs, the public studio behind AthenaV5, AEN, Vault of Echoes, Canon DSL, and related reasoning infrastructure.
Mathematical reasoning systems; exact-answer evaluation; long-context model serving; multi-role solver/verifier protocols; Runtime-at-Boot and certified context loading; benchmark curation; synthetic mathematical data; and classroom AI interfaces that make reasoning work inspectable.
Public work is listed with artifact boundaries where relevant. AEN results distinguish blind diagnostics from answer-aware replay and context-recall evidence.
Preprint introducing AEN as a runtime architecture for exact-answer mathematical reasoning with fixed-weight language models. Covers Runtime-at-Boot certification, role-specific memory, the Athena-Aria-Artemis triad, controller-owned finalization, Canon v2.1 distillation, and artifact-based evaluation.
Metadata-First Distillation for Synthetic Mathematical Data. A YAML-style schema for converting solved mathematical problems into structured records with objects, givens, asks, invariants, theorem roles, answer normalization, and generation lineage.
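As a rough illustration of what a metadata-first record could look like, here is a minimal sketch. The field names mirror the categories listed above, but the types, the example problem, and the class name `ProblemRecord` are assumptions made for illustration, not the published schema. JSON from the standard library stands in for the YAML-style serialization so the sketch has no dependencies.

```python
from dataclasses import dataclass, field, asdict
import json


@dataclass
class ProblemRecord:
    """Hypothetical record shape; only the field categories come from the text."""
    objects: list[str]                # mathematical objects in the problem
    givens: list[str]                 # stated facts
    asks: list[str]                   # what must be found
    invariants: list[str]             # relations that hold throughout the solution
    theorem_roles: dict[str, str]     # theorem name -> role it plays
    answer_normalization: str         # expected canonical answer form
    generation_lineage: dict[str, str] = field(default_factory=dict)


# An illustrative, hand-written record (not taken from any dataset).
record = ProblemRecord(
    objects=["right triangle ABC"],
    givens=["AB = 3", "BC = 4", "angle B = 90 degrees"],
    asks=["length of AC"],
    invariants=["AB^2 + BC^2 = AC^2"],
    theorem_roles={"Pythagorean theorem": "relates the legs to the hypotenuse"},
    answer_normalization="positive integer",
    generation_lineage={"source": "hand-written example"},
)

# Serialize the structured record; a YAML dumper could be swapped in here.
serialized = json.dumps(asdict(record), indent=2)
```

Keeping the metadata in typed fields rather than free text means each record can be checked for completeness before it is used to generate new problems.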
Public-answer 25-problem mathematical reasoning benchmark with parquet/csv data, public key, scorer, sample submission, checksum ledger, and benchmark use policy. Hugging Face DOI: 10.57967/hf/8554.
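The sketch below shows one way an exact-answer scorer and a checksum check might fit together. It is not the scorer shipped with the benchmark: the normalization rule (numeric answers compared as exact fractions, everything else compared as lowercased text), the function names, and the use of SHA-256 are all assumptions chosen for illustration.

```python
import hashlib
from fractions import Fraction


def normalize_answer(raw: str) -> str:
    """Map an answer string to a canonical form (assumed rule, scalar answers only)."""
    text = raw.strip()
    try:
        # Treat thousands separators and inner spaces as noise for numeric answers.
        return str(Fraction(text.replace(",", "").replace(" ", "")))
    except (ValueError, ZeroDivisionError):
        # Non-numeric answers fall back to a case-insensitive text comparison.
        return text.lower()


def score(submission: dict[str, str], key: dict[str, str]) -> float:
    """Fraction of key problems answered exactly; missing answers count as wrong."""
    correct = sum(
        1
        for problem_id, expected in key.items()
        if problem_id in submission
        and normalize_answer(submission[problem_id]) == normalize_answer(expected)
    )
    return correct / len(key)


def sha256_hex(data: bytes) -> str:
    """Hex digest used to compare a file's bytes against a ledger entry."""
    return hashlib.sha256(data).hexdigest()


def matches_ledger(data: bytes, ledger_digest: str) -> bool:
    """True when the data hashes to the digest recorded in the ledger."""
    return sha256_hex(data) == ledger_digest.strip().lower()
```

For example, under these assumed rules `normalize_answer("1,024")` and `normalize_answer("1024")` agree, as do `"2/4"` and `"0.5"`, so formatting differences do not count as wrong answers.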
A Zenodo benchmark record associated with direct UI-native mathematical reasoning comparisons and the Vault of Echoes evaluation lineage. Coauthored with P. Acharya.
Lore-infused puzzle codex and source archive. Public source archive DOI: 10.5281/zenodo.18207613. The PDF is also available through the book hub on this site.
The portfolio is not one demo. It is a connected set of research systems, datasets, papers, notebooks, and public deployment surfaces.
Triadic solver/verifier/agent protocol, Runtime-at-Boot, and controller-owned exact-answer finalization.
Live teaching and reasoning portal at portal.neohmlabs.com/AEN5.
Puzzle codex and public benchmark family for lore-heavy exact-answer reasoning.
Metadata-first mathematical data distillation and synthetic problem generation schema.
Kaggle dataset and boot-memory/certification surface supporting AEN experiments.
Public two-body solver/verifier evaluator lineage that preceded the full AEN triad.
The most important habit in this work is labeling what a result is. A blind benchmark, an answer-aware replay, and a context-recall diagnostic are different scientific objects.
VoE-2026 is a public-answer dataset. That makes it valuable for reproducible scoring and independent verification, but post-exposure scores should disclose that exposure and should not be presented as held-out benchmark performance.
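One way to make this labeling habit mechanical is to attach the evidence type and exposure status to every reported score. The following is a minimal sketch under stated assumptions: the class names, fields, and the rule in `may_claim_held_out` are hypothetical, and the numbers are placeholders rather than real results.

```python
from dataclasses import dataclass
from enum import Enum


class EvidenceKind(Enum):
    """The three kinds of result distinguished in the text."""
    BLIND = "blind"
    ANSWER_AWARE_REPLAY = "answer-aware replay"
    CONTEXT_RECALL = "context-recall diagnostic"


@dataclass(frozen=True)
class ResultReport:
    """Hypothetical report object that carries its own label."""
    benchmark: str
    correct: int
    total: int
    kind: EvidenceKind
    answers_exposed: bool  # True if the system could have seen the public key

    def may_claim_held_out(self) -> bool:
        # Assumed rule: only blind runs with no answer exposure qualify.
        return self.kind is EvidenceKind.BLIND and not self.answers_exposed

    def summary(self) -> str:
        exposure = "post-exposure" if self.answers_exposed else "no known exposure"
        return (
            f"{self.benchmark}: {self.correct}/{self.total} "
            f"({self.kind.value}, {exposure})"
        )


# Placeholder values for illustration only; not a reported result.
report = ResultReport(
    benchmark="example-benchmark",
    correct=3,
    total=5,
    kind=EvidenceKind.ANSWER_AWARE_REPLAY,
    answers_exposed=True,
)
```

Because the label travels with the score, a summary line cannot be quoted without also stating what kind of evidence it represents.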
For research collaboration, classroom AI pilots, benchmark work, artifact review, or Neohm Labs partnerships, email me directly.
Email: paudela8@miamioh.edu
Profile links: /aadityapaudel/, /aadityapaudel/CV.md, /aadityapaudel/profile.json, gravatar.com/aadityapaudel