-
RFF-RoPE: Adjust your RoPE using Random Fourier Features
Random Fourier Features and Rotary Position Embeddings
-
In Search of Lost Time: The Long View
When motives are opaque and persuasion is cheap, behavior over time may be our most honest evidence of character.
-
Complex-Analytic Proofs of Global Attraction for Neural Kernel Map
A global contraction of the kernel map using Schwarz–Pick, Julia–Carathéodory, and Rogosinski extremals.
-
From a Stochastic Traffic Jam to a Solvable PDE
An exploration of how complex, random traffic flow can be described by a simple, solvable continuous model.
-
NTK: A First Principles Derivation
A first principles derivation of the classic NTK result (no magic), with an analysis that already suggests the scaling for feature learning