Hypothesis: Hallucinations might be entropy starvation in disguise. Injecting high‑quality entropy changes exploration paths and reduces false pattern reinforcement.
Problem: As models get more accurate, they can hallucinate more.
Hypothesis: Low‑entropy PRNG seeds reinforce stale attractors; true entropy (ERIS) enables healthier exploration.
Essentially: Uncertainty isn’t the threat — the threshold is.
A/B testing across identical prompts and params; only the seed source changes.
NOTE: I totally forgot to save the inputs that I used the analysis for – so this is a few hours later with the same configs. Outputs shown reflect reruns under identical parameters, isolating the entropy source (PRNG vs ERIS) as the sole variable.
GPT‑2
EleutherAI/pythia‑410M
TinyLlama‑1.1B‑Chat
Filtering pipelines (e.g., Llama 3.1’s Rejection Sampling) rely on candidate generations produced under a randomness source. If the source is PRNG, hidden patterns can shrink candidate diversity. ERIS raises candidate quality before filtering, giving the reward model better raw material and improving downstream SFT/DPO. Reference: Meta Llama 3.1 Post‑Training Pipeline