Show HN: Duplicate 3 layers in a 24B LLM, logical deduction .22→.76. No training

(github.com)

89 points | by xlayn  8 hours ago

30 comments