Aleksandra Bakalova: Discovering Interpretable Algorithms by Decompiling Transformers to RASP
