AlgZoo: uninterpreted models with fewer than 1,500 parameters — AI Alignment Forum

Discover how tiny AI models with fewer than 1,500 parts act like simple puzzles to help researchers build safer, more trustworthy artificial intelligence sys...

Level: beginner

By Unknown

Category: discussion