Benchmarking AI Models
Linear Digressions
Data contamination and MMLU leakage
An unidentified host describes training-data leakage, in which models memorize evaluation datasets such as MMLU that have been published on the internet.