How Good Is a Language Model, Really?
Already two years ago I wrote a blog post on how different LLMs are benchmarked, safe to say that much has happened in the last two years in terms of how these models no are benchmarked. Since many of them have also gone past the initial scoring of many of them and we have needed …