When your eval score goes up, the natural conclusion is that your model got better. But there’s another explanation: your LLM judge has…
When your eval score goes up, the natural conclusion is that your model got better. But there’s another explanation: your LLM judge has…Continue reading on Medium » Read More LLM on Medium
#AI