Whether we should trust AI, particularly generative AI, remains a worthy debate. But if you want better LLM results, you need two things: better data and better evaluation tools. Here's how a chip ...