I dont expect it to be 100% accurate. Software aren't bug free, human aren't perfect. But may be 99.99%? At least given enough time and resources human could fact check it ourselves. And precisely because we know we are not perfect, in accounting and court cases we have due diligence.
And it is also not just about the %. It is also about the type of error. Will we reach a point we change our perception and say these are expected non-human error?
Or could we have a specific LLM that only checks for these types of error?
And it is also not just about the %. It is also about the type of error. Will we reach a point we change our perception and say these are expected non-human error?
Or could we have a specific LLM that only checks for these types of error?