7 comments:
> Professional services firm KPMG has pulled a report titled, “Redefining excellence in the age of agentic AI,”
Well, they were true to their word about demonstrating a new and increasingly relevant definition of "excellence."
The crazy thing is how little effort it takes to say, "have a sub agent validate all references and figures." I'm paraphrasing, but you don't need much more than that. It would have prevented 99% of the facepalms.
I use this regularly for my personal financial research system. Even flagship models make mistakes, though currently the issue is usually the model using a figure from an older report. A cross-check reduces that dramatically.
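For anyone curious what the deterministic half of such a cross-check might look like, here is a rough sketch. It is not the commenter's actual setup: `Report`, `Claim`, `cross_check`, and the status labels are all hypothetical names made up for illustration, and it assumes the figures have already been extracted from the reports into plain dictionaries. It flags citations that don't exist, figures that don't match the cited report, and figures that match but have been superseded by a newer report.

```python
import math
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class Report:
    # Hypothetical record of one source report and the figures it contains.
    name: str
    published: date
    figures: dict


@dataclass(frozen=True)
class Claim:
    # Hypothetical record of one figure the model wrote, plus its citation.
    metric: str
    value: float
    cited_report: str


def cross_check(claims, reports, rel_tol=1e-9):
    """Return (metric, status) pairs for every claim that fails a check."""
    by_name = {r.name: r for r in reports}
    issues = []
    for claim in claims:
        source = by_name.get(claim.cited_report)
        if source is None:
            # The cited report is not among the known sources.
            issues.append((claim.metric, "missing_reference"))
            continue
        if claim.metric not in source.figures:
            # The report exists but does not contain this figure.
            issues.append((claim.metric, "missing_figure"))
            continue
        if not math.isclose(source.figures[claim.metric], claim.value, rel_tol=rel_tol):
            # The figure is in the report but with a different value.
            issues.append((claim.metric, "mismatch"))
            continue
        # The figure matches its citation; check whether a newer report
        # also publishes this metric, which would make the citation stale.
        if any(
            r.published > source.published and claim.metric in r.figures
            for r in reports
        ):
            issues.append((claim.metric, "stale"))
    return issues
```

Anything this returns would go back to the model (or a human) for correction. The stale check is the one aimed at the "figure from an older report" problem; a real pipeline would also need the extraction step, which is where a sub agent is actually useful.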
Gartner is going to have to pull a loooot of reports over the years
Go, GPTZero!
[dupe] https://news.ycombinator.com/item?id=48515733
The Register article is better.
Every once in a while, someone utters a truly unique statement.