

I agree that the wording is a bit ambiguous, I interpreted it the way it seems more natural to me. In the post by the researcher(s) themselves, it says in the tldr paragraph that the “agent produces concrete, reproducible PoC inputs to confirm its findings” but also that they (probably humans) “explored the exploitability of the issues and developed a PoC demonstrating a RCE exploit primitive”. Apparently it finds the vulnerabilities very concretely but humans were involved for the full-blown exploit. It also doesn’t say much about the number of false-positives.
I’m not in the business, so I can’t tell how much of the work such agents are actually saving. Since the articles don’t say much about the amount of human involvement, the imagination conveyed by them probably depends strongly on the (knowledge of the) reader. But in my opinion it is a bit of stretch to say this is downplaying it. It should be noted though, that the article probably sources its information from a post by the company selling that AI.
With that information, the “without any confidence” and “area of interest” parts of your previous post still seem misleading.


I think some of that is because the reporting is focused on the new stuff, that was previously not possible. That human work is involved and some of the weaknesses are not really new. But also because the information in this case comes from a company that wants to sell their AI. I agree that the reporting is probably biased and not really sharp and therefore limited in usefulness.
Also, my (second) comment was not specifically about your comment but generally about the “vibe” of this community