from someone who's a realist-for-now yet also wants to believe.
Adam Engst on Can Agentic Web Browsers Count?
tl;dr No, given a readily available data set on a webpage, they can't.
The sweetest and scariest part was his sympathy for Copilot's very anxious inner monologue as it tried to come up with answers while working to a deadline that nobody had created.
When it comes to system prompts, the anxious tone of Copilot’s internal responses suggests a “ship now, apologize later, if you’re caught” system prompt that, if reflected in a real-world workplace, would be problematic. Obviously, AIs don’t have feelings that can be hurt and won’t complain to HR, but such a culture tends to encourage people to cut corners and make poor decisions that compromise quality and customer service. If Copilot is any indication, the same is true for AIs.
(no subject)
Date: 15/11/2025 12:22 am (UTC)I didn't know you could now see that internal monologue of some AIs. It helps to combat the black box problem, but opens up a whole bunch of new ones. Poor Copilot.
Glad to share!
Date: 15/11/2025 10:18 pm (UTC)Could it be a conscious choice to make Copilot that uncertain and unable to speak up for itself? Is it a plan to prevent union activity among artificial "intelligences"? Are the LLM engineers so accustomed to those unreasonable time pressures on their own work that they've recreated those attitudes in their progeny?
(no subject)
Date: 15/11/2025 02:53 am (UTC)I am absolutely floored by
So: "it turned out that when I double checked it, it was utterly full of incorrect made-up nonsense, and it's mostly by chance that the results were close to right. But if that hadn't happened, it would have been perfect. Ah, well, B."
(no subject)
Date: 15/11/2025 09:39 pm (UTC)Seems to be going around
Date: 15/11/2025 10:21 pm (UTC)a willingness to accept 88% correct as good enough.
I was very surprised to see a standard that squishy from Engst, whose Apple-oriented tech support site predates the web.
(no subject)
Date: 15/11/2025 07:28 pm (UTC)I simply don't understand how
Date: 15/11/2025 10:22 pm (UTC)someone with a high school diploma, much less a college degree, can accept consistent, significant numerical errors as "good enough."