Neat. But that 93% confidence in the "bus" is one of the things that bugs me abo...

photoGrant · on Jan 3, 2023

It's 93% sure it SEE'S a bus, not so much it's 93% a guarantee it actually IS a bus.

Confidence can corrupt in many domains, not just AI!

miohtama · on Jan 3, 2023

As far as I have seen, most AI solutions focus on object detection on a single frame. Would temporal memory, or video detection, increase the confidence a lot? I have not seen any solutions that would understand larger context over multiple seconds timespans.

actionfromafar · on Jan 4, 2023

The real kicker would be for it to integrate a 3D model of what it's looking at. But that would require some heuristics of the world which would probably require some other kind of training data than just a bunch of images. Maybe if/when 3D scans and the corresponding 2D images can be acquired en masse together, or if it could be done in a simulated environment with virtual cameras in them?

bart__ · on Jan 3, 2023

It is also 99% sure some sort of post is a human.