← kwindla hultman kramer

The broad applications for very fast general vision models are as yet…

May 18, 2024

The broad applications for very fast general vision models are as yet under-appreciated.

To be fair, this is true right now of vision in general, and really all of generative AI.

Those of us building applications have not yet caught up to the last 24 months of multiple leaps forward in the core tech. But right now I think it’s *particularly* true for vision.

artem@artemsya

Got fast paligemma inference working on RTX 4090. Here's an object detection demo with the the 224px model running in real time at 16fps. I generate 10 tokens per iteration

Video from @artemsya's post