I think people underestimate how much more expensive it is to process images/videos compared to text At tinder for the recommendation algorithm, most people assume there’s some image/computer vision involved but it was 1000x more efficient to do cosine similarity w swipe patterns With that said, the next battlefield for models (and applications) is images and videos and I think it will be 1000x more exciting!
Ethan He
Ethan He7.8. klo 03.32
AI exhausted text from the entire internet. But Images are 1000x bigger. Videos are another 1000x bigger at zettabytes. There's way more videos than AI can consume yet. Video generation and world models are evolving at speed of light.
753