Of course, models built purely for image tasks will completely beat it. Vision language models are useful for their generalist capabilities.