zerr parent
My litmus test for generative AI: generate a complete spritesheet for a 2D pixel art action game, e.g. only for the battle tank or main hero movements. No success so far.
Ive never once successfully gotten a usable sprite sheet out of ChatGPT. The concept seems foreign to it and no matter how hard I try to steer it it’ll find a way to do something hopeless (inconsistent frame sizes; incoherent animations; no sense of consistent pixel sizes or what distinguishes (say) 8-bit from 16-bit era sprites; it’ll draw graph paper in the background for some reason; etc etc.). If anyone has a set of magic prompts for this, I’d love to learn about it. But my suspicion is that it’s just fundamentally the wrong tool for the job — you probably need a purpose-built model.
Like a full sprite sheet, one sprite at a time or a sprite animation loop?
Each one would require a different kind of model and model technique to make, so I wouldn't be surprised that ChatGPT has issues with it. A sprite animation loop would be better done by a potentially specialized video-oriented model, for example, and the current image and video models are barely trained on that kind of video data.
that might be possible by asking it to create an 3d model with animations (based on a template) and then capture the sprites. but then again, not sure if building it would be worthwhile because 1) openai might add that as a native product (like what happened with .ppt generation) or 2) the capability to do so might be 6 months away
Have you tried drawing?
According to all the lazy articles I've read here lately you just need to threaten to beat it up lmao...
Unless you're using a model that was built for pixel art, you will never get a usable piece of pixel art out of a model using a diffusion based image generator because it does not understand what a pixel is. You'll always get pixels bleeding into the others, shitty outlines, and nonsensical AA. They simply do not understand the medium.
Mind sharing what you have tried? Have you considered training a diffusion model on pixel art, and then conditioning it on a 3D model?
Agreed, I got an LLM to build me a Super Mario Bros game with 0 code and it's playable after 2 prompts but I'm stuck with the spritesheet as well.
I've seen people make finetunes for character turnarounds, there's probably one for 2D pixel art in particular, or perhaps combining such a finetune with a pixel art finetune would work.
It looks like retrodiffusion.ai in particular has something close.
Maybe AI games can use AI-first art styles. If I ask a camera to create an André Derain painting then it will fail. But that's my fault for misunderstanding the medium.
Yeah I tried to get a spritesheet generated. Absolute rubbish.