I wonder if anyone is aware of this dataset
Sunday, August 24, 2025 - 23:05
I found this by chance and wondered if anyone is aware of it:
https://huggingface.co/datasets/nyuuzyou/OpenGameArt-OGA-BY-4.0
actually there are dataset for each license: https://huggingface.co/collections/nyuuzyou/opengameart-680b5aa2e8ab3ed893af03f0
Ugh point huggingface away from pixelart. It is sacred! And Really? We need a dataset of 32x32 bunny spritesheet? Computers cant make this on their own by now?
I was just sharing the info, mainly because some people are interested in backing up the site—and at the very least, this serves as a backup of the assets. That’s my main goal.
As for the purpose of the dataset: it’s not just about pixel art. It includes various graphic styles, music, and more. And honestly, there’s not much anyone can do about it—licenses allow it. Still, if someone finds it useful as a backup, then it’s there for them.
For anyone feeling concerned: There's an important distinction between having a dataset available for training and an AI actually being capable of understanding and using that data well enough to replace artists. Right now, AI is still far from reaching that level—especially when it comes to specialized artistic disciplines.
I doubt that a model trained solely on OGA would be very competitive, considering the vast and diverse datasets used to train most modern models. That said, this is purely speculation—I haven’t tested such a model or seen any artwork generated from it. However, OGA assets could potentially be included in larger datasets depending on their license. Based on my research, assets under CC0 and CC-BY licenses are eligible for inclusion, as they permit reuse—CC0 allows unrestricted use, while CC-BY requires attribution, both of which align with common dataset inclusion standards.