Software Alternatives & Reviews

The Pile: a dataset for language modeling [pdf]

GPT-J Medium
  1. 1
    Open-source cousin of GPT-3, everyone can use it

    #Productivity #Open Source #Developer Tools 95 social mentions

  2. 2
    Welcome to Medium, a place to read, write, and interact with the stories that matter most to you.
    Pricing:
    • Open Source
    1M images is probably enough to do something, this user for example trained a diffusion model from scratch using 1.5M images and a 3090: https://medium.com/@enryu9000/anifusion-diffusion-models-for-anime-pictures-138cf1af2cbe, the quality of course is not excellent but it's something. I suggest to train a 4x64x64 diffusion model using the new SD XL VAE (it's a really good f8 VAE so it can encode, for example, images from 3x512x512 to 4x64x64), if the images have captions then I suggest using a CLIP text encoder as it was already trained on image text pairs, it would probably be much easier to use by a diffusion model trained on only 1M images instead of other text encoders like T5 that have better text understanding but they have never seen an image.

    #Blogging #Blogging Platform #CMS 2190 social mentions

Discuss: The Pile: a dataset for language modeling [pdf]

Log in or Post with