Tried it. It is terrible. I tried to make an all-black rally car without plates, with a number on the door. I instructed it to use a wide-angle lens and take the picture from a distance. I even asked for black paint where the plates would be. Also asked for the picture to be taken from an elevated position with the sun coming from behind the camera. Every time, there would be plates, sometimes with numbers, the number on the door would be wrong 90% of the time, and all pictures would be taken from ground level, and from very close. A dust cloud would often be in front of the car, and I never got the sun from behind the camera as instructed. After 100 or so generations I gave up. Every time I specified some areas with problems, it would mess up something else. It also seemed to get stuck in a certain look, just as we see in all the fake video thumbnails on YouTube; you learn to see what it's doing and recognize it. Maybe this is good for some type of design, but the results I got with this were no better than, and in fact VERY similar to, the thumbnail creator in Udio.
"for free"
Did you not even bother reading their GitHub and such?
The "pro" model, which is what their API uses, you cannot even download...
Their "dev" model is a whopping 23.8GB. I don't know if image gen models behave like LLMs, but if they do, only a small percentage of people are gonna be able to run it, as they'd need 24GB of VRAM.
It would be nice if the video showed the dev model being used locally, so we can see how good we'll realistically get.
I tested a few simple prompts and it follows them pretty closely. Also, the claim about text holds true: each generation was successful, even with tilted and defocused text.
Impressive. More complex prompts cause many more mistakes, though.
These are not free. Literally no mention of the countless stolen images used to train these. You didn't "make images of scholars"; you typed a prompt and the algorithm used actual artists' work. The tech is amazing, but the lack of intellectual honesty and integrity is telling.
Am I seeing this right?? When I run it on Replicate, it says it's running on CPU, rather than on much larger GPUs like the A40, the way something like SDXL does.
I'm running Flux dev on my own machine and its results are GREAT. Very coherent and aesthetic, and the amount of detail is outstanding -- the output is full of pixel-level features, so I wouldn't be surprised if it is using a 16-channel VAE like SD3. The drawbacks are 1. this model is HEAVY at 12B parameters plus T5 (you'll probably need at least 12GB VRAM to run it, and quantized at that); 2. if you thought SDXL was slow, this is about 4x slower step-for-step; 3. it currently doesn't support negative prompts as far as I know, so that may cause problems if your desired prompt causes the model to add undesired features to the image (e.g. "make me a photo of fried rice without peas"). Hopefully, Black Forest will make this into a model family including lower-weight options and put out their paper soon so others can learn from their advances.
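For anyone wondering what "running it on my own machine" can look like in practice, here is a minimal sketch using the Hugging Face diffusers wrapper (this assumes the FluxPipeline class and the black-forest-labs/FLUX.1-dev checkpoint; the CPU-offload call is one way to squeeze the 12B model into limited VRAM, at the cost of speed):

```python
import torch
from diffusers import FluxPipeline

# Load FLUX.1 [dev] in bf16; the transformer alone is ~12B parameters.
pipe = FluxPipeline.from_pretrained(
    "black-forest-labs/FLUX.1-dev", torch_dtype=torch.bfloat16
)
pipe.enable_model_cpu_offload()  # keeps only the active submodule on the GPU

# No negative prompt support, so the prompt itself has to carry everything.
image = pipe(
    "a photo of fried rice, no peas",
    height=1024,
    width=1024,
    guidance_scale=3.5,
    num_inference_steps=50,
).images[0]
image.save("fried_rice.png")
```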
Lately it's not about scientific achievements; rather, it's "this is free today, this is open source, that will be free forever," and so on. A communist paradise. Two Minute Papers, you are losing your roots.
Tried it, and it is really stupid. I asked it to draw a trolleybus and it came up with totally idiotic images. Absolutely no understanding of catenary and trolley poles. A streetcar was a little bit better, but still massively flawed.
Is it hysterically censored, or will it allow the generations we want? Like Hitler or Netanyahu.
Edit: yes it will. It started out a little puppet-cartoonish, but with several attempts it gets decent. It refused to add Satan behind Netanyahu, but a demon was fine. It doesn't get the Hitler mustache quite right, and it doesn't have quite the photorealism of Midjourney, but it's pretty good. Has to be the pro version, though.
I end up using Midjourney's "Niji 6" for its amazing creative quality when coming up with stylized characters. I don't know if Flux would come close, as this seems mostly similar to Midjourney's V6.1.
First off, I have to say that Flux is fantastic. Way better than DALL·E 3. In my opinion, DALL·E 3 is a completely useless algorithm. I’ve tried working with it multiple times, even though I pay for the premium version. Each time it just doesn’t understand or follow my instructions. The prompt is impossible to modify, making it unusable. It might work for something simple like “a hedgehog in a meadow,” but for anything more complex, it fails miserably.
Yeah, it's cool. But it will be forgotten within a month unless they manage to, firstly, optimize and prune it, probably by about a factor of 10 on performance, and secondly, figure out how to finetune it.
Is the training data ethically sourced? If not, this is yet again another program built off the backs of uncredited artists and photographers for the sake of cutting corners.
"Your request will cost $0.05 per megapixel. For $1 you can run this model approximately 20 times." I think we have different meanings for the word "free" xD
Note that DALL-E 3 can still seemingly do more styles, and Flux still struggles with longer texts.
It also struggles with novel concepts such as "Spoon-shaped elephant".
Been using this model for 3 days, and I can't believe that the Flux Schnell model is that fast and that the Flux Pro model's results are on par with Midjourney's and the Stable Ultra model's.
This is incredible. I tried it a few minutes ago, and it's going to take me some time to recover from my amazement. I asked for a picture in the style of a Victorian painting with ancient Indo-European chieftains and a mass of their followers, and got an extremely good result.
Dr. Károly, I know it's not quite your usual thing, but I would be very interested to see a video summarizing the major public advancements that have been made in cloth simulations over the last five or ten years. What kind of performance gains have been realized by the kind of research you've covered? What sort of real-time cloth simulation, such as in video games, has gone from unthinkable to trivial? I'd love to know! Maybe it's more suitable for a TwentyMinutePapers kind of video, but I really want to see it!
After several tries, I got the Schnell version on HuggingFace to draw something acceptable (free and without login), so it's about as good as SDXL or DALL-E. I usually throw the same prompt at those two (and Google ImageFX) to see which one actually generates something usable. Good to have a fourth option.
How much is Flux Pro? I'm not signed up, and if I go to the Flux Pro page it says it would be $.0017 to run my prompt. But a friend who has Flux Pro says it tells him $.05 to run his prompt. That's not an insignificant difference.
Fails my own test that I found out about when testing Midjourney a while ago. I basically ask for an image of an everyday item broken/disassembled into its parts, e.g. a hammer.
I rather doubt this in particular will be running on a phone any time soon. It's a 12B parameter model and the text encoder (T5-Efficient-XXL) is another 11B parameters. The hardware hunger is real.
Bringing the text capability the model has to a smaller model, though? Seems entirely possible.
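To put those parameter counts in perspective, a rough weights-only estimate (assuming 2-byte bf16 parameters; real usage varies with quantization, offloading, and activations):

```python
BYTES_PER_PARAM = 2  # bf16/fp16 storage
GIB = 2**30

flux_transformer = 12e9  # FLUX.1 parameters, per the comment above
t5_encoder = 11e9        # T5-Efficient-XXL text encoder

print(f"FLUX.1 weights: ~{flux_transformer * BYTES_PER_PARAM / GIB:.1f} GiB")  # ~22.4
print(f"T5 encoder:     ~{t5_encoder * BYTES_PER_PARAM / GIB:.1f} GiB")        # ~20.5
# Roughly 43 GiB combined before activations, hence the quantized setups.
```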
The title (currently "OpenAI’s DALL-E 3-Like AI For Free, Forever!") makes it sound like OpenAI actually released an open model. You should probably change that, since it's misleading. A very cool model, though!
Hey Károly! The title of your video made me think that Open AI actually released an open model for once... It may confuse other fellow scholars as well, so maybe consider changing it somewhat.
Also I would have liked it if you had mentioned that there are two variants of the model, but that's just my take.
Have we reached the point yet where any of these can synthesize a 3D scan of what's in their images? Like with photogrammetry: generating a series of images 360° around the object and creating the model from that?
Flux suffers from additional-appendage and mystery-appendage syndrome. It also doesn't have a sense of direction: I generated a man flying a plane backwards. It also doesn't generate images well that aren't tropes; for example, it took me four or five tries to create a B-horror masterpiece with the right zing. Text sometimes randomly appears in the image, even if unprompted.
"for free, forever" yeah no. more like, "for the petit bourgeois and above in mostly anglosphere countries, and only until a megacorporation inevitably establishes a backed monopoly on all AI-related industries in the coming years"
Elongated thumb at 0:58; at 1:03, the hand is mangled next to the T. Same problem as other AIs, probably: the more specific you are, the less it works.
I feel like this discredits Stable Diffusion a bit, which is not worse than Midjourney and definitely not worse than DALL-E; plus, it is open weights and more flexible than either of the ones mentioned.
Envision the notion of running a 12-billion-parameter image generation model on a smartphone...
You do know there are more efficient ways to fry an egg right?
Free image generators will not be free forever without limitations. Free versions will not be the same as paid versions, because paid versions have a lot more features. Free versions have only basic or fundamental features.
People are already running it on an iPhone using the DrawThings app. It takes an hour... but still, the future is now.
One of the major things I need AI for is creating art assets for a game I am working on; for that, persistence is important.
With Flux dev, quality is better than MJ or DALL-E 3.
"Your request will cost $0.05 per megapixel. For $1 you can run this model approximately 20 times."
"Billable Time
21s
Cost estimate: $0.00"