
Pelican riding a bicycle: https://pasteboard.co/CjJ7Xxftljzp.png


2D SVG is old news. Next frontier is animated 3D. One shot shows there's still progress to be made: https://aistudio.google.com/apps/drive/1XA4HdqQK5ixqi1jD9uMg...


Great improvement from adding just one feedback prompt: "Change the rotation axis of the wheels by 90 degrees in the horizontal plane. Same for the legs and arms."

https://aistudio.google.com/app/prompts?state=%7B%22ids%22:%...


Did you notice that this embedded a Gemini API connection within the app itself? Or am I not understanding what that is?


I hadn't! It looks like that is there to power the text box at the bottom of the app that allows for AI-powered changes to the scene.


This says Gemini 2.5 though.


Good observation. The app was created with Gemini 3 Pro Preview, but the app calls out to Gemini 2.5 if you use the embedded prompt box.


Incredible. Thanks for sharing.


Sometimes I think I should spend $50 on Upwork to get a real human artist to do it first, so we know what we're going for. What does a good pelican riding a bicycle SVG actually look like?


IMO it's not about art, but a completely different path than all these images are going down. The pelican needs tools to ride the bike, or a modified bike. Maybe a recumbent?


At this point I'm surprised they haven't been training on thousands of professionally-created SVGs of pelicans on bicycles.


I think anything that makes it clear they've done that would be a lot worse PR than failing the pelican test would ever be.


It would be next to impossible for anyone without insider knowledge to prove that to be the case.

Secondly, benchmarks are public data, and these models are trained on such large amounts of it that it would be impractical to ensure that some benchmark data is not part of the training set. And even if it's not, it would be safe to assume that engineers building these models would test their performance on all kinds of benchmarks, and tweak them accordingly. This happens all the time in other industries as well.

So the pelican riding a bicycle test is interesting, but it's not a performance indicator at this point.


It’s a good pelican. Not great but good.


The blue lines indicating wind really sell it.



