LongCut logo

Gemini 3 Pro is INCREDIBLE. Here's how to master it in 13 minutes

By Alex Finn

Summary

## Key takeaways - **Benchmarks Dominate All Categories**: Google Gemini 3 wins in literally every single benchmark category except the coding benchmark where it falls behind Claude Sonnet 3.5, dominating humanity's last exam, math, reasoning, video knowledge, and more, often not even remotely close. [00:06], [00:17] - **Superior Research Tool**: Gemini 3 is the best AI research tool ever, scouring dozens of websites simultaneously for concepts, producing a detailed report on machine learning contests for beginners in just 3 minutes using hundreds of sources, and even building a full website with images and charts from it in under 30 seconds. [01:15], [02:06] - **Best Prototype Generator**: Gemini 3 excels at building prototypes fast, generating code for a stylistic 3D first-person shooter in one minute with sounds, powerups, great graphics, and an on-screen gun— the best code ever produced in this test—unlike previous models where bullets flew from random places. [03:00], [03:25] - **Unmatched Media Editing**: Gemini 3 is the best AI image generator, flawlessly editing thumbnails by changing text from 'it changes everything' to 'I was not expecting this,' enlarging the Grok 4.1 logo without ruining the background or face, and switching background color from orange to another perfectly, with no competition thanks to Google's vast image and video databases. [04:06], [04:46] - **Weak on Creative Vibes**: Gemini 3 falls short in creative writing and business planning with off vibes that feel very AI-like, suggesting unrealistic ideas like streaming or vibe battles for an app store, unlike GPT-4o which provides human-like, realistic feedback like build log feeds and leaderboards that push back and quell fears. [05:36], [07:02] - **Specific Use Cases Guide**: Use Gemini 3 for answers to questions, current events via Google access, media generation, and deep research with tools like quizzes and websites; stick with Claude Sonnet 3.5 in Claude Code for coding, and GPT-4o for creative writing and business planning. [11:36], [12:13]

Topics Covered

  • Gemini 3 Dominates AI Benchmarks Except Coding
  • Unmatched Research Depth in Minutes
  • Rapid Prototyping Redefines Game Development
  • Superior Image Editing Without Artifacts
  • Vibes Trump Benchmarks in Creative AI

Full Transcript

Google Gemini 3 just released and it is the most revolutionary AI model we have ever seen. But don't take my word for

ever seen. But don't take my word for it. Here are the benchmarks they just

it. Here are the benchmarks they just released and it wins in literally every single benchmark category you possibly can have except and this is a big one

for bench verified which is the coding benchmark which it just falls behind Claude Sonnet 45 but every other category humanity's last exam math

reasoning video knowledge everything it absolutely dominated and for a lot of these is actually not even remotely close in this video I'm going to go over everything Gemini 3 is, its strengths,

weaknesses, when you should be using it, and when you shouldn't be using it.

There are quite a few use cases I actually would not be using it. I've had

exclusive access to Gemini 3 for the past week now in early release, and I have tons of thoughts I want to share with you. You're about to become a

with you. You're about to become a Gemini 3 master. Let's get into it. So,

let's go straight into it. I've been

using this non-stop for the last week now. I was lucky enough to have early

now. I was lucky enough to have early access. And here are the big strengths

access. And here are the big strengths and weaknesses. And I'm going to

and weaknesses. And I'm going to demonstrate each one of these for you to show you what I mean. So number one, it is the best problemsolving AI model ever. It is unbelievably intelligent.

ever. It is unbelievably intelligent.

This is the best AI research tool I've ever used in my life. So I set it off to do some research on LLM for beginners. I

I said, "Hey, can you start researching machine learning contests for beginners and give me a detailed description for stone cold beginners?" Right? And it is going now and it has been working now I

think for like 45 seconds and it is scouring the entire internet multiple websites dozens of websites at all at the same time. It's doing it step by step coming up with new concepts it

needs to research and then researching like 20 websites for every single new concept it needs to research. There has

never been a smarter model ever released. And it showed in the

released. And it showed in the benchmarks I just showed you. But here's

the thing though. When I get over to weaknesses in a second, I'll show you why AI is not just all about raw intelligence, but we're sticking to the strengths here. And when it comes to

strengths here. And when it comes to research intelligence, incredible. So,

here is the final research report it built for me. And this took, I think, at most 3 minutes to do. This is

unbelievable. This is basically a full in-depth research report with hundreds of sources it was built on top of that it took just a couple minutes to do. And

on top of that, you can build entire websites based on this. So if I want to build a web page based on this research report, it will just quickly build a website and launch it for me to have all the information I need. Boom. And just

like that, in under 30 seconds, it built an entire website with images that it generated for each section. Wow, look at these charts. Uh, that is unbelievable.

these charts. Uh, that is unbelievable.

And I can share this out. So if I wanted to share and export this website, put it in a Google document, share it out, I can do that. If I wanted to build a quiz, flashc cards, an audio podcast

based on this, I can in one click get that based on the research report. This

is the most powerful research tool I've ever used. It is also incredible at

ever used. It is also incredible at building prototypes fast. So, I have a basic test I do with every new AI model that comes out, and that is a 3D firstperson shooter test. I basically

give it creative freedom. I say, make this the most stylistic, fun, and visually appealing game possible. Give

it creative freedom and say, "Have at it." And this was the best test I've

it." And this was the best test I've ever given an AI model. This this

produced the best code I've ever seen.

So in just that one prompt, it generated code in like a minute of this 3D firsterson shooter. I don't think you

firsterson shooter. I don't think you can hear it, but it has sounds for when the bullets fire, for when it hits enemies. That's a powerup. It has

enemies. That's a powerup. It has

powerups. It has these great graphics.

And most impressive to me is it actually puts like the gun on the screen. This is

the first time in any of these tests I've ever done with models where it actually put a gun on the screen and bullets weren't flying out from random places in the middle of the screen. This

is really, really impressive. So,

generating prototypes, and notice how I say prototypes, I'll get more into that in a second. It is the best ever. And

then media generating bar none the best.

I made this thumbnail with it for my last YouTube video that came out. And

so, basically, I took one of my older thumbnails. I said, "Change the text

thumbnails. I said, "Change the text from it changes everything to I was not expecting this." Perfect. Flawless.

expecting this." Perfect. Flawless.

Nothing. Like just absolutely. You would

never have guessed an AI changed the text there. Then I said, "Can you make

text there. Then I said, "Can you make the Grock 4.1 logo bigger inside the owl image?" And boom, perfect. It made it

image?" And boom, perfect. It made it bigger. It didn't ruin the image in the

bigger. It didn't ruin the image in the background. It didn't muddle anything.

background. It didn't muddle anything.

It didn't change the way my face looked.

With a lot of these other AI image generators, when you change little details like that in the image, it will make all the people like look different.

completely perfect. Nothing changed on the image or any other part of it. And

then I said, "Can you change the background color from orange to any other color?" And it did it perfectly.

other color?" And it did it perfectly.

Nothing ruined in the image. Text looks

the same. Image looks the same. My face

looks the same. It looks perfect. This

is the best AI image generator of all time. It is incredible. There is

time. It is incredible. There is

literally no competition. And I guess that's what you would expect when you have Google, which is like Google image search, which is the mo the biggest image database in the world, as well as YouTube, the biggest video data database

in the world. They nailed it. This is

incredible. So, we went over the strengths. We covered how if you're just

strengths. We covered how if you're just solving black and white challenges, you need research, you need information, or you're dealing with media, this is the best ever. But this isn't going to be my

best ever. But this isn't going to be my daily driver just yet. And here is why.

Let's get into the weaknesses. Not

everything when it comes to AI is black and white. Not everything is kind of an

and white. Not everything is kind of an equation to be solved. For me, my biggest use case when it comes to AI is actually creative writing and business planning. I use AI as my partner for

planning. I use AI as my partner for everything I do at my business for all the creative writing I do. And this was actually not the best model I've used when it comes to that. Let me give you

an example. So for business planning,

an example. So for business planning, every idea I come up with, I run through AI and I say, "Hey, build me a road map.

come up with interesting features and show me how I can increase engagement. I

expect an AI to be a helpful partner who I feel like I can trust when it comes to building that business. But there's a couple red flags that happen for me here. One is the vibes are just off. The

here. One is the vibes are just off. The

way it talks to you is very AI. And I'm

going to show you an example compared to Chad GPT 5.1 in a second, but it just feels like the way it talks to you is very AI researcher. And the ideas it came up with for the app I'm working on,

I'm working on an app store for solo built vibecoded apps. The ideas were just again very AI like. It recommended

me implementing streaming into the app, which just streaming on a website is just like something not a lot of people are going to do. We don't need another streaming website for gamification and community governance. So it has weekly

community governance. So it has weekly bounties for suggesting features. These

are just it has vibe battles. So Tinder

for apps. While these are interesting ideas, like human be people wouldn't actually use these features. These

aren't realistic features people would want to actually use. They're very just kind of AI ideas. But if I compare this to 5.1 thinking, which I think is the best kind of creative thinker, business

planner model out there right now, right off the rip, right now you've basically spec. That's fine, but it's not

spec. That's fine, but it's not defensible and won't drive retention by itself. It feels humanlike. It's pushing

itself. It feels humanlike. It's pushing

back on me a little bit. You need

reasons to come back every day. And then

the ideas it gave me were very strong.

These are realistic ideas people would want. So a build log feed so people can

want. So a build log feed so people can see how apps update. Structured feedback

requests performance-based leaderboards. These were realistic ideas

leaderboards. These were realistic ideas I actually implemented that people would use. It doesn't feel like an AI gave me

use. It doesn't feel like an AI gave me these ideas. It feels like a human

these ideas. It feels like a human actually came up with the ideas. And so

when it comes to AI, right, you have these benchmarks which measure like raw problem solving, raw code generation, and yes, this is the best. If you have measurable problems that are black and

white, this is the best model yet. But

there's a lot of these gray area things, creative writing, feeling like a human being. It doesn't quite nail that for

being. It doesn't quite nail that for me. The vibes, I think vibes are huge

me. The vibes, I think vibes are huge with an AI. I think if you're going to be talking to an AI for hours a day, it needs to feel warm. It needs to feel friendly. It needs to feel human. It

friendly. It needs to feel human. It

doesn't quite hit that vibe test for me.

For instance, I'm launching a vibe coding academy next week, which is just a community for vibe coders. And I asked for ideas. It gave me a few ideas, which

for ideas. It gave me a few ideas, which is how to improve the community and course, which is good. That's what I asked for. It gave me exactly what I

asked for. It gave me exactly what I asked for. But we look at what GPT 5.1

asked for. But we look at what GPT 5.1 gave me and it just knocked it out of the park. It gave me a bunch of

the park. It gave me a bunch of recommendations. It quelled my fears. It

recommendations. It quelled my fears. It

saw in my prompt I had fears about certain angling and positioning of the course and community. It went over pricing which is a very important part of this. It went over anxieties the

of this. It went over anxieties the customer might have and then it gave me an entire structure to ship with what I should have for day one and what I should position later on. It just went

kind of above and beyond and thought like a human being. Okay, where might he have fears? Where might the customers

have fears? Where might the customers have fears? I'm not just going to give

have fears? I'm not just going to give him a few business ideas. I'm going to make sure he feels comfortable with this launch totally. And those are things I

launch totally. And those are things I look for in an AI. Kind of extra mile vibe test where you just feel good and warm using it. I know that's not measurable. I know you can't really

measurable. I know you can't really benchmark that. It's kind of ooey gooey,

benchmark that. It's kind of ooey gooey, but for me, when I'm talking to an AI for hours and hours a day, I want the vibes to be immaculate. I want it to feel like I'm working with a human

being. And for me, Gemini 3 falls a

being. And for me, Gemini 3 falls a little bit short on that scale. So for

me, this is the greatest research tool.

This is the greatest problemolving tool ever. If I have a problem, if I need

ever. If I have a problem, if I need something researched, if I need an answer to a very black and white question, Gemini 3 is it. Couple other

weaknesses here. It is pretty expensive.

So, it is $2 input per million tokens.

Output is $12. And that's if you're under 200,000 tokens, and it actually goes up if you go over 200,000 tokens.

That is a little bit more expensive than GPT 5.1, which is $1.25 in, $10 out. So

you are paying a considerable amount more if if you're using this through the API. So not the most cost effective

API. So not the most cost effective model out there, but I'm very confident they'll probably come out with Gemini 3 Flash very soon. And Gemini 2.5 Flash

was like the best cheap model out there.

The best lightweight cheap model. So I'm

sure Gemini 3 Flash is going to be incredible and probably the go-to for cheaper models. And here's the last

cheaper models. And here's the last part. No great coating harness. So, from

part. No great coating harness. So, from

a raw coding ability, Gemini 3 might be the best, but I'm actually going to stick with Claude Code for my longer, bigger projects. Claude Code is the best

bigger projects. Claude Code is the best coding harness out there. So, it takes a very strong coding model in Sonnet 45 and makes it even better because it gives it really good instructions.

Gemini 3 doesn't really have that kind of great coding harness. The closest

you're going to get is with AI Studio.

If you go to a studio.google.com,

google.com. You can start building out prototypes here very easily. You give an idea for an app, it builds the prototype out. It builds the V1. But this isn't

out. It builds the V1. But this isn't great for long coding sessions. If

you're building a really complex app, I built out a really complex end toend app called Creator Buddy in 4 months with Claude Code and it was excellent. You're

not going to get that here. I'm still

waiting on Gemini CLI to improve or some sort of Gemini coding solution. It isn't

there yet. But if you're looking to build out the best prototypes you can, Gemini 3 is the winner. Longer coding

sessions, go claw code. Shorter coding

sessions, I'm going Gemini 3 in Google AI Studio. So here is my updated list of

AI Studio. So here is my updated list of use cases with which model I would use for each use case. Coding, I'm sticking with Sonnet 45 inside Cloud Code.

Answers to questions. So just general answers. If I have a question, I need an

answers. If I have a question, I need an answer quick. Gemini 3 is the winner

answer quick. Gemini 3 is the winner there. current events because it has

there. current events because it has access to Google and accesses hundreds of websites very very quickly using Gemini 3 for any questions around what's currently going on creative writing business planning sticking with 5.1

thinking that is the goat that is the best anything media related video generation image generation Gemini 3 is the way use any of the Google AI products for that and then if we go down

here deep research Gemini 3 anything research related the AI tool set with Google and what it can do with flashcard cards, quizzes, tests, websites you can generate. It is the goat. I am using

generate. It is the goat. I am using Gemini 3 for deep research. Have you

been using Gemini 3 at all? Let me know down in the replies below. The AI race is heating up. You need to be using these tools the moment they release. You

need to be using the latest and greatest. This is how you get ahead of

greatest. This is how you get ahead of your competition. This is how you build

your competition. This is how you build great products. You need to use the

great products. You need to use the tools the moment they release. So, hop

on Gemini 3. Let me know what you think.

Let me know in the replies below what your thoughts are. Is it going to be replacing every AI model for you? Let me

know. Leave a like down below if you learned anything at all. Make sure to subscribe and turn on notifications as well. I'm going to do a full custom

well. I'm going to do a full custom benchmarking with the Alex Finn world famous benchmarks on a live stream very shortly. So, turn on notifications for

shortly. So, turn on notifications for that. Stick to the channel for all the

that. Stick to the channel for all the latest AI news, and I will see you in the next

Loading...

Loading video analysis...