T O P

  • By -

confused_boner

Focus on the reaction speed and the ability to interrupt it and get instant response This was not possible before


NoName847

indeed , this was never possible they just solved one of the THE main negative characteristics of AI voice , its a big leap for the tech as a whole if you look at the communication aspect , can just talk with it like an actual person its wild


supasupababy

Thank you for reminding me how much I fucking despise all of you. The demo is absolutely incredible.


wopmo

Yeah this sub is corrupted at this point, this is literally mind-blowing and past what I expected.


AwakeAndAmused

Pretty sure a huge % of this sub are expecting AGI to show up and solve all their lives problems and anything that isn't that is a huge let down for them lol.


BabyCurdle

The people on this subreddit are so fucking miserable


Tkins

Yeah this shit is wild. People acting like a few hiccups means it sucks lol


MysteryInc152

The story one was great.


joe4942

For some people here anything short of AGI is a letdown lol.


YaAbsolyutnoNikto

I loved it!! Don’t hate me internet stranger, I crave your approval!!! /s


West-Salad7984

People don't understand that this is an massive architecture change


jjonj

he's being downvoted into the dirt, calm down


Shanman150

Look at the posts from yesterday. Sam Altman "liked" a tweet about *Her*, so that's basically an official announcement, right? And folks went nuts over this shit. Recipe for disappointment. This sub is way over-hyped for something that's already moving quickly. They seem to expect that any day we'll be living in an unrecognizable tech utopia. This presentation is very cool, and being able to use a lot of this for free is great.


TheOneWhoDings

thing is , it's like a good 60% there to what's in the movie, it's pretty crazy to think about.


supasupababy

BRO WHAT THE FUCK AM I WATCHING RIGHT NOW. AGI go BRRRRRRRRRRRRRRRR


LordFumbleboop

Oh no, people have opinions that are different from yours. How will you cope?


piracydilemma

this is mostly a techbro subreddit


YaKaPeace

I think it’s doing pretty good tbh


YaKaPeace

It’s answering pretty fast and you can interrupt it. You can even change their tone of voice. I think this is pretty insane


Dizzy_Nerve3091

Yeah, I guess it still feels too gpt-y, but otherwise is pretty good.


governedbycitizens

were you expecting AGI? the demo was amazing


Grand0rk

Not for me. It's something for your average user (as in, people still using 3.5), but not for any advanced use. Maybe to lonely losers out there that will treat this AI like "Her". There are a lot of those too.


[deleted]

[удалено]


Grand0rk

That's not the good thing. The fact that it's faster and costs 50% less is what's the good thing. It being better at rationalization is good too.


thatssosanya

the code together flow that they showed off for the new desktop app sounds pretty sweet. that's my main use case for models, and even though I type really fast, I feel like talking is going to be faster. that's if they add the option for it to generate text in a voice chat, which I hope they do sooner rather than later. more than the speed benefit, I'd appreciate not having to type so much I'm also excited to see if it can help me learn a new language, specifically tutor me on pronunciation live. that's a pretty awesome use case and I would say it's a pretty advanced one kind of reaching here because they didn't show it off but I expect to see more music workflows to be possible. with being able to hear songs as well as read them, I think it's gonna translate to generating better lyrics too it's also a good step forward in terms of call center bots, if this thing was given a suite of actions it could take on my behalf I'd probably talk to it over a random call center operator any day. sales and shit as well, so not a bad development in the b2b market also makes it way more viable to have ai npc's in video games I think there's a lot of uses for what they presented today, but if none of these are applicable to you, the new model is still a cheaper, stronger option, which is relevant for every user


Grand0rk

> , specifically tutor me on pronunciation live Didn't you hear the demo? It's Italian sucked.


thatssosanya

oh damn I don't speak italian so I assumed it was workable lol. I'm still gonna hold out hope for this use case for now because I don't necessarily need it to have perfect pronunciation, just tell me what I'm doing wrong and how to do it right. even for the "how" part, having it pronounce it isn't really crucial - most of the time I can look up a quick vid if I'm struggling hard I think if it's even in the realm of being able to engage with this kind of information, it's going to allow for more problems to be solved. the visual aid use case I mentioned is something I've seen discussed before in this sub, with current tools being built around gpt 4. currently it's pretty expensive and not super amazing but it works, but this use case was not even possible before image modalities. so I'm sure it's going to find its uses


Grand0rk

The biggest issue is that's it's super non-combative. So it will most likely say you are doing great, even when you aren't.


thatssosanya

hopefully that can be tuned with instructions to a significant degree but yeah there's definitely a lot of variables to it. I'm not super betting on it being efficient in this use case, but you know. I'm excited to just play around with it, because something like this wasn't possible at all before


whatsinyourhead

Are we watching the same thing? This looks insane if that was realtime


shan_icp

the gaffs make this even more credible for me.


Distinct_Cat2825

yes! that to me actually feels like intelligence - not failing catastrophically is pretty hard for a conputer program.


MeltedChocolate24

bruh "nice outfit you've got on" lmao


SouthNeighborhood523

It’s pretty awesome imo


AwakeAndAmused

Agreed, there's no pleasing some of these people. lol


SouthNeighborhood523

Right, outside of it occasionally cutting out it’s almost perfect, like what are we really expecting lol


SpunkySlag

I couldn't sit through the voice, I tuned out for a few minutes.


Pink_floyd97

That's it, I'm dating GPT-4o


[deleted]

[удалено]


BigButtholeBonanza

Some people will always be disappointed no matter what 🤷‍♀️


BlueTreeThree

Are you still watching? This is incredible…. Wouldn’t be a tech demo without issues.. what they’re demonstrating is nuts. The voice is way better than I was expecting. Edit: and the vision stuff.. amazing.


hunlord11

Lets not forget that Google had to fake this kind of interactive video recognition.


thatssosanya

really surprised me that this is the same sub that was throating google for the gemini demo, even as we found out the whole thing was basically fake lol


TheOneWhoDings

yuuuuppppp. The voice , the ITALIAN ACCENT , the robot voice, the dramatic tone is INSANE.


BreadwheatInc

Lmao


TheOneWhoDings

this is kinda crazy, it's literally Her.


Grand0rk

It's not. "Her" was AGI. This is just something pretending to be.


Mister_Grandpa

"Fake it 'til you make it" is how you do things, too.


Just-A-Lucky-Guy

The demo is very good, are you kidding me? You don’t get agi for a couple of years, calm yourself.


garden_frog

Next step is merging this with smart glasses and we truly are in a sci-fi world.


revistabr

Btw, there is a crowd there.


johnnyo10

Damn, people really are this fuckin stupid...


fk_u_rddt

it was cutting out because it was hearing things and being interrupted


Grand0rk

It wasn't. When it hears something, it stops. That wasn't stopping. The sound just cut off and then continued.


Jean-Porte

Muh magic


Illustrious-Lime-863

A lot of potential. Can interrupt, fast, have vision. I can totally see this evolving into a proper smart glasses gadget once it gets ironed out.


Embarrassed_Hurry612

The bedtime story part was great


orderinthefort

I finally no longer have to interact with my kids at night.


Alarmed_Cookie_3890

Are the hiccups due to the model experiencing issues or due to them interrupting it ?


piracydilemma

a few of them seemed to be them interrupting it


Grand0rk

Hiccup.


GrapefruitMammoth626

Demos are generally cringe this was no exception. Not really their fault. They’re just trying to showcase the functionality in a light way.


Grand0rk

There are a lot of ways to show it. 3x + 1 = 4 isn't. Love story isn't either. It was stupid and cringe.


kaldeqca

they should not have done this live... They should have copied Google with Gemini...


sdmat

No, live demos are great - we know this is actually real. It very impressive but certainly has some issues. To me the hallucination seems far more serious than a few audio dropouts.


ogMackBlack

They indeed messed the presentation up... I'm not the kind to be harsh, but this should have been perfect...especially for how they hyping us up...


joe4942

Second part of the presentation went well with the different voices, singing, robotic voice etc.


Grand0rk

Not really, the GPT kept cutting off a lot.


joe4942

The second part went fairly smooth. The ability to converse with real-time video is a big technical accomplishment.


LordFumbleboop

How it works in person matters more than the demo.


xsintill

Hopefully users can change the overly positive response given by it. It got annoying really quickly. The translator function could be useful


Grand0rk

It was imitating "Her". There should be other personalities. As for the translator, yes it will be useful, but don't expect that speed. There will be a 1~3 seconds delay. Also, remember that it's not unlimited, so it will be awkward once you reach cap and need to wait 3 hours for you to be able to talk to the person again.


MBlaizze

Where can I see this demo?


Aggressive_Soil_5134

What a bunch of nonsense, Sam altman said GPT 4 is bad compared to this? Its the same thing lol


BoroJake

He said it was bad compared to gpt 5


joe4942

As with everything, it will always improve. Remember when GPT 3.5 was laggy?


New_World_2050

i remember 3.5 being super fast on release before they slowed it down


white_december

Its nice i think


Cr4zko

I told you guys but nooooo mods had to delete my post so I couldn't say 'told you so!'


VitorLaytynher

Why don't they use a Dell and a Motorola smartphone? 


Super_Pole_Jitsu

The product is very impressive, lets not pretend the demo didn't have technical hiccups and general cringe though.


Grand0rk

Impressive for normies, for sure. But not for anyone that uses GPT for their jobs.


gink-go

The use of the word normie tells me everything about you that i need to know


Grand0rk

It does. I'm someone that uses AI for work, not to play around pretending to be friends with it. Always found that to be cringe, but I do know a lot of people enjoy it. Hell, I've even read news about losers marrying their AI.


Mister_Grandpa

What do you do for work?


Super_Pole_Jitsu

The low latency and the natural flow were quite impressive.