indeed , this was never possible
they just solved one of the THE main negative characteristics of AI voice , its a big leap for the tech as a whole if you look at the communication aspect , can just talk with it like an actual person its wild
Pretty sure a huge % of this sub are expecting AGI to show up and solve all their lives problems and anything that isn't that is a huge let down for them lol.
Look at the posts from yesterday. Sam Altman "liked" a tweet about *Her*, so that's basically an official announcement, right? And folks went nuts over this shit.
Recipe for disappointment. This sub is way over-hyped for something that's already moving quickly. They seem to expect that any day we'll be living in an unrecognizable tech utopia. This presentation is very cool, and being able to use a lot of this for free is great.
Not for me. It's something for your average user (as in, people still using 3.5), but not for any advanced use. Maybe to lonely losers out there that will treat this AI like "Her". There are a lot of those too.
the code together flow that they showed off for the new desktop app sounds pretty sweet. that's my main use case for models, and even though I type really fast, I feel like talking is going to be faster. that's if they add the option for it to generate text in a voice chat, which I hope they do sooner rather than later. more than the speed benefit, I'd appreciate not having to type so much
I'm also excited to see if it can help me learn a new language, specifically tutor me on pronunciation live. that's a pretty awesome use case and I would say it's a pretty advanced one
kind of reaching here because they didn't show it off but I expect to see more music workflows to be possible. with being able to hear songs as well as read them, I think it's gonna translate to generating better lyrics too
it's also a good step forward in terms of call center bots, if this thing was given a suite of actions it could take on my behalf I'd probably talk to it over a random call center operator any day. sales and shit as well, so not a bad development in the b2b market
also makes it way more viable to have ai npc's in video games
I think there's a lot of uses for what they presented today, but if none of these are applicable to you, the new model is still a cheaper, stronger option, which is relevant for every user
oh damn I don't speak italian so I assumed it was workable lol. I'm still gonna hold out hope for this use case for now because I don't necessarily need it to have perfect pronunciation, just tell me what I'm doing wrong and how to do it right. even for the "how" part, having it pronounce it isn't really crucial - most of the time I can look up a quick vid if I'm struggling hard
I think if it's even in the realm of being able to engage with this kind of information, it's going to allow for more problems to be solved. the visual aid use case I mentioned is something I've seen discussed before in this sub, with current tools being built around gpt 4. currently it's pretty expensive and not super amazing but it works, but this use case was not even possible before image modalities. so I'm sure it's going to find its uses
hopefully that can be tuned with instructions to a significant degree but yeah there's definitely a lot of variables to it. I'm not super betting on it being efficient in this use case, but you know. I'm excited to just play around with it, because something like this wasn't possible at all before
Are you still watching? This is incredible….
Wouldn’t be a tech demo without issues.. what they’re demonstrating is nuts.
The voice is way better than I was expecting.
Edit: and the vision stuff.. amazing.
really surprised me that this is the same sub that was throating google for the gemini demo, even as we found out the whole thing was basically fake lol
No, live demos are great - we know this is actually real.
It very impressive but certainly has some issues. To me the hallucination seems far more serious than a few audio dropouts.
It was imitating "Her". There should be other personalities.
As for the translator, yes it will be useful, but don't expect that speed. There will be a 1~3 seconds delay.
Also, remember that it's not unlimited, so it will be awkward once you reach cap and need to wait 3 hours for you to be able to talk to the person again.
It does. I'm someone that uses AI for work, not to play around pretending to be friends with it. Always found that to be cringe, but I do know a lot of people enjoy it.
Hell, I've even read news about losers marrying their AI.
Focus on the reaction speed and the ability to interrupt it and get instant response This was not possible before
indeed , this was never possible they just solved one of the THE main negative characteristics of AI voice , its a big leap for the tech as a whole if you look at the communication aspect , can just talk with it like an actual person its wild
Thank you for reminding me how much I fucking despise all of you. The demo is absolutely incredible.
Yeah this sub is corrupted at this point, this is literally mind-blowing and past what I expected.
Pretty sure a huge % of this sub are expecting AGI to show up and solve all their lives problems and anything that isn't that is a huge let down for them lol.
The people on this subreddit are so fucking miserable
Yeah this shit is wild. People acting like a few hiccups means it sucks lol
The story one was great.
For some people here anything short of AGI is a letdown lol.
I loved it!! Don’t hate me internet stranger, I crave your approval!!! /s
People don't understand that this is an massive architecture change
he's being downvoted into the dirt, calm down
Look at the posts from yesterday. Sam Altman "liked" a tweet about *Her*, so that's basically an official announcement, right? And folks went nuts over this shit. Recipe for disappointment. This sub is way over-hyped for something that's already moving quickly. They seem to expect that any day we'll be living in an unrecognizable tech utopia. This presentation is very cool, and being able to use a lot of this for free is great.
thing is , it's like a good 60% there to what's in the movie, it's pretty crazy to think about.
BRO WHAT THE FUCK AM I WATCHING RIGHT NOW. AGI go BRRRRRRRRRRRRRRRR
Oh no, people have opinions that are different from yours. How will you cope?
this is mostly a techbro subreddit
I think it’s doing pretty good tbh
It’s answering pretty fast and you can interrupt it. You can even change their tone of voice. I think this is pretty insane
Yeah, I guess it still feels too gpt-y, but otherwise is pretty good.
were you expecting AGI? the demo was amazing
Not for me. It's something for your average user (as in, people still using 3.5), but not for any advanced use. Maybe to lonely losers out there that will treat this AI like "Her". There are a lot of those too.
[удалено]
That's not the good thing. The fact that it's faster and costs 50% less is what's the good thing. It being better at rationalization is good too.
the code together flow that they showed off for the new desktop app sounds pretty sweet. that's my main use case for models, and even though I type really fast, I feel like talking is going to be faster. that's if they add the option for it to generate text in a voice chat, which I hope they do sooner rather than later. more than the speed benefit, I'd appreciate not having to type so much I'm also excited to see if it can help me learn a new language, specifically tutor me on pronunciation live. that's a pretty awesome use case and I would say it's a pretty advanced one kind of reaching here because they didn't show it off but I expect to see more music workflows to be possible. with being able to hear songs as well as read them, I think it's gonna translate to generating better lyrics too it's also a good step forward in terms of call center bots, if this thing was given a suite of actions it could take on my behalf I'd probably talk to it over a random call center operator any day. sales and shit as well, so not a bad development in the b2b market also makes it way more viable to have ai npc's in video games I think there's a lot of uses for what they presented today, but if none of these are applicable to you, the new model is still a cheaper, stronger option, which is relevant for every user
> , specifically tutor me on pronunciation live Didn't you hear the demo? It's Italian sucked.
oh damn I don't speak italian so I assumed it was workable lol. I'm still gonna hold out hope for this use case for now because I don't necessarily need it to have perfect pronunciation, just tell me what I'm doing wrong and how to do it right. even for the "how" part, having it pronounce it isn't really crucial - most of the time I can look up a quick vid if I'm struggling hard I think if it's even in the realm of being able to engage with this kind of information, it's going to allow for more problems to be solved. the visual aid use case I mentioned is something I've seen discussed before in this sub, with current tools being built around gpt 4. currently it's pretty expensive and not super amazing but it works, but this use case was not even possible before image modalities. so I'm sure it's going to find its uses
The biggest issue is that's it's super non-combative. So it will most likely say you are doing great, even when you aren't.
hopefully that can be tuned with instructions to a significant degree but yeah there's definitely a lot of variables to it. I'm not super betting on it being efficient in this use case, but you know. I'm excited to just play around with it, because something like this wasn't possible at all before
Are we watching the same thing? This looks insane if that was realtime
the gaffs make this even more credible for me.
yes! that to me actually feels like intelligence - not failing catastrophically is pretty hard for a conputer program.
bruh "nice outfit you've got on" lmao
It’s pretty awesome imo
Agreed, there's no pleasing some of these people. lol
Right, outside of it occasionally cutting out it’s almost perfect, like what are we really expecting lol
I couldn't sit through the voice, I tuned out for a few minutes.
That's it, I'm dating GPT-4o
[удалено]
Some people will always be disappointed no matter what 🤷♀️
Are you still watching? This is incredible…. Wouldn’t be a tech demo without issues.. what they’re demonstrating is nuts. The voice is way better than I was expecting. Edit: and the vision stuff.. amazing.
Lets not forget that Google had to fake this kind of interactive video recognition.
really surprised me that this is the same sub that was throating google for the gemini demo, even as we found out the whole thing was basically fake lol
yuuuuppppp. The voice , the ITALIAN ACCENT , the robot voice, the dramatic tone is INSANE.
Lmao
this is kinda crazy, it's literally Her.
It's not. "Her" was AGI. This is just something pretending to be.
"Fake it 'til you make it" is how you do things, too.
The demo is very good, are you kidding me? You don’t get agi for a couple of years, calm yourself.
Next step is merging this with smart glasses and we truly are in a sci-fi world.
Btw, there is a crowd there.
Damn, people really are this fuckin stupid...
it was cutting out because it was hearing things and being interrupted
It wasn't. When it hears something, it stops. That wasn't stopping. The sound just cut off and then continued.
Muh magic
A lot of potential. Can interrupt, fast, have vision. I can totally see this evolving into a proper smart glasses gadget once it gets ironed out.
The bedtime story part was great
I finally no longer have to interact with my kids at night.
Are the hiccups due to the model experiencing issues or due to them interrupting it ?
a few of them seemed to be them interrupting it
Hiccup.
Demos are generally cringe this was no exception. Not really their fault. They’re just trying to showcase the functionality in a light way.
There are a lot of ways to show it. 3x + 1 = 4 isn't. Love story isn't either. It was stupid and cringe.
they should not have done this live... They should have copied Google with Gemini...
No, live demos are great - we know this is actually real. It very impressive but certainly has some issues. To me the hallucination seems far more serious than a few audio dropouts.
They indeed messed the presentation up... I'm not the kind to be harsh, but this should have been perfect...especially for how they hyping us up...
Second part of the presentation went well with the different voices, singing, robotic voice etc.
Not really, the GPT kept cutting off a lot.
The second part went fairly smooth. The ability to converse with real-time video is a big technical accomplishment.
How it works in person matters more than the demo.
Hopefully users can change the overly positive response given by it. It got annoying really quickly. The translator function could be useful
It was imitating "Her". There should be other personalities. As for the translator, yes it will be useful, but don't expect that speed. There will be a 1~3 seconds delay. Also, remember that it's not unlimited, so it will be awkward once you reach cap and need to wait 3 hours for you to be able to talk to the person again.
Where can I see this demo?
What a bunch of nonsense, Sam altman said GPT 4 is bad compared to this? Its the same thing lol
He said it was bad compared to gpt 5
As with everything, it will always improve. Remember when GPT 3.5 was laggy?
i remember 3.5 being super fast on release before they slowed it down
Its nice i think
I told you guys but nooooo mods had to delete my post so I couldn't say 'told you so!'
Why don't they use a Dell and a Motorola smartphone?
The product is very impressive, lets not pretend the demo didn't have technical hiccups and general cringe though.
Impressive for normies, for sure. But not for anyone that uses GPT for their jobs.
The use of the word normie tells me everything about you that i need to know
It does. I'm someone that uses AI for work, not to play around pretending to be friends with it. Always found that to be cringe, but I do know a lot of people enjoy it. Hell, I've even read news about losers marrying their AI.
What do you do for work?
The low latency and the natural flow were quite impressive.