Transcript of Strawberry Q* SOON, Apple Intelligence Updates, $2,000/mo ChatGPT, Replit Agents (AI News)

OpenAI Strawberry Model Imminent

OpenAI's Strawberry model is imminent. That's our first story for today. According to Reuters, OpenAI plans to release Strawberry for ChatGPT in two weeks, and Jimmy Apples, the only reliable leaker, has mentioned it as well. Jimmy Apples, last week: "All quiet on the western front. A heavy fog of schizo energy. I'm ready to be heard again." Let's see, leaning forward. Then Jimmy Apples, 15 hours ago: "This week I take a small step out of the cave of patience." I hope he is not forced to get back in that cave, because I want to see some new stuff out of OpenAI. Three hours ago: "The age of patience is over. It's release season." So it seems like it's coming soon.

Let's take a look at the article. OpenAI plans to release Strawberry, its reasoning-focused artificial intelligence, as part of its ChatGPT service in the next two weeks, The Information reported on Tuesday. And here's the article from The Information. We should explain that while Strawberry is part of ChatGPT, it's a standalone offering. Exactly how it will be offered is unclear. One option is for Strawberry to be included in the drop-down menu of AI models customers can pick from to power ChatGPT. And it's quite different from the regular services, with some advantages and shortcomings.

So what differentiates Strawberry is its ability to think, to reason, to plan, to take its time, and that is the key differentiator. It will actually take more inference time when you submit a prompt. The initial version will only be able to take in and produce text, not images, so it's not yet multimodal.

Then there's pricing. Strawberry is likely to be priced differently to OpenAI's chatbot, which has free and subscription pricing tiers. They're not sure how it's going to be priced yet, but I think they're probably going to charge more for it. Strawberry is expected to be easier to use than GPT-4o for complex or multi-step queries. Now, apparently some people have already used it. Some people who've used a Strawberry prototype have
complained that its slightly better responses compared to OpenAI's currently released GPT-4o aren't worth the extra 10 to 20 seconds of waiting. And here's the thing: 95% of prompts and use cases can be accomplished with GPT-4o mini, not even GPT-4o. Then you layer on GPT-4o for the last 3 to 4%, and then finally maybe you need Strawberry for the most complex reasoning.

Now, a week or so ago, The Information had reported that Strawberry is mainly used to generate synthetic data for Orion, OpenAI's next-generation frontier model. So this is a little different than what we thought was happening, but either way, we're getting something new from OpenAI soon, according to The Information and, of course, according to Jimmy Apples.

Apple's Major Announcements

And the next big story of this week is Apple's major announcements from yesterday. Apple had its conference yesterday. They made a number of announcements around the iPhone, the Watch, and Apple Intelligence. But here's the thing: even if you buy the brand new iPhone 16 Pro Max, whatever, spend $1,200 or $1,300 on this brand new hardware device, which really is just an incremental improvement over the 15, the 14, and so on, you still won't get Apple Intelligence out of the box. It's not going to be shipped natively, and that is very disappointing. Now, you already know that I'm going to get the new phone, I'm going to install the beta, and I'm going to play around with it, and I'll review it for you, of course.

And just a quick reminder of Apple Intelligence: it is its own native model running on device, for the most part. Now, whenever there are world-knowledge questions where the local model just can't do it, it's going to offload it to ChatGPT in the cloud, but it will ask you first. This is the partnership with OpenAI. Now, I really like Apple's approach here: do as much as you can on device and only offload it if you absolutely need to. And Apple actually has a larger version of their model in their private cloud before they even ask you to offload to OpenAI's models.

I cannot wait to have a seriously capable Siri, an agent, an AI that is performing tasks and completing things on my behalf 24 hours a day. That is my perfect vision for the future of how I want to interact with artificial intelligence, and I really believe Apple and Google are both well positioned to accomplish that, because they both have the hardware in your pocket to actually interface with you.

And the iPhone 16's camera will actually be able to do a lot of what the Meta AI glasses can do. You take a picture of something and it will tell you what you're looking at. So in The Information article, it says the iPhone 16 camera will also include visual intelligence, allowing customers to look up information about the scene in front of them, such as finding reviews for a restaurant they're standing in front of. I don't really use it all that often, I have to be honest. I love my Meta AI glasses, but I don't use the visual AI all that much, and I actually don't use the voice AI all that much either. I use it all the time for taking pictures, though. But maybe this is going to change when we have it all built into our phones. We will see.

Nvidia Subpoena in DOJ Antitrust Probe

Next, according to Bloomberg, Nvidia gets DOJ (Department of Justice) subpoena in escalating antitrust probe. Nvidia is the absolute monster in the AI world. They are one of the top three most valuable companies in the entire world. They basically sell the majority of all chips powering fine-tuning, training, and inference, and they have this little library called CUDA, which is proprietary to Nvidia and is the software that essentially powers the GPUs. And that is probably what they're getting in trouble for, because the CUDA library is so powerful and so deeply ingrained in the Nvidia ecosystem that it's hard for anybody to justify buying other companies' chips, simply because of the CUDA library. Now, even though Bloomberg reported this subpoena, it seems Nvidia is denying they ever got it, and it already
affected their stock price. So it's interesting to see that a major publication like Bloomberg has reported it but Nvidia denied it, so of course we're getting conflicting information right now.

Honeycomb AI Beats Amazon in SWE-bench Leaderboard

Next, we have a new state-of-the-art model on the SWE-bench leaderboard, which is a coding AI leaderboard. Honeycomb, it's called, just beat out Amazon Q with a 22.06% score. Devin, which absolutely took the internet by surprise, got a 13.86%, and Honeycomb is number one. So here are the two Honeycomb founders, 19-year-old dropouts from MIT. Of course, they are in Y Combinator, and it is Jared Friedman, a YC partner, who is reporting this information. So Honeycomb isn't just the model; it's a full product that you can use right now as an AI programmer. Now, of course, trying to look at this with a bit more skepticism, Jared Friedman is obviously promoting this because he has a financial interest in doing so. So here is Honeycomb. It does look extremely similar to Devin. I don't believe it's open source, so that's a shame, but still, very cool.

You.com Raises $50 Million for AI Agent Expansion

Next, You.com raises $50 million and predicts more AI agents than people by 2025. That is less than a few months away. You.com, the AI-powered search and productivity platform, announced today it has raised $50 million in Series B funding to continue their work in the enterprise AI world. It was led by Georgian, with participation from tech giants Nvidia and Salesforce Ventures, in addition to Day One Ventures. They've raised about a hundred million so far. Now, I haven't used You.com, but going to their homepage, it looks exactly like ChatGPT. I'm sure they have a bunch more functionality under the hood, though.

Sam Altman's Massive AI Infrastructure Investment

Next, according to Bloomberg, Sam Altman plans on spending tens of billions of dollars building out AI infrastructure in the United States, which I love to hear. Now, in the Yahoo Finance article about the same topic, it says OpenAI chief executive
officer Sam Altman plans for a massive buildout of the machinery and systems needed for artificial intelligence, and he is beginning with an effort in the United States slated to cost tens of billions of dollars, according to a person familiar with the matter. Altman had spent the early part of the year seeking the US government's blessing for the project, which aims to form a coalition of global investors to fund the costly physical infrastructure needed to support rapid AI development.

Now, we already know we don't have enough chips, and we also don't have enough energy, to reach AGI. So the fact that we're getting this massive investment into the manufacturing process, into the chip-making process, is phenomenal. I love hearing it. Backers are likely to include investors in Canada, Korea, Japan, and the UAE, and OpenAI also envisions other private companies being involved in the project. So of course this is great news for the US, and I hope we continue to invest in building everything we need to reach AGI on our soil.

Groq's New Vision Model and Speed Upgrades

Next, Groq is making lots of progress. They now have a vision model, which is powered by their Groq chips, which means it is incredibly fast. You can now try it in the API console. It is the LLaVA 1.5 7B model, so simply upload an image and you can ask any question about it, and it is lightning fast. And as a reminder, I'm an investor in Groq, little disclosure there.

And more news from Groq: they have just increased the speed at which you can run inference on the Llama models. On the Llama 3.1 70B model, they are reaching 544 tokens per second. That is astounding. Now, if you've watched this channel at all, you know I love AI agents, and inference speed is especially valuable when you're building AI agents, because eventually those agents can go off and accomplish tasks on your behalf 24 hours a day, and that inference speed is just going to allow them to accomplish so much more, and more quickly.

Replit Agents Revolutionize Online Coding

All right,
next, this is a bit late, but Replit has announced Replit Agents, which apparently everybody's using now instead of Cursor. Replit is a fantastic, completely online code editor, and now they have AI built in natively, just like Cursor, except it can build you anything in the cloud and deploy it easily. So here's a little example of how it works. He put in a prompt, very short, over here, and submitted it. There's a plan of what to build, and the code gets built in real time. It is so fast. All the files are being built out, and all of this can be deployed and hosted through Replit so easily. I've actually only used Replit a few times, but every time I use it, it's fantastic. So look at that: with just a single two-sentence prompt, we now have a fully functional map application. So really, really impressive. If you haven't checked it out, definitely check it out. I'll drop links to this in the description below.

OpenAI Considering $2,000/Month Subscriptions

Next, according to The Information, OpenAI has been considering higher price tiers for their subscriptions, all the way up to $2,000 per month. Now, if they achieve AGI, I'm willing to pay basically anything for it, but let's take a step back from AGI. For $2,000 a month, what would you be willing to pay for? If AI could take 20% of my workload off my plate, I would absolutely pay $2,000 a month. Now, it would have to be consistent and reliable and high quality, and AI models are not quite there yet, but maybe soon.

And so executives have discussed high-priced subscriptions for upcoming large language models such as Strawberry and a new flagship LLM dubbed Orion. We've already talked about both of those things extensively, so definitely check out those videos if you haven't seen them. The pricing discussions come as the ChatGPT maker looks to raise billions of dollars in capital to make up for the billions of dollars it's losing per year. Now, anybody who thinks OpenAI is going out of business because they're losing so much money, I think you need to take another look at
how revolutionary their technology is and, really, how a lot of tech companies work. They're going to lose money for a long time until they make money, and then they basically print money. At least, that's the hope. So let me know how much you'd be willing to pay for Strawberry, for Orion, and then for AGI. Drop that in the comments.

Fine-tuned AI Model: Phind 405B Launched

Next, a new model is launched: Phind-405B. This actually came out last week; I just haven't had a chance to talk about it. Phind-405B, obviously, is a fine-tune of Llama 3.1 405B. It scores a 92% on HumanEval, matching Claude 3.5 Sonnet. Now, given the controversy with Reflection 70B, I'm going to continue to look at these releases with a bit of skepticism. However, Phind has been releasing models for a while now, they've actually released really good models in the past, and they all seem to be legit.

We're introducing a new flagship model, Phind-405B, along with a new Phind Instant model that offers lightning-fast search speeds for all of your programming and curiosity questions. We're excited to announce the launch of Phind-405B, our new flagship model. Based on the excellent Meta Llama 3.1 405B, we've trained it to be a state-of-the-art model for programming and technical tasks. It can ingest 128K tokens of context, with a 32K context window available at launch, and it is now available for Phind Pro users. So it is paid.

OpenAI's Internal Cultural Shift

Next, it seems OpenAI's cultural changes internally are actually sticking. This is a post by Jason Kwon, the CSO of OpenAI, weighing in on California bill 1047. Here's what we said to colleagues last Friday. Hi everyone, we've heard that there is a petition circulating among employees of AI labs where people have the option to express support for the CA 1047 bill. We want you to know that if you want to sign the petition, you should feel free to. Part of what makes OpenAI work is that we have a diversity of personal views, regardless of our company position. OK, so great. So they are finally allowing
OpenAI employees to speak out and voice their own opinions, which is fine, which is good. Do you know about California 1047? I've looked into it a little bit. Let me know what you think in the comments.

SambaNova Systems Launches Fastest AI Platform

Next, SambaNova Systems releases the world's fastest AI platform. SambaNova Cloud runs Llama 3.1 405B at 132 tokens per second at full precision. That is lightning fast. Llama 3.1 70B at 570 tokens per second is comparable to Groq, but unfortunately, right now Groq doesn't support 405B. They did briefly; it got overloaded because of popularity, and then they had to take it down, but I'm sure they're going to release it again soon. So SambaNova: 10x faster inference than GPUs. You can definitely take a look at SambaNova, use their cloud platform, and get lightning-fast speeds. Congratulations to them on their launch.

DeepSeek v2.5: Leading AI in Coding and Math

And for our last story, the company DeepSeek has released a new model, and it is outstanding at coding and math. Here we can see DeepSeek-V2.5, open source, and as you can see, it beats basically every other model at reasoning, math, arithmetic, knowledge, and coding. I should probably do a full test of it. Let me know if you want to see that in the comments below. And look at the cost per million tokens: it is 14 cents per million input tokens and 28 cents per million output tokens. That is so cheap. And if you want to host it yourself, it's open source; you can download it. So congratulations to DeepSeek for their release.

So that's it for today. If you enjoyed this video, please consider giving a like and subscribe, and I'll see you in the next one.
