Content provided by Shimin Zhang & Dan Lasky, Shimin Zhang, and Dan Lasky. All podcast content, including episodes, graphics, and podcast descriptions, is uploaded and provided directly by Shimin Zhang & Dan Lasky, Shimin Zhang, and Dan Lasky or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process described here: https://ru.player.fm/legal.

Claude Opus 4.5, Olmo 3, and a Paper on Diffusion + Auto Regression

47:45
 

In this episode of Artificial Developer Intelligence, hosts Shimin and Dan explore the latest advancements in AI models, including the release of Claude Opus 4.5 and Gemini 3. They discuss the implications of these models on software engineering, the rise of open-source models like Olmo 3, and the enhancements in the Claude Developer Platform. The conversation also delves into the challenges of relying on AI for coding tasks, the potential pitfalls of the AI bubble, and the future of written exams in the age of AI.

Takeaways

  • Claude Opus 4.5 sets new benchmarks, enhances usability, and reduces token consumption.
  • The introduction of open-source models like Olmo 3 is a significant development in AI.
  • The future of written exams may be challenged by AI's ability to generate human-like responses.
  • Relying too heavily on AI can lead to a lack of critical thinking and problem-solving skills.
  • The AI bubble is at 25 seconds to midnight.
  • Recent research suggests that AI models can improve their performance by emulating query-based search.
  • The importance of prompt engineering in AI interactions is highlighted.

Resources Mentioned
Introducing Claude Opus 4.5
Build with Nano Banana Pro, our Gemini 3 Pro Image model
Andrej Karpathy's Post about Nano Banana Pro
Olmo 3: Charting a path through the model flow to lead open-source AI
Introducing advanced tool use on the Claude Developer Platform
TiDAR: Think in Diffusion, Talk in Autoregression
SSRL: SELF-SEARCH REINFORCEMENT LEARNING
Mira Murati's Thinking Machines seeks $50 billion valuation in funding talks, Bloomberg News reports
Boom, bubble, bust, boom. Why should AI be different?
Nvidia didn’t save the market. What’s next for the AI trade?

Chapters

  • (00:00) - Introduction to Artificial Developer Intelligence
  • (01:25) - Claude Opus 4.5
  • (07:02) - Exploring Gemini 3 and Image Models
  • (11:24) - Olmo 3 and The Rise of Open Flow Models
  • (15:46) - Innovations in AI Tools and Platforms
  • (19:33) - Research Insights: Diffusion and Auto-Regression Models
  • (23:39) - Advancements in AI Output Efficiency
  • (25:45) - Exploring Self Search Reinforcement Learning
  • (27:48) - The Dilemma of Language Models
  • (30:11) - Prompt Engineering and Search Integration
  • (32:55) - Dan's Rants on AI Limitations
  • (38:17) - 2 Minutes to Midnight
  • (46:41) - Outro

Connect with ADIPod

4 episodes
