DeepSeek causes industry panic as OpenAI and Meta dump billions into development plans

DeepSeek causes industry panic as OpenAI and Meta dump billions into development plans
DeepSeek causes industry panic as OpenAI and Meta dump billions into development plans

what to know for now

๐ŸŒ DeepSeek shakes AI landscape. Chinese startup DeepSeek introduced AI models with performance comparable to leading chatbots at lower costs, challenging the necessity for extensive capital in AI development. Their open-source approach and efficiency navigate US export restrictions, indicating shifts in global AI competition. Read more

๐Ÿ“‰ AI disruption impacts markets. DeepSeek's emergence led to a decline in global technology stocks, significantly affecting Nvidia and other semiconductor suppliers. Investors are evaluating the implications for US-based AI competitors and their hardware dependencies. Read more

โš™๏ธ ChatGPT Operator seeks to automate all tasks. OpenAI announced Operator, an agent leveraging GPT-4o that autonomously manages web-based tasks such as booking trips, purchasing supplies, and deploying software. Available to ChatGPT Pro users, it aims to enhance productivity for consumers and businesses alike. Read more

๐Ÿค– Google releases Gemini 2.0 Flash Thinking beta. The experimental model scores 73.3% on AIME and 74.2% on GPQA Diamond benchmarks, processes up to one million tokens, and includes native code execution. Released for free in beta, it challenges OpenAIโ€™s premium strategy. Read more

๐Ÿงช AI Research of the Week

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
From DeepSeek-AI

Jakeโ€™s Take: It would be silly for DeepSeekโ€™s extremely popular paper on their new R1 model to not be the research of the week. The paper introduces DeepSeek-R1-Zero and DeepSeek-R1, reasoning-focused models developed using reinforcement learning (RL). DeepSeek-R1-Zero explores reasoning capabilities purely through RL without supervised fine-tuning, achieving remarkable benchmark results but suffering from readability and language consistency issues. DeepSeek-R1 builds upon this by incorporating cold-start data and multi-stage training, significantly enhancing reasoning performance and user accessibility. Additionally, reasoning capabilities are distilled into smaller models, enabling cost-efficient, high-performing alternatives that outperform other open-source models.

The accessibility and simplicity of the concepts in this paper that lead to such a large improvement represents a significant moment in the world of post-training reasoning models with R1 performing on the level of OpenAIโ€™s flagship reasoner o1. OpenAI and others are feeling the heat and itโ€™s not unreasonable to assume R1 will push Americaโ€™s AI labs to releases models faster and with lower costs to the consumer.

what to know for later

๐Ÿง  Stargate: massive funding for US AI. A private partnership involving OpenAI, SoftBank, Oracle, and MGX plans to construct data centers across the U.S. with a $500 billion investment to develop and deploy AI technologies. The initiative emphasizes economic growth and national security while facing funding and safety challenges.

๐Ÿ†“ OpenAI to launch o3-mini for free. OpenAI's free ChatGPT tier will use the o3-mini model, which will offer faster response times and lower computational requirements compared to GPT-4o. Paid subscribers will continue to have access to existing models alongside extensive o3-mini usage, enhancing flexibility for various tasks. Read more

๐Ÿ—๏ธ Meta to invest $65 billion in AI. The company plans to build a new data center the size of Manhattan and expand its AI teams. Meta will deploy around one gigawatt of computing power and 1.3 million GPUs by year-end. Read more

๐Ÿค Perplexity AI proposes TikTok merger. Perplexity AI submitted a plan to merge with TikTok's U.S. operations, offering up to 50% stake to the U.S. government after a $300 billion IPO. The proposal excludes TikTok's proprietary algorithm and ensures the government holds no voting power or board positions. Read more