"Chinese AI Giant DeepSeek R1 Rivals OpenAI with Less Power"

Chinese AI Model DeepSeek-R1 Matches OpenAI Performance with Reduced Computing Power
Chinese AI startup DeepSeek has made waves in the global AI community with its open-source models and low inference costs. Founded in 2023 by Liang Wenfeng, co-founder of the High-Flyer quantitative hedge fund, DeepSeek has quickly gained prominence in the AI market.
DeepSeek's latest model, DeepSeek-R1, has garnered significant attention for its performance. Its release on January 20, 2025, was followed by sell-offs in stock and cryptocurrency markets around the weekend of January 25-26.
DeepSeek's open-source model competes with leading AI systems on reasoning and other performance benchmarks. The model has 671 billion parameters, compared with 405 billion for the largest Llama 3.1 model. Parameter count is a rough indicator of a model's capacity to handle complex tasks, though it does not by itself guarantee more accurate responses.
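To put the two parameter counts in perspective, the short Python sketch below computes their ratio and a rough estimate of the storage needed for the weights alone. The bytes-per-parameter values are illustrative assumptions, not figures reported by DeepSeek or Meta, and the helper names are hypothetical.

```python
# Parameter counts as quoted in the article.
DEEPSEEK_R1_PARAMS = 671e9
LLAMA_3_1_PARAMS = 405e9


def param_ratio(a: float, b: float) -> float:
    """How many times larger model `a` is than model `b`, by parameter count."""
    return a / b


def weight_storage_gb(num_params: float, bytes_per_param: float) -> float:
    """Approximate size of the weights alone, in gigabytes (1 GB = 1e9 bytes).

    `bytes_per_param` is an assumption chosen by the caller, for example
    2 for 16-bit weights or 1 for 8-bit weights.
    """
    return num_params * bytes_per_param / 1e9


if __name__ == "__main__":
    ratio = param_ratio(DEEPSEEK_R1_PARAMS, LLAMA_3_1_PARAMS)
    print(f"DeepSeek-R1 has about {ratio:.2f}x the parameters of Llama 3.1 405B")
    for bytes_per_param in (2, 1):  # assumed storage precisions
        r1 = weight_storage_gb(DEEPSEEK_R1_PARAMS, bytes_per_param)
        llama = weight_storage_gb(LLAMA_3_1_PARAMS, bytes_per_param)
        print(f"{bytes_per_param} byte(s)/param: R1 ~{r1:.0f} GB, Llama ~{llama:.0f} GB")
```

The estimate covers only the stored weights. Actual memory use during inference or training is higher and depends on details the article does not give.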
Development of the neural network reportedly took two months and cost $5.58 million, requiring significantly fewer computational resources than comparable models from larger tech companies. The model was trained on Nvidia H800 chips, with the training process optimized to make efficient use of the available computing power.
On the strength of DeepSeek-R1, the company's chatbot climbed to the top of the free app rankings on the App Store in the United States, surpassing even ChatGPT.
DeepSeek has developed a range of AI models that have been praised for their reasoning and problem-solving capabilities and for their cost-effectiveness. The company's flagship model, DeepSeek-R1, is a large language model trained with a reinforcement learning (RL) approach, which allows it to learn independently and develop self-verification, reflection, and chain-of-thought (CoT) capabilities.
DeepSeek-R1 has also been released in six smaller distilled versions, some of them small enough to run locally on laptops, with one outperforming OpenAI’s o1-mini on certain benchmarks.
DeepSeek’s AI models are designed to be highly efficient, relying on software-level optimization to get the most out of limited hardware and on open-source methods. This approach not only mitigates resource constraints but also accelerates the development of cutting-edge technologies.