"Chinese AI Giant DeepSeek R1 Rivals OpenAI with Less Power"

Coin WorldMonday, Jan 27, 2025 7:56 am ET
1min read

Chinese AI Model DeepSeek-R1 Matches OpenAI Performance with Reduced Computing Power

Chinese AI startup DeepSeek has made waves in the global AI community with its cutting-edge, open-source models and low inference costs. Founded in 2023 by Liang Wenfeng, a former head of the High-Flyer quantitative hedge fund, DeepSeek has quickly risen to the top of the AI market with its innovative approach to AI research and development.

DeepSeek's latest model, DeepSeek-R1, has garnered significant attention for its impressive performance. The neural network, released on January 20, 2023, has attracted community attention and led to sell-offs in stock and cryptocurrency markets over the weekend of January 25-26.

DeepSeek's open-source model competes with leading AI technologies, offering advanced reasoning and performance benchmarks. The model has 671 billion parameters, surpassing Llama 3.1's 405 billion parameters. This metric reflects the AI's ability to adapt to more complex applications and provide more accurate responses.

The development of the neural network took two months, costing $5.58 million and requiring significantly fewer computational resources compared to larger tech companies. Nvidia H800 chips were used, optimizing the use of computing power in the model training process.

Thanks to the new AI model DeepSeek-R1, the company's chatbot skyrocketed in the rankings of free apps on the App Store in the USA, surpassing even ChatGPT.

DeepSeek has developed a range of AI models that have been praised for their reasoning capabilities, problem-solving capabilities, and cost-effectiveness. The company's flagship model, DeepSeek R1, is a large language model that has been trained using a reinforcement learning (RL) approach, allowing it to learn independently and develop self-verification, reflection, and chain-of-thought (CoT) capabilities.

DeepSeek R1 has been released in six smaller versions that are small enough to run locally on laptops, with one of them outperforming OpenAI’s o1-mini on certain benchmarks.

DeepSeek’s AI models are designed to be highly efficient, with a focus on maximizing software-driven resource optimization and embracing open-source methods. This approach not only mitigates resource constraints but also accelerates the development of cutting-edge technologies

Comments



Add a public comment...
No comments

No comments yet

Disclaimer: The news articles available on this platform are generated in whole or in part by artificial intelligence and may not have been reviewed or fact checked by human editors. While we make reasonable efforts to ensure the quality and accuracy of the content, we make no representations or warranties, express or implied, as to the truthfulness, reliability, completeness, or timeliness of any information provided. It is your sole responsibility to independently verify any facts, statements, or claims prior to acting upon them. Ainvest Fintech Inc expressly disclaims all liability for any loss, damage, or harm arising from the use of or reliance on AI-generated content, including but not limited to direct, indirect, incidental, or consequential damages.