DeepSeek, an emerging artificial intelligence startup based in China, has made headlines with its innovative approach to developing high-performing language models. This laboratory, which does not depend on funding from tech giants such as Baidu or Alibaba, has established itself as one of the few major players in AI in China.
By bringing together a team of young talents from the top Chinese universities, such as Peking University and Tsinghua University, DeepSeek aimed to transcend the traditional barriers of the industry. This bold approach has fostered a collaborative and innovative corporate culture centered around unconventional research.
The Origins of DeepSeek
Founded by a group of AI enthusiasts, DeepSeek initially began its journey by delving into fundamental research. Liang, the founder, decided not to recruit experienced engineers but rather young researchers, often recent graduates, driven by the desire to prove their worth in a rapidly expanding field.
This recruitment strategy created an environment where innovation could thrive, allowing researchers to work freely on ambitious projects. Paradoxically, while many traditional companies face internal competition for resources, DeepSeek has cultivated a culture of support and knowledge sharing.
A Culture of Collaborative Innovation
The unique culture of the company, encouraged by the youth of its employees, has fostered a mindset of experimentation. This contrasts sharply with other tech companies in China, where rivalry for resources can stifle creativity. The young researchers at DeepSeek, who have often won awards and published papers in leading journals, bring valuable expertise to the team despite a lack of industry experience.
Liang argues that this lack of experience can actually work in the team’s favor. Young researchers are often more willing to dedicate their time and energy to high-risk, low-return projects, driven by a sense of duty and a passion for innovation. By focusing on solving the most complex questions in AI, they aim to leave their mark on the industry.
The Challenges of the AI Industry in China
In October 2022, new U.S. export regulations severely restricted Chinese companies’ access to advanced technologies, particularly high-performance chips like Nvidia’s H100. DeepSeek thus faced a major challenge as it pursued ambitions to compete with heavyweights like OpenAI and Meta.
Despite initially accumulating a significant reserve of these chips, DeepSeek had to reassess its training methods for its models. Liang explained that the company’s real constraint was not funding but these export controls. This led the startup to adopt efficient optimization methods and architectures.
Technical Innovations and Optimization
To overcome the hurdles posed by these restrictions, DeepSeek developed several technical strategies. The company optimized its model architecture by employing various engineering tricks, including custom communication schemes between chips, reducing field sizes to save memory, and innovative use of a mixed model approach.
The combination of old but effective methods allowed DeepSeek to market an AI model that requires fewer resources while maintaining a high level of performance. Indeed, the DeepSeek model is now capable of achieving results similar to, or even exceeding, those of OpenAI’s o1 model while requiring less computational power. This shift in approach could well alter the dynamics of the current market.
The Performances of DeepSeek
DeepSeek R1, the latest model developed by the company, has made waves in the field of artificial intelligence. Within a few days, the startup transformed from an unknown entity into a key player in AI, thanks to exceptional performance and a development cost that defies all competition.
This open-source model was designed to be accessible to a wide audience while ensuring results comparable to those of the most advanced models currently available. The performance of DeepSeek R1 is measured by several recognized industry standards, and it appears that this model even surpasses OpenAI’s o1 model across multiple criteria for speed, efficiency, and cost.
A Revolution in Development Cost
What primarily distinguishes DeepSeek from its competitors like OpenAI is the development cost of its model. While other AI giants invest billions, DeepSeek managed to develop its model for only 5 million dollars. This cost difference, while maintaining high performance levels, could change the game in a sector where AI investments are on the rise.
By offering a more cost-effective solution, DeepSeek positions itself not only as an alternative but also creates new opportunities for innovation within the AI sector. This signals a potentially transformative era for the industry, where the financial accessibility of AI could spur wider adoption and foster new startups.
The Open Source Approach of DeepSeek
Another notable aspect of DeepSeek’s strategy is its commitment to open source. In a world where most large players keep their models locked away, DeepSeek is challenging this norm by making its model accessible to everyone. This approach not only fosters innovation but also attracts a community of contributors who can enhance and evolve the model.
Many experts believe that this strategy could be the key to catching up with Western companies such as OpenAI, Anthropic, and Meta, which dominate the market thanks to substantial resources. By developing open-source models, DeepSeek could not only attract more users but also benefit from the valuable contributions of an expanded community.
An Opportunity for Global Partnership
Companies based in China, like DeepSeek, find themselves at a critical crossroads in the face of export challenges. However, the willingness to create an open-source model highlights a potential opportunity for international collaboration. By sharing its innovations globally, DeepSeek could facilitate the emergence of a contributory AI ecosystem that fosters knowledge sharing.
This dynamic could also be seen as a potential response to U.S. restrictions on advanced technologies, thereby creating a virtuous cycle for innovation and development in artificial intelligence.
Conclusion: The Future of DeepSeek
With major challenges like export controls and stiff competition in the sector, the future of DeepSeek looks promising. By focusing on innovation, optimization, and an open-source approach, it could redefine its role not only in the AI market in China but also on the global stage. As the startup continues to progress, all eyes will be on its ability to turn its ambitions into tangible achievements.







