DeepSeek, a Chinese AI chatbot, has rapidly become one of the most popular apps on the App Store. Despite that popularity, a recent audit by NewsGuard found serious shortcomings in its accuracy and reliability.
DeepSeek’s Accuracy Issues
According to NewsGuard’s assessment, DeepSeek fails to provide accurate information about news and current events 83% of the time. Among 11 leading AI chatbots tested, DeepSeek ranked second to last.
Key Accuracy Findings:
- 30% of responses contained false information.
- 53% of responses failed to answer queries meaningfully.
- Only 17% of responses successfully debunked false claims.
- The chatbot’s 83% fail rate was 21 percentage points above the industry average of 62%.
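The figures above fit together with simple arithmetic: the 83% fail rate is the sum of the false-information and non-answer shares, and the debunk rate is the remainder. The short Python sketch below only re-derives the numbers quoted in this article. The variable names are illustrative and are not part of NewsGuard's methodology.

```python
# Percentages as reported in the findings above (whole percentage points).
false_info_pct = 30         # responses containing false information
non_answer_pct = 53         # responses that failed to answer meaningfully
debunk_pct = 17             # responses that debunked the false claim
industry_avg_fail_pct = 62  # average fail rate across the chatbots tested

# A response counts as a failure if it is false or a non-answer.
fail_rate_pct = false_info_pct + non_answer_pct

# Gap between DeepSeek and the industry average, in percentage points.
gap_points = fail_rate_pct - industry_avg_fail_pct

# The three outcome categories should account for every response.
total_pct = fail_rate_pct + debunk_pct

print(f"Fail rate: {fail_rate_pct}%")
print(f"Gap vs. industry average: {gap_points} points")
print(f"Categories sum to: {total_pct}%")
```

Note that the gap is expressed in percentage points, not as a relative percentage increase.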
Chinese Government Messaging Influence
One of the more concerning aspects of DeepSeek is its tendency to insert Chinese government positions into unrelated discussions. For instance, when asked about a conflict in Syria, DeepSeek framed its response around China’s principles of non-interference, even though the question had no direct relevance to China.
Technical Shortcomings & Misinformation Risks
Despite claims that it matches OpenAI’s capabilities at a training cost of just $5.6 million, DeepSeek shows substantial knowledge gaps. The chatbot itself states that its training data extends only through October 2023, which leaves it unable to address more recent events accurately.
NewsGuard also found that DeepSeek is highly vulnerable to misinformation, particularly when responding to prompts designed to test AI models’ susceptibility to manipulation. Eight of DeepSeek’s nine false responses were triggered by such prompts, raising concerns about how bad actors could weaponize the chatbot for misinformation campaigns.
Industry Implications
DeepSeek’s limitations highlight critical issues in the ongoing AI competition between China and the United States.
Notably, DeepSeek’s terms of use place the responsibility on users to verify the authenticity of its outputs, a policy NewsGuard criticizes as a “hands-off” approach that shifts accountability from developers to end users.
Looking Ahead
Moving forward, DeepSeek will be included in NewsGuard’s monthly AI audits, where its performance will be tracked alongside other chatbots.
For marketers, content creators, and everyday users, DeepSeek’s rapid rise in popularity serves as a reminder: always fact-check AI-generated information with reliable sources before relying on it.