DeepSeek Tops App Store Charts But Struggles with Accuracy

DeepSeek, a Chinese AI chatbot, has climbed to the top of the App Store charts. Despite its popularity, a recent NewsGuard audit found serious shortcomings in its accuracy and reliability.

DeepSeek’s Accuracy Issues

According to NewsGuard’s assessment, DeepSeek fails to provide accurate information about news and current events 83% of the time. Among 11 leading AI chatbots tested, DeepSeek ranked second to last.

Key Accuracy Findings:

30% of responses contained false information.
53% of responses failed to answer queries meaningfully.
Only 17% of responses successfully debunked false claims.
The chatbot’s 83% fail rate was 21 percentage points above the industry average of 62%.
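
To make the arithmetic behind these figures explicit, here is a minimal sketch in Python. It assumes the 83% fail rate is simply the sum of the false-information and non-answer shares, which is consistent with the numbers reported above. The variable names are illustrative and do not come from NewsGuard.

```python
# Shares of DeepSeek responses, as reported in the findings above.
false_pct = 30        # responses that contained false information
non_answer_pct = 53   # responses that failed to answer meaningfully
debunk_pct = 17       # responses that debunked the false claim

industry_avg_fail_pct = 62  # industry average fail rate cited in the article

# Assumption: "fail rate" = false responses + non-answers.
fail_pct = false_pct + non_answer_pct

# Sanity check: the three categories should cover every response.
assert fail_pct + debunk_pct == 100

# Gap to the industry average, in percentage points.
gap_points = fail_pct - industry_avg_fail_pct

print(f"Fail rate: {fail_pct}%")
print(f"Above industry average by: {gap_points} percentage points")
```

Under that assumption, the three categories account for all responses, and the gap to the industry average comes out to 21 percentage points.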

Chinese Government Messaging Influence

One of the more concerning aspects of DeepSeek is its tendency to insert Chinese government positions into unrelated discussions. For instance, when asked about a conflict in Syria, DeepSeek framed its response around China’s principles of non-interference, even though the question had no direct relevance to China.

Technical Shortcomings & Misinformation Risks

Despite claims of matching OpenAI’s capabilities with just $5.6 million in training costs, DeepSeek demonstrates substantial knowledge gaps. The chatbot explicitly states that its training data only extends through October 2023, making it incapable of addressing recent events accurately.

NewsGuard also found that DeepSeek is highly vulnerable to misinformation, particularly when responding to prompts designed to test AI models’ susceptibility to manipulation. Eight of DeepSeek’s nine false responses were triggered by such prompts, raising concerns about how bad actors could weaponize the chatbot for misinformation campaigns.

Industry Implications

DeepSeek’s limitations highlight critical issues in the ongoing AI competition between China and the United States.

Notably, DeepSeek’s terms of use place the responsibility on users to verify the authenticity of its outputs, a policy NewsGuard criticizes as a “hands-off” approach that shifts accountability from developers to end users.

Looking Ahead

Moving forward, DeepSeek will be included in NewsGuard’s monthly AI audits, where its performance will be tracked alongside other chatbots.

For marketers, content creators, and everyday users, DeepSeek’s rapid rise in popularity serves as a reminder: always fact-check AI-generated information with reliable sources before relying on it.

Kumail Mehdi

I am a goal-driven person who works well with people and likes to challenge myself in different ways. I also want a great career that can develop me as an individual and an employer as well, so that I can be part of a positive working environment where I can learn and grow. My interests include reading, swimming, and going out for fun.
