Clash of Great Powers in the Field of Generative AI

Since its launch of ChatGPT in 2022, OpenAI, the US-based company, has maintained dominant leadership in the generative artificial intelligence (Generative AI) market. The field of AI innovation has long become a fiercely contested…


Dr Maurice Tse and Mr Clive Ho

19 March 2025

The American company OpenAI has dominated the Generative AI market since launching ChatGPT in 2022. AI-driven innovation has become the Holy Grail for players in the field. To curb China’s advancement in this area, the US has banned the export of high-end chips to the country and restricted Chinese users from accessing ChatGPT, making it necessary for Hong Kong users to rely on Virtual Private Networks to use the service. In 2024, a Chinese start-up turned the tables by introducing a generative AI model, DeepSeek, aiming for ChapGPT’s leading position.

By January 2025, DeepSeek took over ChatGPT’s spot as the most downloaded free app on Apple’s US App Store, causing Nvidia’s stock price to drop by 18% in a single day. Many cannot help but wonder: will DeepSeek simply replace ChatGPT?

New star making a splash

Neither DeepSeek nor ChatGPT is trained to play chess. Last month, however, under the arrangements of Levy Rozman, an American chess International Master, they did just that in a chess game. In the end, DeepSeek pulled a trick from Sun Zi’s Art of War and, with a clever stratagem, turned the tide to clinch victory against ChatGPT from the verge of defeat.

DeepSeek was developed in Hangzhou in 2023 by the Chinese hedge fund High-Flyer. Launched on 20 January 2025, the new open-source model DeepSeek-R1 provides free services as both a mobile application and desktop version, immediately attracting attention in global AI circles. The new model, trained on the database available up to July 2024, can integrate updated data from web sources as necessary.

DeepSeek-R1 is equipped with AI assistant functions, including writing song lyrics and making business development plans, or even preparing a recipe based on the contents of a fridge. With the ability to communicate in multiple languages, it excels in English and Chinese in particular. As R1 reveals its chain of thought when answering questions, making the reasoning process transparent, users gain a clear understanding of the AI system’s logic. This enables them to learn in reverse how to deconstruct complex problems and arrive at problem-solving solutions through practice. From International Mathematical Olympiad (IMO) questions to gossip news, or from quantum physics to Chinese literature, users can visualize the thinking process of an AI machine. The resulting cognitive impact is conducive to providing a good starting point for humans to enrich their own thinking through AI.

Works wonders at minimal cost

DeepSeek is not only lauded as “true OpenAI” because of its open-source model but also impresses with its basic model V3, which took merely two months to train and just US$5.58 million to develop. As reported by Sina Finance, the 10,000-plus graphics processing units (subsequently increasing to 50,000) purchased by DeepSeek cost far less than those acquired by renowned labs such as OpenAI and Google.

The fact that the US has been able to produce high-performance chips while denying China access to related technologies has long been regarded as an enormous advantage in the AI race. DeepSeek’s success offers food for thought for the industry, prompting reflection on whether improving high-end chips is the only key to perfecting AI models. By leveraging just 2,000 of Nvidia’s H800 chips, DeepSeek has been able to rival OpenAI’s capability with one-tenth of the latter’s training cost. This is a testament to DeepSeek’s excellent algorithmic design and effective resource utilization.

Two rival models going toe to toe

The primary difference between ChatGPT and DeepSeek is that the former is tailored for conversational applications, focusing on task-specific intelligence within the realm of Artificial Narrow Intelligence whereas DeepSeek aims to achieve Artificial General Intelligence (AGI). Language support is another major difference. DeepSeek is a Chinese company with an emphasis on Chinese language and culture, rendering it a strong contender in the Chinese market. In comparison, as the current market leader supporting multiple languages, ChatGPT holds a greater advantage with its wide user base among professionals worldwide.

OpenAI has built a powerful ecosystem around ChatGPT, encompassing application interfaces, plug-ins, and partnerships with tech giants such as Microsoft. This household name in AI benefits from a community of active developers who lend support to its continued improvement and innovation.

Meanwhile, DeepSeek is no slouch as a newcomer, making it ChatGPT’s formidable rival. Without substantial fine-tuning, ChatGPT may struggle to ensure the same level of accuracy in a specialized environment. DeepSeek is customized to efficiently handle specific datasets or domains, especially in sectors such as finance, healthcare, or legal documents. Its long-term goal of attaining AGI may facilitate the advancement of AI systems with even greater adaptability in future.

Facing direct headwinds as a force against the flow

Despite DeepSeek’s advanced technology, its data source has been mired in controversy. The chatbot has disclosed in its responses to users that its training process may have utilized ChatGPT’s output data, potentially violating OpenAI’s agreement.

Furthermore, DeepSeek is subject to stricter censorship in terms of certain content areas (e.g. personal finance), which limits some users’ experiences. In China, internet services are required to embody “core socialist values”. This means that Mainland chatbots must comply with government regulations regarding politically sensitive issues.

According to NewsGuard’s chatbot audit report in January 2025, DeepSeek could only provide accurate information about news topics 17% of the time, ranking it tied for 10th out of 11. Notably, all the other chatbots tested were Western models. The report points out that DeepSeek’s fail rate was 83%, compared with the average fail rate of 62% among other AI models. Microsoft and OpenAI have launched an investigation into whether any close associates of DeepSeek have used dishonest means to steal large amounts of data via the OpenAI application interface. Despite the fact that DeepSeek has readily become the top downloaded app in Apple’s App Store, its technology is inevitably called into question by competitors, and the pressure is piling on the US in the AI domain.

The introduction of the DeepSeek-R1 model in early 2025 coincided with America imposing tighter restrictions on the export of AI technology to China. Companies such as OpenAI have already warned that China’s AI models could catch up with or even surpass their American counterparts in future. China is explicitly aiming to become the world leader in AI technology by 2030 and plans to invest tens of billions of US dollars to support the sector. Currently, at least three Chinese labs, namely DeepSeek, Alibaba, and Moonshot AI, have launched AI models reportedly on a par with OpenAI o1.

DeepSeek’s success demonstrates that Chinese companies have overcome the hurdles involved, indicating that their development teams are fast approaching cutting-edge technological levels. The fact that the R1 model can be run on a personal computer greatly contributes to the popularization of AI applications. Its exceptional performance also means that powerful reasoning systems will soon become widespread. In view of the model’s ability to run on local hardware, it is likely to bypass America’s export controls.

With the spectacular debut of DeepSeek, AI has triggered not only revolutionary change. While creating unprecedented growth opportunities in law, accounting, healthcare, education, and transport, it has also given rise to unignorable risks.

Hidden risks not to be overlooked

Since AI systems are often trained on historical data, they are inevitably prone to the influence of embedded biases. Without proper action to address these biases, overt discrimination could occur. It is imperative to adopt both preventive and corrective measures, particularly in recruitment, lending, healthcare, and law enforcement. Amidst the rampant hacker attacks today, AI systems are susceptible to malicious data input due to adversarial sample attacks, data pollution, or model theft, which can result in erroneous or harmful decisions. This could end up with dire consequences in life- or property-critical domains, such as self-driving vehicles or financial systems.

Moreover, AI can also be used to generate convincing fraudulent content. For instance, increasingly prevalent deepfakes have fuelled the spread of false information. Deepfake videos of political figures can even sway elections or provoke social upheaval, seriously compromising institutional credibility and social cohesion. The weaponization of AI, whether applied to drones or cyber warfare, will further augment the potential for abuse and pose a formidable threat to global security.

Undoubtedly, the gains and risks of AI are equally far-reaching. For this very reason, it is essential to strike a balance between innovation and responsibility in the face of such challenges. Only through ethical governance, concerted efforts, and mutual vigilance by the international community can we fully harness AI’s power to benefit society and the economy while minimizing its dangers.

Translation
美國OpenAI 公司自2022 年推出ChatGPT以來,就一直雄踞生成式人工智能(Generative AI )市場領導地位。AI 創新領域早已成為兵家必爭之地,美國為了限制中國在此一領域的發展,禁止高端晶片出口中國,並限制中國用戶使用ChatGPT,香港用戶因而需透過虛擬專用網絡( VPN )才能使用。2024 年,中國一間初創公司推出了深度求索(DeepSeek)生成式AI模型,卻徹底扭轉局勢,劍指ChatGPT 的霸主寶座。

2025 年 1 月,DeepSeek 已超越 ChatGPT而成為美國iOS 應用商店下載次數最多的免費應用程式,導致輝達( Nvidia)股價下跌 18%。許多人不禁問:DeepSeek會就此取代ChatGPT 嗎?

AI新星  一鳴驚人


DeepSeek與ChatGPT雖非棋藝類AI,但國際象棋大師羅兹曼(Levy Rozman)上月加以測試,兩個AI模型在棋盤上博奕,結果DeepSeek使出《孫子兵法》的招數,以權謀反敗為勝,擊敗ChatGPT。

DeepSeek由杭州一家名為「幻方量化」的AI初創企業於2023年成立,本年1月20日發布採用開源模式的新模型 DeepSeek-R1 ,提供流動應用程式及桌面版免費服務,隨即在全球 AI 科技圈廣受關注。該模型資料庫使用截至2024年7月的訊息進行訓練,並可根據需要,整合來自網路的更新訊息;

R1具備AI助手功能,包括編寫歌詞、制定企業發展計劃,甚至按冰箱裡現有食材寫出一份餐單。R1又能以多種語言進行交流,在英語和漢語的水平尤為出色。此一模型在對答中詳細展示其思維鏈(chain of thought),將推理過程透明化,用戶可清晰了解AI的思考邏輯,甚至反向學習如何拆解複雜問題,產生實戰解難之效。從奧數題到八卦新聞,從量子物理到中國文學,用戶得以目睹機器的思路,由此引起的認知衝擊,不失為人類向AI學習思考的起步。

白菜成本  超高效益


DeepSeek不但以開源方式分享模型,而被譽為「真正的OpenAI」,其基礎模型V3的訓練時間只需兩個月,費用不過 558 萬美元。新浪財經曾報導,DeepSeek購買了1萬多塊英偉達圖像處理器(GPU),隨後增至5萬塊,成本遠低於動輒使用超過50萬塊GPU的OpenAI和谷歌等領先AI實驗室。

一直以來,美國在生產高性能晶片的能力以及阻止中國獲取相關技術的策略,被視為其在AI競賽領域的極大優勢。DeepSeek 的成功卻促使業界反思,改良高端晶片是否提升 AI 模型的唯一要素。DeepSeek 只使用 2000 粒輝達H800晶片,就實現了媲美 ChatGPT 的性能,訓練成本僅為其十分之一,顯示出優秀的算法設計和資源利用能力。

兩大模型  各擅勝場


ChatGPT 與 DeepSeek 的主要分別,在於前者主要針對對話應用程式而設計。ChatGPT專注於狹義人工智能中的特定任務智能,DeepSeek 的目標則是實現通用人工智能。語言支援是另一個重大分別。 DeepSeek是一家中國公司,注重中國語言和文化,這使其成為中國市場的競爭者。反觀ChatGPT 支援世界各地多種語言;作為當前的領導者,比 DeepSeek 較具優勢,獲全球專業人士廣泛使用。

OpenAI 圍繞着ChatGPT 建立了一個強大的生態系統,包括應用程式介面、外掛程式以及與科技巨頭如微軟的合作夥伴關係。這個人工智能界家喻戶曉的名字,擁有龐大活躍的開發者社區,支撐其不斷改進和創新的成果。

與此同時,DeepSeek作為後起之秀也有一定實力,這使其成為ChatGPT不可小覷的對手。ChatGPT 若不進行重大微調,可能無法在專門環境中提供相同程度的準確性。DeepSeek 專為更有效地處理特定資料集或領域而客製化。對於特定行業的數據,例如金融、醫療或法律文件或會更好。DeepSeek 實現通用人工智能的長期目標可能會在未來帶來適應性更強的人工智能系統。

迎面逆風  逆流動能


儘管 DeepSeek 技術領先,但其數據來源出現爭議,曾被用戶發現在回應中,透露可能在訓練過程中使用了 ChatGPT 的輸出數據,或有違OpenAI 的協議。此外,DeepSeek 在某些內容(如理財問題)所受審查較ChatGPT嚴格,以致部分用戶的體驗受限。中國對網路服務規定,人工智能需體現「社會主義核心價值觀」,這意味著內地的聊天機器人,在政治敏感話題上往往受到規管。

本年1月,NewsGuard發表審核報告,分析多個聊天機械人有關新聞議題的準確度,DeepSeek僅獲 17% 準確率,與一個西方AI模型同列第 10位,排名榜前9位為西方AI模型。報告指出,DeepSeek 的錯誤率為 83%,較其他AI模型的平均錯誤率 62% 為高。微軟與 OpenAI 就開始調查是否有與DeepSeek關係密切者,以不誠實手段透過 OpenAI的應用程式介面竊取大量數據。儘管 DeepSeek 在 Apple App Store 中迅速成為最受歡迎的流動應用程式,其技術仍未免受到對手質疑,並加重了美國在 AI 領域的競爭壓力。

DeepSeek-R1 模型在年初面世,正值是美國加強對中國出口AI技術限制之時。OpenAI等企業已作出警告,中國的AI模型未來可能緊貼甚至超越美國。中國明言到2030年將成為人工智能技術的全球領導者,並計劃在未來幾年內投入數百億美元,以支持該行業。目前包括DeepSeek、阿里巴巴以及Moonshot AI在內的至少3家中國實驗室,已推出據稱可媲美OpenAI o1的模型。

事實上,DeepSeek的成功足證此等企業已經開始克服有關障礙,顯示開發團隊迅速接近前沿技術水準。R1模型可在個人電腦上操作,大大有助於推動AI應用的普及化。其卓越性能意味著強大的推理系統會迅速傳播,並能在本地硬體上運行,而避開任何美國的出口管制。

潛藏風險  不可不察


隨着DeepSeek橫空而出,人工智能在激發革命性改變之餘,既在法律、會計、醫療、教育、交通等領域提供前所未有的發展機會,也帶來了不容忽視的風險。

AI 系統因常從歷史數據中學習,而難免受其中的偏見影響。要是未能妥善處理,這些偏見可能導致歧視性結果,尤其是在招聘、貸款、醫療保健、執法方面,必須加以防範和矯正。在駭客猖獗的今天,AI 系統或遭對抗性樣本攻擊、數據污染或模型盜竊,而被輸入惡意數據,以致作出錯誤或有害的決策,在自動駕駛車輛或金融系統在自動駕駛車輛或金融系統等性命財產攸關的領域,後果不堪設想。

此外,AI 還可以用來創建具有說服力的假內容此外,例如日益流行的深偽(deepfake)或合成媒體更每每助長假資訊傳播。政治人物的深偽視頻更足以左右選舉或觸發社會動盪,嚴重破壞機構的可信性以至和社會的凝聚力。AI的武器化,若在自主無人機或網絡戰爭中應用,將進一步放大其被濫用的潛能,而會對全球安全構成莫大威脅。

無庸置疑,AI的效益與風險同樣深遠。正因如此,面對這些挑戰之際,務須在創新與責任之間取得平衡。只有基於倫理管治、國際社會通力合作與彼此警惕,才有望充分發揮AI的力量,造福社會和經濟,同時將其危險性盡量降低。
謝國生博士   港大經管學院金融學首席講師、新界鄉議局當然執行委員

何敏淙  香港大學附屬學院講師
(本文同時於二零二五年三月十九日載於《信報》「龍虎山下」專欄)