Government News

LLMs are far from reaching technical limits, Chinese AI experts say

2025-02-26

Large language models are not yet approaching their technological ceiling, with ample room for further advancements, experts said at the recent Global Developer Conference.

LLMs are in a rapid development phase, Liu Hua, vice president of Shanghai-based artificial intelligence startup MiniMax, said during the AI-themed GDC that wrapped up in the eastern city on Feb. 23.

OpenAI's launch of its o1 model late last year and DeepSeek's open-source release of DeepSeek-R1 in January exemplify this progress, Liu said. In the next two to three years, technological advancements comparable to the leap from GPT-3.5 to GPT-4 are likely to occur twice more, he added.

Industry insiders are speculating on how close LLM developers are to hitting the scaling law limit. The limit refers to the tipping point at which further increases in model parameters, dataset size, or computational resources no longer yield meaningful gains in model performance, resulting in diminishing returns and wasted resources.

Developers require more corpus data: text written or audio spoken by native speakers of the language or dialect. An industry practitioner told Yicai that the supply of this fundamental raw material has not increased in proportion to the growing scale of LLMs, limiting the models' ability to learn new knowledge.

However, He Conghui, a scientist at the Shanghai AI Laboratory, claimed that available data have not been exhausted and there remains room for quality improvement. Moreover, enhancing data quality can improve efficiency, suggesting future models may require less data. This could lead to further reductions in computational costs and encourage broader participation in model optimization.

Qiao Yu, assistant director of the Shanghai AI Laboratory, noted at the conference that LLMs still face numerous challenges in industrial implementation, including costs, efficiency, reliability, stability, and security.

Beginning this year, LLMs will enter their next phase, where innovation and application will become crucial for overcoming development bottlenecks, Qiao said. DeepSeek has made significant progress through systematic innovation in model architecture, training methods, and high-speed parallel frameworks. This has greatly improved efficiency and reduced costs, providing valuable insights, the expert added.

Looking ahead, Qiao believes this year will bring advancements in multimodal intelligence and AI-assisted scientific discoveries.

Source: Yicai Global
