OpenAI Launches First Model With Reasoning Abilities

OpenAI CEO Sam Altman described o1 as the company's most capable and aligned model yet, but admitted that “o1 is still flawed, still limited, and it still seems more impressive on first use than it does after you spend more time with it.”

TMTPOST -- In a groundbreaking move, OpenAI has unveiled its latest AI model, 'o1,' which promises to redefine the landscape of artificial intelligence with its advanced reasoning capabilities.

Two distinct versions were released: o1-preview and o1-mini. The former is designed for high-level reasoning tasks in mathematics, programming, and scientific inquiries, boasting performance close to that of PhD-level experts. The latter is a more compact model optimized for code generation.

The o1 model is the product of the highly anticipated 'Strawberry' project. Some industry insiders suggest that 'o1' stands for 'Orion.'

OpenAI has emphasized that the new model represents a fresh start in AI's ability to handle complex reasoning tasks, meriting a naming convention distinct from the 'GPT-4' series. It also marks a new starting point for the AI era: the arrival of large models that can perform general complex reasoning.

Despite its advanced capabilities, the current chat experience with o1 remains basic. Unlike its predecessor GPT-4o, o1 does not offer functions such as browsing the web or analyzing files. Although it has image analysis capabilities, this feature is temporarily disabled pending further testing. There are also message limits: o1-preview is capped at 30 messages per week, while o1-mini allows 50 messages per week.

Starting Friday, both versions are available to ChatGPT Plus/Team users and via API channels, with enterprise and educational users gaining priority access next week.

The training behind o1 is fundamentally different from its predecessors, said OpenAI’s research lead, Jerry Tworek. He said o1 “has been trained using a completely new optimization algorithm and a new training dataset specifically tailored for it.”

OpenAI taught previous GPT models to imitate patterns from their training data. With o1, it trained the model to solve problems on its own using a technique known as reinforcement learning, which teaches the system through rewards and penalties. The model then uses a “chain of thought” to process queries, similar to the way humans work through problems step by step.

OpenAI's new training methodology has led to a model that, according to the company, is more accurate. "We've noticed this model hallucinates less," says Tworek. However, the issue hasn’t been fully resolved. "We can’t claim to have eliminated hallucinations."

What distinguishes this new model from GPT-4o is its enhanced ability to solve complex problems, particularly in coding and math, while also providing explanations for its reasoning, OpenAI explains.

“The model is definitely better at solving the AP math test than I am, and I was a math minor in college,” says Bob McGrew, OpenAI’s chief research officer. OpenAI tested o1 on a qualifying exam for the International Mathematical Olympiad, where it solved 83% of the problems, compared to GPT-4o’s 13%.

In Codeforces programming contests, the model ranked in the 89th percentile of participants. OpenAI also claims the next update will perform similarly to PhD students on challenging benchmark tasks in physics, chemistry, and biology.

Despite these advancements, o1 lags behind GPT-4o in certain areas, such as factual knowledge about the world. It also lacks web-browsing capabilities and the ability to process files and images. Still, OpenAI views o1 as representing a new class of AI capabilities, naming it to symbolize "resetting the counter back to 1."

While the new o1 model does not yet offer fully general problem-solving ability, its significantly improved reasoning makes it far more useful in specialized fields like science, programming, and mathematics. It also raises both the floor and the ceiling of what AI agent technologies can do, greatly enhancing capabilities in scientific research and production. Its significance for the consumer sector, however, is relatively limited.

Jim Fan, a senior research scientist at Nvidia, noted that the new o1 model requires more computational power and data, and that it can generate a data flywheel effect: correct answers and their thought processes can become valuable training data. This, in turn, continuously improves the reasoning core, much like how AlphaGo’s value network improved as more refined data was generated through MCTS (Monte Carlo Tree Search).

OpenAI's o1 series models significantly enhance reasoning capabilities and introduce a new scaling paradigm, unlocking test-time compute through reinforcement learning, according to Tianfeng Securities.

However, the model has its critics. Some users have noted delays in response times due to the multi-step processing involved in generating answers. Others have pointed out that while o1 excels in certain benchmarks, it does not yet surpass GPT-4o in all metrics. OpenAI's product manager, Joanne Jang, has cautioned against unrealistic expectations, emphasizing that o1 is a significant step forward but not a miracle solution.

The AI community remains divided over the terminology used to describe o1's capabilities. Terms like 'reasoning' and 'thinking' have sparked debate, with some experts arguing that these anthropomorphic descriptions can be misleading. Nonetheless, the o1 model's ability to perform tasks that require planning and multi-step problem-solving marks a notable advancement in AI technology.

Founded in 2015, OpenAI has been at the forefront of the tech industry's rapid shift towards AI. Its chatbot product, ChatGPT, first launched in 2022, sparked a global investment frenzy in AI.

OpenAI is in discussions to raise funds at a valuation of $150 billion, Bloomberg reported. The company is aiming to secure approximately $6.5 billion from investors including Apple, Nvidia and Microsoft, and is also exploring $5 billion in debt financing from banks.

OpenAI's CFO Sarah Friar recently mentioned in an internal memo that the upcoming round of financing will support the company's needs for increased computational capacity and other operational expenses. She emphasized that the company's goal is to allow employees to sell a portion of their shares in a buyback offer later this year.

(Sources: CNN, TechCrunch, The Verge.)
