Jul 12, 2024 10:45:00

OpenAI creates a scale to evaluate how close large language models are to human intelligence

OpenAI, which develops large language models such as GPT-4o, has revealed that it has created an evaluation scale showing how close the intelligence of large language models is to human levels.

OpenAI Sets Levels to Track Progress Toward Superintelligent AI - Bloomberg
https://www.bloomberg.com/news/articles/2024-07-11/openai-sets-levels-to-track-progress-toward-superintelligent-ai

Here's how OpenAI will determine how powerful its AI systems are - The Verge
https://www.theverge.com/2024/7/11/24196746/heres-how-openai-will-determine-how-powerful-its-ai-systems-are

An OpenAI spokesperson told Bloomberg that the new AI evaluation scale was shared with employees at a company-wide all-hands meeting.

The scale consists of five levels, from Level 1 to Level 5; the higher the level, the closer the AI is judged to be to human capability.

OpenAI states that its large language models are currently at Level 1 and approaching Level 2. According to OpenAI, Level 2 is a system with basic problem-solving abilities on par with a human who holds a doctorate. Level 3 is a system that can act on behalf of users, Level 4 is a system that can produce new innovations, and Level 5, the highest level, is a system that can do the work of an entire organization.

OpenAI's new rating scale was introduced shortly after the company signed a partnership with Los Alamos National Laboratory.

OpenAI and Los Alamos National Laboratory collaborate to strengthen AI safety - GIGAZINE

OpenAI aims to develop artificial general intelligence (AGI), which it describes as 'a system that is highly autonomous, surpassing humans in most economically valuable tasks.' CEO Sam Altman said in October 2023 that AGI was about five years away, but building an AGI equivalent to Level 5 will require enormous computing power and funding.

However, this rating scale is provisional and may be subject to further refinement based on feedback from employees, investors, and the board of directors.

Jul 12, 2024 10:45:00 in AI, Software, Posted by log1i_yk