Artificial Intelligence Enters the "Data Mining" Era
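With the introduction and application of a wide range of machine learning algorithms, and especially with the development of deep learning, people hope that machines can analyze large amounts of data, learn knowledge automatically, and reach a certain degree of intelligence. Advances in computer hardware and in big-data analysis have greatly improved machines' ability to collect, store, and process data.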
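As computer technology keeps advancing and its application areas expand, the global artificial intelligence market is growing rapidly. The global market is projected to reach USD 484 billion by 2026.
2023 was the year of the large language model (LLM). OpenAI's GPT-4 stunned the world. Unlike the text-only GPT-3 and its successors, GPT-4 is multimodal: it was trained on both text and images and can, among other things, generate text from an image. It launched with a context window of 8,192 tokens, a larger maximum input size than GPT-3.5, the previous best model. It was also trained with reinforcement learning from human feedback (RLHF), which is central to the success of state-of-the-art LLMs. OpenAI evaluated GPT-4 extensively, not only on classic NLP benchmarks but also on exams designed for humans (such as the bar exam, the GRE, and LeetCode).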
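GPT-4 handles some tasks that GPT-3.5 could not. On the Uniform Bar Exam, for example, GPT-4 scores around the 90th percentile of test takers, while GPT-3.5 scores around the 10th percentile. On most tasks the added vision component makes only a small difference, but on others it helps considerably. On adversarial factuality evaluations, GPT-4's accuracy is 40% higher than that of the previous best ChatGPT model.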

On February 16, 2024, OpenAI launched Sora, a large text-to-video model that can generate video directly from a text prompt and can also turn still images into vivid, lifelike video. The most striking thing about Sora is that it produces realistic content consistent with people's common sense, which suggests that it has learned a good deal about how the many elements of the world interact.
Some professional organizations have pointed out that if Sora is judged by how well it "understands the world", then the image quality and composition of any single frame are not the right criteria for assessing the model. Even the 60-second single-take video published on the official website is not the core achievement. The real point is that the video contains different camera positions. Whether the shot is long, medium, close-up, extreme close-up, or wide, the relationship between the characters and the background stays largely consistent.

The technical report released by OpenAI attributes Sora's capabilities to a diffusion model built on the Transformer architecture and to a technique for converting visual data into a unified, usable representation. The emergence of Sora lays a foundation for models that can understand and simulate the real world. OpenAI expects this capability to be an important milestone on the way to artificial general intelligence.
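To give a rough sense of what a "unified format" for visual data can look like: OpenAI's report describes breaking video into spacetime patches, which then play the role that tokens play in a language model. The sketch below illustrates that idea only on a toy single-channel clip stored as nested Python lists. The function name `to_spacetime_patches` and the parameters `pt`, `ph`, and `pw` are hypothetical, and the report describes patching a compressed latent representation rather than raw pixels, so this is not Sora's actual pipeline.

```python
from itertools import product


def to_spacetime_patches(video, pt, ph, pw):
    """Split a video into a flat sequence of spacetime patches.

    video: nested list indexed as video[t][y][x], one scalar per pixel
           (a toy single-channel clip; real systems operate on tensors).
    pt, ph, pw: patch size along time, height, and width (hypothetical names).
    Returns a list of patches, each a flat list of pt * ph * pw values.
    """
    T, H, W = len(video), len(video[0]), len(video[0][0])
    if T % pt or H % ph or W % pw:
        raise ValueError("video dimensions must be divisible by the patch size")

    patches = []
    # Visit the starting corner (t0, y0, x0) of every patch in order.
    for t0, y0, x0 in product(range(0, T, pt), range(0, H, ph), range(0, W, pw)):
        patch = [
            video[t][y][x]
            for t in range(t0, t0 + pt)
            for y in range(y0, y0 + ph)
            for x in range(x0, x0 + pw)
        ]
        patches.append(patch)
    return patches


# Toy clip: 4 frames of 4x4 pixels, each pixel holding a unique integer.
clip = [[[t * 16 + y * 4 + x for x in range(4)] for y in range(4)] for t in range(4)]
tokens = to_spacetime_patches(clip, pt=2, ph=2, pw=2)
print(len(tokens), len(tokens[0]))
```

Because every clip, whatever its length or resolution, becomes a sequence of equally sized patches, a single Transformer can in principle be trained on videos and images of different shapes. That is one plausible reading of the "unified format" mentioned above.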