For OpenAI, o1 represents a step toward its broader goal of human-like artificial intelligence. More practically, it does a better job at writing code and solving multistep problems than previous models. But it’s also more expensive and slower to use than GPT-4o. OpenAI is calling this release of o1 a “preview” to emphasize how nascent it is.

The training behind o1 is fundamentally different from its predecessors, OpenAI’s research lead, Jerry Tworek, tells me, though the company is being vague about the exact details. He says o1 “has been trained using a completely new optimization algorithm and a new training dataset specifically tailored for it.”

OpenAI taught previous GPT models to mimic patterns from its training data. With o1, it trained the model to solve problems on its own using a technique known as reinforcement learning, which teaches the system through rewards and penalties. It then uses a “chain of thought” to process queries, similarly to how humans process problems by going through them step-by-step.

At the same time, o1 is not as capable as GPT-4o in a lot of areas. It doesn’t do as well on factual knowledge about the world. It also doesn’t have the ability to browse the web or process files and images. Still, the company believes it represents a brand-new class of capabilities. It was named o1 to indicate “resetting the counter back to 1.”

I think this is the most important part (emphasis mine):

As a result of this new training methodology, OpenAI says the model should be more accurate. “We have noticed that this model hallucinates less,” Tworek says. But the problem still persists. “We can’t say we solved hallucinations.”

  • TommySoda@lemmy.world
    link
    fedilink
    English
    arrow-up
    0
    ·
    edit-2
    7 days ago

    I’m kinda in the same boat but on the other side. I always try to argue with people about this. It gets me a lot of flak on pro AI posts but that won’t stop me. I usually get very aggressive replies and sometimes some fucked up dm’s too.

    I’m against it because we are already seeing the consequences of this technology and it’s only getting worse. By the time laws catch up it’s gonna be too late and the damage will be done. For some technologies that’s not always the worst. But we already saw how long it took for anyone to do anything about the Internet when it came out, and we are still trying to this day. This shit is growing so fast we will all feel the whiplash. Sites like Facebook are getting absolutely flooded with so much AI that they are becoming almost unusable. And that’s before we even get into the shady shit people use AI for like making porn of people they know with the click of a button. I recently read an article about how bad deepfake porn is in South Korea (found the article. https://www.nytimes.com/2024/09/12/world/asia/south-korea-deepfake-videos.html). And in places like the US, where a lot of these companies are based, they are so slow to do anything about a problem it’s going to be too late by the time they get to it.

    But besides all the awful things happening because of AI, I do have one personal gripe with the whole ordeal. Why are we so quick to replace the things we enjoy with AI? When I get home from work I like to make music and practice pixel art (I’m not very good at either yet). I’d much rather have AI replace my job than my hobbies. I’m down for things that are useful, but too much of this just gives me a bad gut feeling. Like their trying to replace people and not their jobs.

    This may be the future. But it sounds like a pretty dystopian future to me. You already can’t believe everything you see on the Internet and this will only make it worse.