Inflection debuts its own foundation AI model to rival Google and OpenAI LLMs

Inflection, a well-funded AI startup aiming to create “personal AI for everyone,” has taken the wraps off the large language model powering its Pi conversational agent. It’s hard to evaluate the quality of these things in any way, let alone objectively and systematically, but a little competition is a good thing.

Inflection-1, as the model is called, is of roughly GPT-3.5 (AKA ChatGPT) size and capabilities — as measured in the computing power used to train them. The company claims that it’s competitive or superior with other models on this tier, backing it up with a “technical memo” describing some benchmarks it ran on its model, GPT-3.5, LLaMA, Chinchilla and PaLM-540B.

According to the results they published, Inflection-1 indeed performs well on various measures, like middle- and high school-level exam tasks (think biology 101) and “common sense” benchmarks (things like “if Jack throws the ball on the roof, and Jill throws it back down, where is the ball?”). It mainly falls behind on coding, where GPT-3.5 beats it handily and, for comparison, GPT-4 smokes the competition; OpenAI’s biggest model is well known to have been a huge leap in quality there, so it’s no surprise.

Inflection notes that it expects to publish results for a larger model comparable to GPT-4 and PaLM-2(L), but no doubt they are waiting until the results are worth publishing. At any rate, Inflection-2 or Inflection-1-XL or whatever is in the oven but not quite baked.

So far the community hasn’t formally divided AI models into the machine learning equivalent of boxing weight classes, but the concepts do map to one another quite well. You don’t expect a flyweight to go up against a heavyweight, they’re practically different sports. Same with AI models: a small one isn’t as capable as a large one, but the small one runs efficiently on a phone while the large one requires a data center. It’s an apples to oranges thing.

It’s still too early to attempt such a thing, since the field is still comparatively young and there’s no real consensus on what sizes and shapes of AI model should be considered of a feather.

Ultimately for most of these models the proof of the pudding is in the tasting, of course, and until Inflection opens up its model to widespread use and independent evaluation, all its vaunted benchmarks must be taken with a grain of salt. If you want to give Pi a shot, you can just add it on one of your messaging apps, or chat with it online here.

Quavo and 2 Chainz lead tributes to late rapper Rich Homie Quan

Shakira settled tax case 'to protect her children'

Beyoncé 'grateful for another year' as she celebrates turning 43

Spotlight: UK hip-hop artist Jordan Adetunji on all things ‘Kehlani’

‘Munich you were incredible!’ Adele thanks her fans who came to see her

Princess Diana would have accepted Camilla ‘eventually’, says former royal bodyguard

Princess Lilibet has ‘found her voice’ at the age of three

Diana’s gowns and royal memorabilia fetch over R102 million at auction

Prince Harry's chief of staff quits after just three months

Prince William and Catherine devastated after horror stabbing incident

Trendy Acupuncture Technique Turns You into a Human Porcupine

Company Lays Devious Trap to Fire Senior Employees Without Severance Pay

Man Has 23 Teeth Extracted and 12 Implants Done on the Same Day, Dies Shortly After

Vieux Boulogne – The ’s Stinkiest Cheese

Man Inhales Cockroach in His Sleep, Has Bad Breath for Three Days

Deadly waves sweep away 5 persons at Oman beach, video triggers debate on craze for ‘likes’

Twitter flooded with stunning pictures of ‘Manhattanhenge’ as sun aligns perfectly between Manhattan streets

Storm splits a house into two in the US. Here is what it looks like

Soccer time: Bear in US has a ball, literally! Wins the love of netizens

‘Cake meme gone too far’: Netizens compare Kanye West’s new Yeezy Sulfur shoes with failed foods

She Asked TikTok If Her House Was Haunted. Then the Cops Came

Bluesky grows to 9M+ users

Telegram reportedly ‘inundated’ with illegal and extremist activity

Payroll startup Warp disavows ‘affiliate’ who posted about white superiority

Boeing’s Starliner performs flawless touchdown without on-board crew, program’s future remains uncertain

Inflection debuts its own foundation AI model to rival Google and OpenAI LLMs

She Asked TikTok If Her House Was Haunted. Then the Cops Came

Dyslexia in higher education: Tools and strategies for helping learners succeed

Bluesky grows to 9M+ users

Telegram reportedly ‘inundated’ with illegal and extremist activity

She Asked TikTok If Her House Was Haunted. Then the Cops Came

Dyslexia in higher education: Tools and strategies for helping learners succeed

Bluesky grows to 9M+ users

Telegram reportedly ‘inundated’ with illegal and extremist activity

LEAVE A REPLY Cancel reply