Sunday, December 22, 2024

Elon Musk teases next-gen AI chatbot Grok-1.5 with superior coding and math expertise

Elon Musk introduced that an upgraded iteration of his synthetic intelligence agency xAI’s chatbot Grok could also be launched subsequent week.

This revelation got here through Musk’s social media submit on March 29, following xAI’s announcement of Grok-1.5 in a weblog submit. The improved AI chatbot will initially be accessible to “early testers and current Grok customers on the social media platform.”

Moreover, Musk hinted on the ongoing improvement of Grok 2, which he anticipates will surpass present AI requirements in all features.

Grok-1.5

Grok-1.5 is a complicated model of the Grok-1 AI mannequin and comes with improved reasoning and a context size of 128,000 tokens.

xAI’s evaluation signifies vital enhancements within the efficiency of its superior chatbot, significantly in coding and math-related duties. Nonetheless, it falls quick in comparison with Google’s Gemini Professional 1.5 and OpenAI’s GPT-4.

xAI Grok-1.5
AI Chatbot’s evaluation. (Supply: xAI)

Based on the agency:

“Grok-1.5 achieved a 50.6% rating on the MATH benchmark and a 90% rating on the GSM8K benchmark, two math benchmarks overlaying a variety of grade faculty to highschool competitors issues. Moreover, it scored 74.1% on the HumanEval benchmark, which evaluates code era and problem-solving skills.”

Furthermore, Grok-1.5 can make the most of info from considerably longer paperwork, and the mannequin can deal with longer and extra advanced prompts whereas sustaining its instruction-following functionality as its context window expands.

The agency added:

“Grok-1.5 is constructed on a customized distributed coaching framework primarily based on JAX, Rust, and Kubernetes. This coaching stack allows our group to prototype concepts and practice new architectures at scale with minimal effort.”

Grok is Open-source

Earlier this month, xAI took a big step by open-sourcing the bottom code of Grok-1.

This choice arose as a response to a authorized motion initiated by Musk towards OpenAI, the group he as soon as co-founded. Musk alleged that OpenAI has deviated from its unique dedication to prioritize open-source mannequin improvement over shareholder pursuits.

In the meantime, xAI stated the launched code was “the uncooked base mannequin checkpoint from the Grok-1 pre-training section, which concluded in October 2023. Because of this the mannequin isn’t fine-tuned for any particular software, resembling dialogue.” It added that the mannequin was licensed underneath Apache License 2.0.

Talked about on this article



Related Articles

LEAVE A REPLY

Please enter your comment!
Please enter your name here

Latest Articles