A wave of frustration is hitting developers as Grok 4 falls short against Claude 4, with some asserting it's performing worse than Grok 3. Users are sharing their doubts as the model's programming abilities come under scrutiny, amid promises of future updates.
Many developers are turning to forums to share their displeasure with Grok 4's performance. Initial excitement has shifted to skepticism. "So youโre saying Grok 4 will get even worse," one commenter remarked, encapsulating the growing doubt over the modelโs future. Further comments state that Grok 4 is the worst among the top coding models, with users noting its deficiencies when compared to others like Opus 4 and Gemini.
"Itโs not xAIโs coding LLM as that one is coming soon." This comment emphasizes confusion around current expectations versus future releases.
Performance Issues: Users reveal Grok 4's heavy model performs poorly, even worse than Grok 3.
Trust in Benchmarking: Some caution that initial benchmarks may not reflect performance fairly.
Looking Ahead: Users express mixed feelings over the anticipated specialized coding model set to launch soon.
Among the many discussions, several notable quotes reflect user sentiments:
"Where is 3.5 Sonnet ranked? Itโs still my favorite model for coding as it never goes off track."
"Theyโve already said upfront itโs NOT a coding model right now."
"Check my rating," a response illustrates the ongoing quest for clarity from Grok's performance.
โ๏ธ Users express dissatisfaction, citing Grok 4's inadequate performance against Claude 4.
โ ๏ธ Heavy model criticism suggests it may lag considerably in coding tasks.
๐ Anticipation builds for upcoming updates, amid hopes that they may eventually address current limitations.
Despite the prevailing discontent, thereโs a thread of optimism for Grok 4's future updates.
As disappointment spreads, Grok 4 faces an uphill path to regain developer trust. While some experts estimate a 60% probability that updates will strengthen coding capabilities, many foresee delays extending into late 2025. If Grok can enhance its tech effectively, it may attract some users back, but only if the changes meet expectations set during earlier releases.
A striking comparison can be drawn with early smartphone brands that faced significant backlash. Just as those brands learned from their shortcomings to improve in subsequent iterations, Grok 4 still holds the chance to make amends. Will it turn setbacks into success? Only time will unveil the answer.