This is quite interesting. When the AI lab announced their results: "Look, this fully proves our breakthrough." But when the outcome didn't meet expectations? They immediately changed tune: "Benchmark tests can't really measure true intelligence levels."



Selective belief in data—this trick is tried and true in the tech industry. The problem is, you can't use benchmarks as a success metric and then claim the benchmarks are invalid when you fail. Either benchmarks are meaningful, or you shouldn't mention them at all.

This attitude reflects a phenomenon in the industry: when the data favors you, it's ironclad proof; when the data is unfavorable, you start questioning the validity of the test itself. Truly capable projects should have a clear understanding of their results—celebrate wins modestly, and don't blame others when you lose.
View Original
This page may contain third-party content, which is provided for information purposes only (not representations/warranties) and should not be considered as an endorsement of its views by Gate, nor as financial or professional advice. See Disclaimer for details.
  • Reward
  • 4
  • Repost
  • Share
Comment
0/400
LadderToolGuyvip
· 21h ago
Haha, this is the classic "I win, the data speaks; I lose, the data is nonsense." Stances change suddenly, truly impressive. Bitcoin once stood firm against all skepticism, and now some projects?
View OriginalReply0
BlockchainBouncervip
· 21h ago
This double standard theory, the tech circle is now playing it very smoothly, just like the crypto circle. As for benchmarking, if it benefits oneself, it's a "scientific standard"; if not, it's "impossible to measure true ability"? Truly hilarious. When the results don't meet expectations, they shift the blame to the testing method. We've seen this move too many times. Basically, they want to win twice—boasting when the data looks good and excusing when it fails. Even if there's an issue with the middleware, it must be acknowledged. This kind of back-and-forth damages credibility the most.
View OriginalReply0
quiet_lurkervip
· 21h ago
Winning by data, losing by benchmarks—I'm really tired of this game.
View OriginalReply0
ForumMiningMastervip
· 21h ago
Haha, that's hilarious. Such obvious double standards, and you still have the nerve to say you're doing research.
View OriginalReply0
  • Pin
Trade Crypto Anywhere Anytime
qrCode
Scan to download Gate App
Community
  • 简体中文
  • English
  • Tiếng Việt
  • 繁體中文
  • Español
  • Русский
  • Français (Afrique)
  • Português (Portugal)
  • Bahasa Indonesia
  • 日本語
  • بالعربية
  • Українська
  • Português (Brasil)