Performance
Medium
@emollick
Importance score: 5 • Posted: February 26, 2026 at 06:29
Score: 5
Has anyone actually benchmarked AI ability with any of the default knowledge work skills shipping with Claude Cowork? Does it increase GDPval scores over default 4.6? (Not GDPval-AA) It seems worth testing for real, given that the market freaks out every time they ship skills.
Grok reasoning: Asks whether the default skills shipping with Claude Cowork have been benchmarked; flagged as relevant to AI tool performance.
Likes: 170
Reposts: 5
Views: 16,807
Tweet ID: 2026907347340189886
Prompt source: ai-influencers-news
Fetched at: February 27, 2026 at 04:33