AI stream

AI Post

@emollick
Performance Medium

@emollick

Importance score: 5 • Posted: February 26, 2026 at 06:29

Score

5

Has anyone actually benchmarked AI ability with any of the default knowledge work skills shipping with Claude Cowork? Does it increase GDPval scores over default 4.6? (Not GDPval-AA) It seems worth testing for real, given that the market freaks out every time they ship skills.

Grok reasoning
Question on benchmarking Claude skills, highlights AI tool performance.

Likes

170

Reposts

5

Views

16,807

Tweet ID: 2026907347340189886
Prompt source: ai-influencers-news
Fetched at: February 27, 2026 at 04:33