Filter Options
Date
Image
null
External articles

AI models are proliferating fast. There’s Claude, ChatGPT, Gemini, Copilot, DeepSeek, Grok, Mistral, Llama, and many more emerging every day. But which ones to work with? And why? We asked MIT CSAIL faculty and students which AI tools they’re reaching for right now. The responses showed a variety of preferences, a clear winner in one area, and a word of caution about what goes into any public model’s memory.

Image
null
CSAIL article

Anthropic CEO Dario Amodei has said that AI could surpass “almost all humans at almost everything” shortly after 2027. While AI’s capabilities are certainly improving, such rapid progress might seem at odds with findings that show AI is still failing at 95%+ of remote freelance projects, and continues to struggle with hallucination, long term planning, and forms of abstract reasoning that humans find easy. But recent work from METR has found evidence that LLMs can gain capabilities in rapid surges — jumping from succeeding almost never to almost always in just a few years. If this is true across the economy, it could mean that workers could be blindsided by AI advances.