The internet is awash in instructional videos that can teach curious viewers everything from cooking the perfect pancake to performing a life-saving Heimlich maneuver.
Researchers from the MIT Computer Science and Artificial Intelligence Laboratory (CSAIL) and Google Research may have just performed digital sorcery — in the form of a diffusion model that can change the material properties of objects in images.
Large language models (LLMs) are becoming increasingly useful for programming and robotics tasks, but for more complicated reasoning problems, the gap between these systems and humans looms large. Without the ability to learn new concepts like humans do, these systems fail to form good abstractions — essentially, high-level representations of complex concepts that skip less-important details — and thus sputter when asked to do more sophisticated tasks.
For nearly a decade, a team of MIT Computer Science and Artificial Intelligence Laboratory (CSAIL) researchers has been seeking to uncover why certain images persist in people's minds while many others fade. To do this, they set out to map the spatio-temporal brain dynamics involved in recognizing a visual image. Now, for the first time, scientists have harnessed the combined strengths of magnetoencephalography (MEG), which captures the timing of brain activity, and functional magnetic resonance imaging (fMRI), which identifies active brain regions, to precisely determine when and where the brain processes a memorable image.
In biomedicine, segmentation involves annotating the pixels that belong to an important structure in a medical image, such as an organ or a cell. Artificial intelligence models can help clinicians by highlighting pixels that may show signs of a certain disease or anomaly.
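To make the idea concrete, here is a minimal sketch of what a segmentation output can look like, assuming a model that emits a per-pixel probability map: thresholding that map yields a binary mask marking the annotated structure. The `probs` values and the 0.5 threshold below are illustrative assumptions, not any particular model's output.

```python
import numpy as np

# Hypothetical per-pixel probabilities from a segmentation model
# for a tiny 4x4 "image" (values are made up for illustration).
probs = np.array([
    [0.1, 0.2, 0.8, 0.9],
    [0.1, 0.7, 0.9, 0.8],
    [0.2, 0.6, 0.7, 0.3],
    [0.1, 0.2, 0.3, 0.1],
])

# Thresholding turns probabilities into a binary segmentation mask:
# 1 marks pixels annotated as part of the structure, 0 marks background.
mask = (probs > 0.5).astype(np.uint8)
print(mask)
```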
In our current age of artificial intelligence, computers can generate their own “art” by way of diffusion models, iteratively adding structure to a noisy initial state until a clear image or video emerges. Diffusion models have suddenly grabbed a seat at everyone’s table: Enter a few words and experience instantaneous, dopamine-spiking dreamscapes at the intersection of reality and fantasy. Behind the scenes, though, generation is a complex, time-intensive process, requiring numerous iterations for the algorithm to perfect the image.
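To show why the process is iterative, here is a minimal, purely illustrative sketch of a diffusion-style sampling loop. The `denoise_step` function is a hypothetical stand-in for a trained denoising network, and the update rule and step count are assumptions for demonstration, not the actual algorithm behind any production model.

```python
import numpy as np

def denoise_step(x, t):
    """Stand-in for a trained denoising network: nudges the sample
    slightly toward a target pattern. A real model would predict and
    remove the noise at step t instead."""
    target = np.zeros_like(x)  # pretend the "clean image" is all zeros
    return x + 0.1 * (target - x)

# Start from pure noise and iteratively add structure, as diffusion
# models do, over many small refinement steps.
rng = np.random.default_rng(0)
x = rng.normal(size=(8, 8))    # noisy initial state
for t in reversed(range(50)):  # numerous iterations of refinement
    x = denoise_step(x, t)

print(f"mean |pixel| after refinement: {np.abs(x).mean():.3f}")
```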
Imagine yourself glancing at a busy street for a few moments, then trying to sketch the scene you saw from memory. Most people could draw the rough positions of the major objects like cars, people, and crosswalks, but almost no one can draw every detail with pixel-perfect accuracy. The same is true for most modern computer vision algorithms: They are fantastic at capturing high-level details of a scene, but they lose fine-grained details as they process information.
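One concrete way this loss can happen, at least in convolutional pipelines, is repeated downsampling: each pooling stage shrinks the spatial grid, keeping coarse structure while discarding pixel-level detail. The average-pooling sketch below is an illustrative assumption about a generic pipeline, not the specific method discussed here.

```python
import numpy as np

def avg_pool2x2(x):
    """Average-pool a 2D array by a factor of 2 in each dimension."""
    h, w = x.shape
    return x.reshape(h // 2, 2, w // 2, 2).mean(axis=(1, 3))

x = np.arange(64, dtype=float).reshape(8, 8)  # stand-in "image"
for _ in range(3):
    x = avg_pool2x2(x)
    print("resolution:", x.shape)  # 4x4 -> 2x2 -> 1x1: fine detail is gone
```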
Peripheral vision enables us to see shapes that aren’t directly in our line of sight, albeit with less detail. This ability expands our field of vision and can be helpful in many situations, such as detecting a vehicle approaching our car from the side.
Artists who bring to life heroes and villains in animated movies and video games could have more control over their animations, thanks to a new technique introduced by MIT researchers.