The researchers found that VLMs need much more domain-specific training data to process difficult queries. By training on more informative data, the models could one day be great research assistants to ecologists, biologists, and other natural scientists (Credit: Alex Shipps/MIT CSAIL).

Try taking a picture of each of North America's roughly 11,000 tree species, and you’ll have a mere fraction of the millions of photos within nature image datasets. These massive collections of snapshots — ranging from butterflies to humpback whales — are a great research tool for ecologists because they provide evidence of organisms’ unique behaviors, rare conditions, migration patterns, and responses to pollution and other effects of climate change.

When users query a model, ContextCite highlights the specific sources from the external context that the AI relied upon for that answer. If the AI generates an inaccurate fact, for example, users can trace the error back to its source and understand the model’s reasoning (Credit: Alex Shipps/MIT CSAIL).

Chatbots can wear a lot of proverbial hats: dictionary, therapist, poet, all-knowing friend. The artificial intelligence models that power these systems appear exceptionally skilled and efficient at providing answers, clarifying concepts, and distilling information. But to establish the trustworthiness of content generated by such models, how can we really know whether a particular statement is factual, a hallucination, or just a plain misunderstanding?

Regina Barzilay, MIT professor, CSAIL Principal Investigator, and Jameel Clinic AI Faculty Lead (Credit: WCVB).

Regina Barzilay, School of Engineering Distinguished Professor for AI and Health at MIT, CSAIL Principal Investigator, and Jameel Clinic AI Faculty Lead, has been awarded the 2025 Frances E. Allen Medal from the Institute of Electrical and Electronics Engineers (IEEE). Barzilay’s award recognizes the impact of her machine-learning algorithms on medicine and natural language processing.

The MIT researchers developed an AI-powered simulator that generates unlimited, diverse, and realistic training data for robots. The team found that robots trained in this virtual environment called “LucidSim” can seamlessly transfer their skills to the real world, performing at expert levels without additional fine-tuning (Credit: Mike Grimmett/MIT CSAIL).

For roboticists, one challenge towers above all others: generalization, the ability to create machines that can adapt to any environment or condition. Since the 1970s, the field has evolved from writing sophisticated programs to using deep learning, teaching robots to learn directly from human behavior. But a critical bottleneck remains: data quality. To improve, robots need to encounter scenarios that push the boundaries of their capabilities, operating at the edge of their mastery.

The “Diffusion Forcing” method can sort through noisy data and reliably predict the next steps in a task, helping a robot complete manipulation tasks, for example. In one experiment, it helped a robotic arm rearrange toy fruits into target spots on circular mats despite starting from random positions and visual distractions (Credit: Mike Grimmett/MIT CSAIL).

In the current AI zeitgeist, sequence models have skyrocketed in popularity for their ability to analyze data and predict what to do next. For instance, you’ve likely used next-token prediction models like ChatGPT, which anticipate each word (token) in a sequence to form answers to users’ queries. There are also full-sequence diffusion models like Sora, which convert words into dazzling, realistic visuals by successively “denoising” an entire video sequence.
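The idea behind next-token prediction can be sketched in a few lines. The toy model below is an assumption for illustration only (a simple bigram frequency table, nothing like the neural networks behind ChatGPT): it counts which token tends to follow which in a tiny corpus, then generates text by repeatedly predicting the most likely next token.

```python
# Toy next-token predictor: a bigram table built from word counts.
# This is a minimal illustration of the "predict the next token" loop,
# not a real language model.
from collections import defaultdict, Counter

corpus = "the cat sat on the mat the cat ran".split()

# Count how often each token follows each other token.
bigrams = defaultdict(Counter)
for prev, nxt in zip(corpus, corpus[1:]):
    bigrams[prev][nxt] += 1

def generate(start, steps):
    """Greedily append the most frequent next token, `steps` times."""
    tokens = [start]
    for _ in range(steps):
        followers = bigrams[tokens[-1]]
        if not followers:  # no known continuation
            break
        tokens.append(followers.most_common(1)[0][0])
    return tokens

print(" ".join(generate("the", 4)))
```

A real model replaces the frequency table with a learned probability distribution over a large vocabulary and samples from it rather than always taking the most likely token, but the generation loop — predict one token, append it, repeat — is the same.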