Entertainment/Media

Monday, June 2, 2025

Top row, left to right: Matthew Caren, April Qiu Cheng, Arav Karighattam, and Benjamin Lou. Bottom row, left to right: Isabelle Quaye, Albert Qin, Ananthan Sadagopan, and Gianfranco (Franco) Yee (Credits: Photos courtesy of the Hertz Foundation).

CSAIL article

CSAIL researcher among 2025 Hertz Foundation Fellowship recipients

The Hertz Foundation announced that it has awarded fellowships to eight MIT affiliates. The prestigious award provides each recipient with five years of doctoral-level research funding (up to a total of $250,000), which gives them an unusual measure of independence in their graduate work to pursue groundbreaking research.

Monday, June 2, 2025

alt="SketchAgent uses a multimodal language model to turn natural language prompts into sketches in a few seconds. It can doodle on its own or through collaboration, drawing with a human or incorporating text-based input to sketch each part separately (Credits: Alex Shipps/MIT CSAIL, with AI-generated sketches from the researchers)."

CSAIL article

Teaching AI models the broad strokes to sketch more like humans do

When you’re trying to communicate or understand ideas, words don’t always do the trick. Sometimes the more efficient approach is to do a simple sketch of that concept — for example, diagramming a circuit might help make sense of how the system works.

But what if artificial intelligence could help us explore these visualizations? While these systems are typically proficient at creating realistic paintings and cartoonish drawings, many models fail to capture the essence of sketching: its stroke-by-stroke, iterative process, which helps humans brainstorm and edit how they want to represent their ideas.

Thursday, May 22, 2025

alt=""We are building AI systems that can process the world like humans do, in terms of having both audio and visual information coming in at once and being able to seamlessly process both modalities," says co-author Andrew Rouditchenko (Credits: MIT News; iStock)."

CSAIL article

AI learns how vision and sound are connected, without human intervention

Humans naturally learn by making connections between sight and sound. For instance, we can watch someone playing the cello and recognize that the cellist’s movements are generating the music we hear.

Thursday, May 15, 2025

MIT CSAIL researchers combined generative AI and a physics simulation engine to create a machine that outjumped a robot designed by a human (Credit: Researchers photographed by Dan McDonald and image collaged by Alex Shipps/MIT CSAIL using assets from the researchers).

CSAIL article

Using generative AI to help robots jump higher and land better

Diffusion models like OpenAI’s DALL-E are becoming increasingly useful in helping brainstorm new designs. Humans can prompt these systems to generate an image, create a video, or refine a blueprint, and come back with ideas they hadn’t considered before.

Thursday, May 8, 2025

With a new simulation method, robots can guess the weight, softness, and other physical properties of an object just by picking it up (Credits: MIT News, iStock).

CSAIL article

System lets robots identify an object’s properties through handling

A human clearing junk out of an attic can often guess the contents of a box simply by picking it up and giving it a shake, without the need to see what’s inside. Researchers from MIT, Amazon Robotics, and the University of British Columbia have taught robots to do something similar.

Tuesday, May 6, 2025

The CausVid model can quickly generate clips from a simple text prompt, creating many imaginative and artistic scenes (Credits: Alex Shipps/MIT CSAIL, using AI-generated images from the researchers).

CSAIL article

Hybrid AI model crafts smooth, high-quality videos in seconds

What would a behind-the-scenes look at a video generated by an artificial intelligence model be like? You might think the process is similar to stop-motion animation, where many images are created and stitched together, but that’s not quite the case for “diffusion models” like OpenAl's SORA and Google's VEO 2.

Thursday, May 1, 2025

“Architecture is the only discipline to bring everybody together, because it means rethinking the built environment, the places we all live,” says MIT professor of the practice Carlo Ratti, curator of the Venice Biennale’s 19th International Architecture Exhibition (Credits: Stephanie Fuessenich).

CSAIL article

At the Venice Biennale, design through flexible thinking

When the Venice Biennale’s 19th International Architecture Exhibition launches on May 10, its guiding theme will be applying nimble, flexible intelligence to a demanding world — an ongoing focus of its curator, MIT faculty member Carlo Ratti.

Tuesday, April 22, 2025

PhD student Faraz Faruqi, lead author of a new paper on the project, says that TactStyle could have far-reaching applications extending from home decor and personal accessories to tactile learning tools (Credits: Mike Grimmett/MIT CSAIL).

CSAIL article

3D modeling you can feel

Essential for many industries ranging from Hollywood computer-generated imagery to product design, 3D modeling tools often use text or image prompts to dictate different aspects of visual appearance, like color and form. As much as this makes sense as a first point of contact, these systems are still limited in their realism due to their neglect of something central to the human experience: touch.

Wednesday, April 16, 2025

The models were trained on a dataset of synthetic images like the ones pictured, with objects such as tea kettles or calculators superimposed on different backgrounds. Researchers trained the model to identify one or more spatial features of an object, including rotation, location, and distance (Credits: Courtesy of the researchers).

CSAIL article

A visual pathway in the brain may do more than recognize objects

When visual information enters the brain, it travels through two pathways that process different aspects of the input. For decades, scientists have hypothesized that one of these pathways, the ventral visual stream, is responsible for recognizing objects, and that it might have been optimized by evolution to do just that.

Monday, April 7, 2025

InteRecon can recreate the interaction functions in the physical world, such as the head motions of your favorite bobblehead, the music on your old iPod, and the way your doll moves (Credits: Alex Shipps/MIT CSAIL, with elements from the researchers).

CSAIL article

A new way to bring personal items to mixed reality

Think of your most prized belongings. In an increasingly virtual world, wouldn’t it be great to save a copy of that precious item and all the memories it holds?

Subscribe to Entertainment/Media