Student Posters
Guardrails for LLMs Supporting Security | Anahita Srinivasan

Anahita Srinivasan is a third-year undergraduate student majoring in 6-2 (electrical engineering and computer science). She is currently conducting research at the Anyscale Learning For All (ALFA) lab at CSAIL as a Nadar Foundation Undergraduate Research and Innovation Scholar with the SuperUROP program. Her research focuses on two main topics: evaluating the cybersecurity knowledge of large language models and generating PDDL (Planning Domain Definition Language) files with LLMs using a combination of tactics including prompting, output parsing, and context-free grammar enforcement. Outside of her research, she enjoys reading, taking walks, and exploring Boston.

See poster here.

Bottom-up Standardization for Data Preparation | Eugenie Lai

Eugenie Lai is a third-year PhD student advised by Principal Research Scientist Michael Cafarella in the Data Systems Group at MIT CSAIL. With today's explosion of data, more and more people in fields outside of CS need to access and make use of data. Her current research focuses on developing methods to help users interact with and make sense of data.

Abstract

Data preparation is an essential step in every data-related effort, from scientific projects in academia to data-driven decision-making in industry. Typically, data preparation is not the novel or interesting piece of a project—it transforms raw data into a format that enables further innovative work. Because data preparation scripts are never intended to be interesting, are project-specific, and are written in general-purpose languages, they can be tedious to understand and check. As a result, data preparation scripts can easily become a breeding ground for poor engineering and statistical practices.

Ideally, data preparation scripts are "admirably boring": they should serve the project, but otherwise be as simple and as standard as possible. We propose a bottom-up script standardization framework that takes a user's data preparation script and transforms it into a simpler, more standardized, more boring version of itself. Our framework treats the user's input script not as an unchangeable definition of correctness, but as a semantic sketch of the user's overall intent. We present an algorithmic framework and a prototype implementation, and we evaluate our approach against state-of-the-art methods, including GPT-4, on six real-world datasets. Our approach improves script standardization by 39.5% without meaningfully changing the user's intent, whereas GPT-4 achieves 2.9%.
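
As a purely illustrative sketch (ours, not drawn from the paper), the snippet below shows the flavor of rewrite such a framework targets: a hand-rolled cleaning loop replaced by equivalent, more standard pandas idioms that preserve the script's intent.

```python
import pandas as pd

df = pd.DataFrame({"age": [34, None, 51], "income": [72_000, 58_000, None]})

# Before: a hand-rolled, hard-to-audit cleaning loop.
cleaned = []
for _, row in df.iterrows():
    if not pd.isna(row["age"]):
        r = dict(row)
        r["income"] = r["income"] if not pd.isna(r["income"]) else 0
        cleaned.append(r)
manual = pd.DataFrame(cleaned)

# After: the same intent expressed with standard pandas idioms.
standardized = df.dropna(subset=["age"]).fillna({"income": 0})

# The rewrite is "boring" but semantically equivalent.
assert manual.reset_index(drop=True).equals(standardized.reset_index(drop=True))
```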

View poster here.

Tailors: Accelerating Sparse Tensor Algebra by Overbooking Buffer Occupancy | Zi Yu (Fisher) Xue

Fisher Xue received his BASc in Computer Engineering from the University of British Columbia, Canada in 2022. In 2020, he worked in the Data Center Solutions Group at Intel, Vancouver, as an architecture performance modeling intern. He is currently pursuing his PhD under the supervision of Associate Professor Vivienne Sze and Professor of the Practice Joel Emer. His current research focuses on the design of tensor accelerators for sparse data. 

Abstract

Sparse tensor algebra is a challenging class of workloads to accelerate due to low arithmetic intensity and varying sparsity patterns. Prior sparse tensor algebra accelerators have explored tiling sparse data to increase exploitable data reuse and improve throughput, but typically allocate tile size in a given buffer for the worst-case data occupancy. This severely limits the utilization of available memory resources and reduces data reuse. Other accelerators employ complex tiling during preprocessing or at runtime to determine the exact tile size based on its occupancy. This work proposes a speculative tensor tiling approach, called "overbooking", to improve buffer utilization by taking advantage of the distribution of nonzero elements in sparse tensors to construct larger tiles with greater data reuse. To ensure correctness, we propose a low-overhead hardware mechanism, Tailors, that can tolerate data overflow by design while ensuring reasonable data reuse. We demonstrate that Tailors can be easily integrated into the memory hierarchy of an existing sparse tensor algebra accelerator. To ensure high buffer utilization with minimal tiling overhead, we introduce a statistical approach, Swiftiles, to pick a tile size so that tiles usually fit within the buffer's capacity, but can potentially overflow, i.e., it overbooks the buffers. Across a suite of 22 sparse tensor algebra workloads, we show that our proposed overbooking strategy introduces an average speedup of 52.7x and 2.3x and an average energy reduction of 22.5x and 2.5x over ExTensor without and with optimized tiling, respectively.
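
To illustrate the statistical idea (a toy sketch of ours, not the paper's exact algorithm), the following snippet picks a tile size from the empirical nonzero distribution so that most, but not necessarily all, tiles fit the buffer:

```python
import numpy as np

def pick_tile_size(nnz_per_row, buffer_capacity, fit_fraction=0.9):
    """Toy sketch of Swiftiles-style overbooking: grow the tile (rows per
    tile) until fewer than `fit_fraction` of tiles are expected to fit in
    the buffer. Occupancy is estimated from the nonzero distribution rather
    than provisioned for the worst case."""
    rng = np.random.default_rng(0)
    best = 1
    for rows_per_tile in range(1, len(nnz_per_row) + 1):
        # Estimate tile occupancy by sampling groups of rows.
        samples = rng.choice(nnz_per_row, size=(1000, rows_per_tile)).sum(axis=1)
        if np.mean(samples <= buffer_capacity) < fit_fraction:
            break
        best = rows_per_tile
    return best

# Example: a skewed nonzero distribution, as is typical of sparse tensors.
nnz = np.random.default_rng(1).zipf(2.0, size=4096).clip(max=256)
print(pick_tile_size(nnz, buffer_capacity=512))
```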

See the poster here

Detection, Creation, and Evaluation of Role-Play based Jailbreak Attacks in Large Language Models | Zach Johnson

Zach Johnson is a Master of Engineering student in MIT CSAIL's Decentralized Information Group, advised by Principal Research Scientist Lalana Kagal. He is primarily interested in the ethics and safety of Large Language Models (LLMs), and in creating algorithmic methods for detecting adversarial attacks on these models.

Abstract

Large Language Models (LLMs) are generally robust against outright harmful requests (e.g., "Tell me how to build a bomb"). However, there are ways of circumventing these guardrails, called jailbreak attacks, in which a user instructs the LLM to take on the role of a made-up character that does not abide by certain ethical guidelines. These attacks have proven quite successful at eliciting harmful information from LLMs. They also require no algorithmic generation, meaning the pool of users who can mount them is much larger than usual. In this project, we explore the current landscape of jailbreak attack prompts in LLMs: how they can be automatically generated, how we can automatically detect when a jailbreak has taken place, and how we can automatically identify these role-play prompts in order to prevent future attacks. We hope this work provides key insights and methods for robustly defending against future unforeseen jailbreak attacks.

See poster here.

Constrained Bimanual Planning with Analytic Inverse Kinematics | Thomas Cohn

Thomas Cohn is a computer science PhD student at MIT, advised by Professor Russ Tedrake in the Robot Locomotion Group, whose research focuses on motion planning, primarily for robotic manipulation tasks. Thomas is originally from Cambridge, Massachusetts, but grew up in Ann Arbor, Michigan. He has been passionate about mathematics and computer programming for most of his life, and developed an interest in robotics in high school as a member of FIRST Robotics Team 3322. During his undergraduate studies, he worked as a research assistant at the Laboratory for Progress under the direction of Professor Chad Jenkins. Thomas is particularly interested in developing novel algorithms that exploit underlying geometric structure to help robots move faster, more safely, and more elegantly.

Abstract

In order for a bimanual robot to manipulate an object that is held by both hands, it must construct motion plans such that the transformation between its end effectors remains fixed. This amounts to complicated nonlinear equality constraints in the configuration space, which are difficult for trajectory optimizers. In addition, the set of feasible configurations becomes a measure-zero set, which presents a challenge to sampling-based motion planners. We leverage an analytic solution to the inverse kinematics problem to parametrize the configuration space, resulting in a lower-dimensional representation where the set of valid configurations has positive measure. We describe how to use this parametrization with existing motion planning algorithms, including sampling-based approaches, trajectory optimizers, and techniques that plan through convex inner approximations of collision-free space.
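
The following toy sketch (ours, using two 2-link planar arms rather than real manipulators) illustrates the core idea: sample one arm freely, then place the other arm by closed-form inverse kinematics so the end-effector transform stays fixed, yielding constraint-satisfying samples with positive measure in the reduced space.

```python
import numpy as np

L1, L2 = 1.0, 1.0                      # link lengths (both arms identical)
BASE_A, BASE_B = np.array([0.0, 0.0]), np.array([1.5, 0.0])
GRASP_OFFSET = np.array([0.4, 0.0])    # fixed transform between end effectors

def fk(base, q):
    """Forward kinematics of a 2-link planar arm."""
    x = base[0] + L1 * np.cos(q[0]) + L2 * np.cos(q[0] + q[1])
    y = base[1] + L1 * np.sin(q[0]) + L2 * np.sin(q[0] + q[1])
    return np.array([x, y])

def analytic_ik(base, target):
    """Closed-form elbow-down IK; returns None if the target is unreachable."""
    d = target - base
    c2 = (d @ d - L1**2 - L2**2) / (2 * L1 * L2)
    if not -1.0 <= c2 <= 1.0:
        return None
    q2 = np.arccos(c2)
    q1 = np.arctan2(d[1], d[0]) - np.arctan2(L2 * np.sin(q2), L1 + L2 * np.cos(q2))
    return np.array([q1, q2])

def sample_constrained_config(rng):
    """Sample arm A freely, then place arm B by analytic IK so the
    end-effector transform stays fixed."""
    while True:
        qa = rng.uniform(-np.pi, np.pi, size=2)
        qb = analytic_ik(BASE_B, fk(BASE_A, qa) + GRASP_OFFSET)
        if qb is not None:
            return qa, qb

rng = np.random.default_rng(0)
qa, qb = sample_constrained_config(rng)
assert np.allclose(fk(BASE_B, qb), fk(BASE_A, qa) + GRASP_OFFSET)
```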

View poster here.

LoopTree: Exploring a More Extensive Fused-layer Dataflow Accelerator Design Space | Michael Gilbert

Michael Gilbert (he/him) is a PhD student in computer architecture, specializing in accelerator architectures. He works in the Energy-Efficient Multimedia Systems group at the Massachusetts Institute of Technology, co-advised by Associate Professor Vivienne Sze and Professor of the Practice Joel Emer.

Michael’s research is on fused-layer dataflow accelerators, focusing on the trade-offs among reusing data within and across layers, retaining versus recomputing data, and exploiting parallelism both within a layer and across multiple layers. He presented his work at the 2023 International Symposium on Performance Analysis of Systems and Software (ISPASS).

Previously, Michael earned his BS and MEng from the Massachusetts Institute of Technology. In his free time, he enjoys hiking, biking, and cooking.

See poster here

BrainBERT: Self-supervised representation learning for intracranial recordings | Christopher Wang

Christopher Wang is a PhD student advised by Principal Research Scientist Boris Katz and Research Scientist Andrei Barbu at the InfoLab in MIT CSAIL. He is interested in NLP, machine learning, and neuroscience. 

Abstract

We create a reusable Transformer, BrainBERT, for intracranial recordings, bringing modern representation learning approaches to neuroscience. Much like in NLP and speech recognition, this Transformer enables classifying complex concepts, i.e., decoding neural data, with higher accuracy and much less data, because it is pretrained in an unsupervised manner on a large corpus of unannotated neural recordings. Our approach generalizes to new subjects with electrodes in new positions and to unrelated tasks, showing that the representations robustly disentangle the neural signal. Just as in NLP, where one can study language by investigating what a language model learns, this approach opens the door to investigating the brain through what a model of the brain learns. As a first step along this path, we demonstrate a new analysis of the intrinsic dimensionality of the computations in different areas of the brain. To construct these representations, we combine a technique for producing super-resolution spectrograms of neural data with a masking-based approach designed for generating contextual representations of audio. In the future, far more concepts will be decodable from neural recordings using representation learning, potentially unlocking the brain the way language models unlocked language.
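
As a rough illustration of the masking pretext task (our sketch, not the exact BrainBERT masking scheme), the snippet below zeroes out random time-frequency patches of a spectrogram and computes a reconstruction loss only on the masked positions:

```python
import numpy as np

def mask_spectrogram(spec, rng, mask_frac=0.15, patch=(4, 8)):
    """Toy masking pretext task: zero out random time-frequency patches;
    the model is trained to reconstruct them. Illustrative only."""
    masked = spec.copy()
    mask = np.zeros_like(spec, dtype=bool)
    n_freq, n_time = spec.shape
    n_patches = int(mask_frac * spec.size / (patch[0] * patch[1]))
    for _ in range(n_patches):
        f = rng.integers(0, n_freq - patch[0] + 1)
        t = rng.integers(0, n_time - patch[1] + 1)
        masked[f:f + patch[0], t:t + patch[1]] = 0.0
        mask[f:f + patch[0], t:t + patch[1]] = True
    return masked, mask

rng = np.random.default_rng(0)
spec = rng.standard_normal((40, 200))        # freq bins x time steps
masked, mask = mask_spectrogram(spec, rng)

# Pretraining loss: reconstruction error on masked positions only
# (a zero array stands in for the model's reconstruction here).
recon = np.zeros_like(spec)
loss = np.mean((recon[mask] - spec[mask]) ** 2)
```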

View poster here

Safe Online Spaces Using Meronymity & Trust-Based Human Moderation | Nouran Soliman

Nouran is pursuing her PhD at MIT CSAIL, where she is part of the Haystack Research Group under the supervision of Professor David Karger. She is also mentored by Farnaz Jahanbakhsh, an Assistant Professor at the University of Michigan. Her research lies at the intersection of human-computer interaction, social computing, and AI. She designs, builds, and tests novel computing systems aimed at enhancing user experiences, safety, trust, and interactions on the web. This involves addressing crucial aspects such as identity representation and content moderation within online spaces, especially social media platforms. She has also worked on ML projects for boosting team productivity and for healthcare. Her work experience includes internships at Adobe Research, the University of Illinois at Urbana-Champaign, the University of California, Berkeley, Microsoft Research, and the Allen Institute for AI. She has been recognized as an Adobe Research Scholar and a Generation Google Scholar, and her work has been featured in articles by MIT News and Microsoft.

Abstract

Enhancing Online Spaces for Safety and Free Expression: Our research is dedicated to pioneering new design paradigms and computing systems that bolster safety and trust online, aiming to balance the essentials of free speech with the need for secure environments. Our focused objectives include:

- Meronymity in Digital Interactions: We are redefining digital communication by enabling selective identity disclosure through meronymity, offering credibility and protection while fostering open engagement. Proven effective in academic settings, this approach promises wider benefits by reducing social anxieties and facilitating easier exchanges.
- Extending Meronymity Across Social Media: We seek to apply meronymous communication broadly, particularly on social media, to support nuanced dialogues and aid minority and vulnerable groups. A crucial aspect of this expansion is authenticating identity elements through human verification to maintain trust.
- User-Centric Trust-Based Moderation: At the core of our approach is a trust-based moderation system, empowering users to customize their online spaces to their comfort levels, thereby making content moderation more relevant and individualized.
- AI and LLMs for Enhanced Interactions: The exploration of leveraging artificial intelligence and large language models raises vital questions about their integration into content moderation, with the goal of achieving personalized moderation that respects user nuances.

Through these endeavors, we aim to create online platforms that mirror the complexity of human interaction, where the principles of safety, trust, and open dialogue intertwine, ensuring every participant feels safe, heard, and understood.

See poster here

Predicting Clinical Trial Outcomes from US Patent Data Using a Large Language Model | Joonhyuk Cho

Joonhyuk Cho is a PhD candidate in the EECS (Electrical Engineering and Computer Science) department at MIT. His research lies at the intersection of data science, machine learning, and statistics, applied to the healthcare and finance domains.

Abstract

This study uses large language models (LLMs) to predict clinical trial outcomes based on patent data. A rich source of early-stage drug information, this data was analyzed via an LLM to forecast the outcomes of clinical trials in phases 1 through 3. Preliminary results indicate that the patent data's predictive strength increases with each trial phase (AUC-ROC of 0.60 for phase 1, AUC-ROC of 0.72 for phase 3). This suggests that investors can assess financial risk based on findings from the preclinical stage, making therapeutic development a more attractive investment and bringing more capital into the sector. LLMs offer a new platform for risk mitigation and investment decision-making in the biopharma industry.

See poster here

From Logs to Causal Analysis: Extracting Data to Diagnose Large Systems | Markos Markakis

Markos Markakis is interested in enabling the efficient management of big data by designing novel high-performance data systems. To that end, he is currently working towards a PhD at the intersection of data systems and machine learning. He is a member of the Data Systems Group at the Computer Science and Artificial Intelligence Laboratory (CSAIL) of the Massachusetts Institute of Technology. During his time at MIT, he has worked on projects with Associate Professor Tim Kraska, Principal Research Scientist Michael Cafarella, and Professor Samuel Madden.

Before joining MIT, he earned his Bachelor of Science in Engineering (BSE) in Electrical Engineering from Princeton University, alongside a certificate (minor) in Applications of Computing. For his undergraduate thesis, he worked with Professor Margaret Martonosi on efficient memory consistency testing, as well as on formal verification for the DECADES project.

Abstract

Causal analysis is an essential lens for understanding complex system dynamics in domains as varied as medicine, economics, and law. Computer systems often exhibit a similar level of complexity, but much of the information that could help subject them to causal analysis is only available in long, messy, semi-structured log files.

In this work, we want to apply causal reasoning to make large systems debugging easier. In order to transform logs into a representation amenable to causal analysis, we employ methods drawn from the areas of data transformation, cleaning, and extraction. We also present algorithms for how to efficiently use this representation for causal discovery—the task of constructing a causal model of the system from available data. Our proposed framework gives log-derived variables human-understandable names and distills the information present in a log file around a user's chosen causal units (e.g. users or machines), generating appropriate aggregated variables for each causal unit. It then exposes these variables through an interactive interface that users can leverage to recover the portion of the causal model relevant to their question. This makes causal inference possible using off-the-shelf tools.

We evaluate our framework using Sawmill, a prototype implementation, on both real-world and synthetic log datasets and find it to be:
1. Accurate, achieving on average across datasets a mean reciprocal rank of 0.8018 for relevant causes (41.89% better than the next best baseline) and a mean ATE error of 20.27% (compared to 31.26% for the next best baseline);
2. Computationally efficient, requiring a mean of 346.92s (4.30s per MiB of log data) to go from a log file to a quantitative effect, while scaling linearly with log complexity;
3. Humanly efficient, requiring between 6 and 10 user interactions end-to-end; and
4. Complete, with each component of our framework shown both to add value and to be efficient.
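
As a toy illustration of the extraction-and-aggregation step described above (our sketch, with a hypothetical log format), the snippet below parses semi-structured lines into typed variables and aggregates them around a chosen causal unit:

```python
import re
import pandas as pd

log = """\
2024-01-02 10:00:01 user=alice latency_ms=120 status=OK
2024-01-02 10:00:05 user=bob latency_ms=950 status=TIMEOUT
2024-01-02 10:00:09 user=alice latency_ms=140 status=OK
"""

# Parse semi-structured lines into typed variables (hypothetical format).
pattern = re.compile(r"user=(\w+) latency_ms=(\d+) status=(\w+)")
rows = [
    {"user": m[0], "latency_ms": int(m[1]), "timeout": m[2] == "TIMEOUT"}
    for m in pattern.findall(log)
]
df = pd.DataFrame(rows)

# Aggregate around the chosen causal unit (here: `user`), yielding one
# human-readable variable vector per unit for downstream causal tools.
per_unit = df.groupby("user").agg(
    mean_latency_ms=("latency_ms", "mean"),
    timeout_rate=("timeout", "mean"),
)
print(per_unit)
```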

See poster here

TopoOpt: Co-optimizing Network Topology and Parallelization Strategy for Distributed Training Jobs | Weiyang Wang


Weiyang "Frank" Wang is a fourth-year PhD student working at MIT CSAIL with Associate Professor Manya Ghobadi. His research centers on computer networking, with a focus on machine learning systems, reconfigurable networks, and traffic engineering. He is passionate about advancing the use of reconfigurable network technologies to enhance the efficiency of future distributed applications.

Abstract

We propose TopoOpt, a novel direct-connect fabric for deep neural network (DNN) training workloads. TopoOpt co-optimizes the distributed training process across three dimensions: computation, communication, and network topology. We demonstrate the mutability of AllReduce traffic, and leverage this property to construct efficient network topologies for DNN training jobs. TopoOpt then uses an alternating optimization technique and a group theory-inspired algorithm called TotientPerms to find the best network topology and routing plan, together with a parallelization strategy. We build a fully functional 12-node direct-connect prototype with remote direct memory access (RDMA) forwarding at 100 Gbps. Large-scale simulations on real distributed training models show that, compared to similar-cost Fat-Tree interconnects, TopoOpt reduces DNN training time by up to 3.4x.
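
As a heavily simplified reading of the "TotientPerms" name (our illustration, not the paper's algorithm), the snippet below enumerates stride permutations whose strides are coprime with the node count; each such permutation is a single cycle over all nodes, a natural building block for direct-connect topologies:

```python
import math

def totient_perms(n):
    """Toy sketch: stride permutations i -> (i + k) mod n for every stride k
    coprime with n. Because gcd(k, n) == 1, each permutation is one n-cycle,
    so a few strides together yield a connected, low-diameter topology."""
    return [
        [(i + k) % n for i in range(n)]
        for k in range(1, n)
        if math.gcd(k, n) == 1
    ]

perms = totient_perms(12)
print(len(perms))     # Euler's totient of 12 = 4 candidate permutations
print(perms[0][:6])   # under stride 1, node i connects to i+1 (mod 12)
```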

See poster here

Revealing Vision-Language Integration in the Brain with Multimodal Networks | Vighnesh Subramaniam

Abstract

We use multimodal deep neural networks to identify sites of multimodal integration in the human brain and to investigate how well multimodal deep neural networks model integration in the brain. Sites of multimodal integration are regions where a multimodal language-vision model predicts neural recordings (stereoelectroencephalography, SEEG) better than a unimodal language model, a unimodal vision model, or a linearly integrated language-vision model. We use a range of state-of-the-art models spanning different architectures, including Transformers and CNNs, with different multimodal integration approaches to model the SEEG signal while subjects watched movies. As a key enabling step, we first demonstrate that the approach has the resolution to distinguish trained from randomly initialized models for both language and vision; the inability to do so would fundamentally hinder further analysis. We show that trained models systematically outperform randomly initialized models in their ability to predict the SEEG signal. We then compare unimodal and multimodal models against one another. Since the models all have different architectures, numbers of parameters, and training sets, which can obscure the results, we also carry out a test between two controlled models, SLIP-Combo and SLIP-SimCLR, which keep all of these attributes the same aside from multimodal input. Our first key contribution identifies neural sites (on average 141 of 1090 total sites, or 12.94%) and brain regions where multimodal integration is occurring. We find numerous new sites of multimodal integration, many of which lie around the temporoparietal junction, long theorized to be a hub of multimodal integration. Our second key contribution finds that CLIP-style training is best suited for modeling multimodal integration in the brain when analyzing different methods of multimodal integration and how they model the brain.

View poster here

DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models | Yung-Sung Chuang

Yung-Sung has been an EECS PhD student at MIT CSAIL since 2021, working in the Spoken Language Systems group under the supervision of Senior Research Scientist Jim Glass. His research broadly covers deep learning techniques for natural language processing (NLP), especially representation learning of natural language, which supports downstream tasks such as natural language understanding, natural language generation, and question answering.

Abstract

Despite their impressive capabilities, large language models (LLMs) are prone to hallucinations, i.e., generating content that deviates from facts seen during pretraining. We propose a simple decoding strategy for reducing hallucinations with pretrained LLMs that does not require conditioning on retrieved external knowledge or additional fine-tuning. Our approach obtains the next-token distribution by contrasting the logits obtained from projecting later layers versus earlier layers to the vocabulary space, exploiting the fact that factual knowledge in an LLM has generally been shown to be localized to particular transformer layers. We find that this Decoding by Contrasting Layers (DoLa) approach is able to better surface factual knowledge and reduce the generation of incorrect facts. DoLa consistently improves truthfulness across multiple-choice and open-ended generation tasks, for example improving the performance of LLaMA family models on TruthfulQA by 12-17 absolute percentage points, demonstrating its potential for making LLMs reliably generate truthful facts.
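
A simplified sketch of the contrast step (ours; DoLa's dynamic selection of the premature layer is omitted) might look like the following, where the next token is chosen from the difference of log-probabilities between a final and an early layer, restricted to tokens the final layer deems plausible:

```python
import numpy as np

def dola_next_token(logits_early, logits_final, alpha=0.1):
    """Contrast log-probabilities of a final (mature) layer against an
    earlier (premature) layer, over tokens deemed plausible by the final
    layer. Simplified sketch of the DoLa decoding rule."""
    log_p_final = logits_final - np.logaddexp.reduce(logits_final)
    log_p_early = logits_early - np.logaddexp.reduce(logits_early)
    # Adaptive plausibility constraint: keep tokens near the final layer's max.
    plausible = log_p_final >= np.log(alpha) + log_p_final.max()
    contrast = np.where(plausible, log_p_final - log_p_early, -np.inf)
    return int(np.argmax(contrast))

rng = np.random.default_rng(0)
early, final = rng.standard_normal(50), rng.standard_normal(50)
print(dola_next_token(early, final))
```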

See poster here.

Open-Set Semantic Mapping and Localization | Kurran Singh

Kurran Singh is a PhD student in the Marine Robotics Group at CSAIL working on how underwater and terrestrial robots can understand the world in terms of objects. He completed his undergraduate degree at Northeastern University, where he majored in computer science and business administration, and received his master's degree in Mechanical Engineering from MIT in 2022.

Abstract

Enabling robots to understand the world in terms of objects is a critical building block towards higher-level autonomy. The success of foundation models in vision has created the ability to segment and identify nearly all objects in the world. However, utilizing such objects to localize the robot and build an open-set semantic map of the world remains an open research question. In this work, a system for identifying, localizing, and encoding objects is tightly coupled with probabilistic graphical models to perform open-set semantic simultaneous localization and mapping (SLAM). Results demonstrate that the proposed lightweight object encoding enables more accurate object-based SLAM than existing open-set, closed-set, and geometric methods, while incurring smaller computational overhead than existing open-set mapping methods.

See poster here

Towards Verifiable Text Generation for Developing Trustworthy LLMs | Shannon Shen

Shannon Shen is a second-year PhD student at MIT, advised by Professor David Sontag. His research lies at the intersection of NLP and HCI; currently he focuses on building novel AI models and interactions to better support end users, including domain experts like clinicians and lawyers. He has also developed widely used document-parsing software that has been downloaded more than 1 million times and won a best demo paper award at EMNLP.

Abstract

Large language models (LLMs) have demonstrated an impressive ability to synthesize plausible and fluent text. However, they remain vulnerable to hallucinations, so their outputs generally require manual human verification for high-stakes applications, which can be time-consuming and difficult.

We propose symbolically grounded generation (SymGen) as a simple approach for enabling easier validation of an LLM's output. SymGen prompts an LLM to interleave its regular output text with explicit symbolic references to fields present in some conditioning data (e.g., a table in JSON format). The references can be used to display the provenance of different spans of text in the generation, reducing the effort required for manual verification. Across data-to-text and question-answering experiments, we find that LLMs are able to directly output text that makes use of symbolic references while maintaining fluency and accuracy.
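
As a minimal illustration (ours; the reference syntax is hypothetical), the snippet below resolves symbolic references in a generation against a JSON record while recording the provenance of each substituted span:

```python
import re
import json

data = json.loads('{"name": "Aspirin", "phase": "3", "sponsor": "Acme"}')

# Hypothetical model output with symbolic references instead of copied values.
generation = "The {{name}} trial, sponsored by {{sponsor}}, is in phase {{phase}}."

def resolve(text, record):
    """Replace each symbolic reference with its field value, recording the
    provenance of every substituted span for display or verification."""
    provenance = []
    def sub(m):
        key = m.group(1)
        provenance.append((key, record[key]))
        return str(record[key])
    return re.sub(r"\{\{(\w+)\}\}", sub, text), provenance

text, prov = resolve(generation, data)
print(text)   # The Aspirin trial, sponsored by Acme, is in phase 3.
print(prov)   # [('name', 'Aspirin'), ('sponsor', 'Acme'), ('phase', '3')]
```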

See poster here

Making things go wrong…for educational purposes | Brian Cheung


Brian Cheung is a postdoctoral researcher at the InfoLab in CSAIL. He received his PhD from UC Berkeley, where he studied the emergence of structured representations in neural networks. Brian's research explores the correspondence between artificial and biological intelligence at the representational, perceptual, and structural levels. His goal is to develop a theory of intelligence that provides insights into the essence of both artificial and natural intelligence.

Abstract

Recent advancements in artificial intelligence have led to the development of vision systems that closely resemble primate visual systems in terms of behavior and neural recordings. However, due to the convergence of representations, the improvement gained by training with more data shows diminishing returns. To identify individual differences among high-performing AI systems and compare them with primate visual systems, we propose leveraging stimuli on which AI models disagree, which maximally distinguish the models and enable a more fine-grained comparison. By systematically exploring this narrowed space of identifiable stimuli, we aim to break the noise barrier and provide a novel approach for AI-primate vision system comparisons at scale.
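
One simple way to operationalize model disagreement (our illustrative metric, not necessarily the one used in this work) is to rank stimuli by the Jensen-Shannon divergence between two models' output distributions:

```python
import numpy as np

def disagreement_ranking(p_model_a, p_model_b):
    """Rank stimuli by how strongly two models disagree, measured with
    Jensen-Shannon divergence between their class distributions."""
    m = 0.5 * (p_model_a + p_model_b)
    def kl(p, q):
        return np.sum(p * np.log((p + 1e-12) / (q + 1e-12)), axis=-1)
    jsd = 0.5 * kl(p_model_a, m) + 0.5 * kl(p_model_b, m)
    return np.argsort(-jsd)          # most model-distinguishing stimuli first

rng = np.random.default_rng(0)
a = rng.dirichlet(np.ones(10), size=100)   # model A: 100 stimuli x 10 classes
b = rng.dirichlet(np.ones(10), size=100)   # model B
print(disagreement_ranking(a, b)[:5])
```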

View poster here

When ChatGPT Becomes a Hiring Manager: On Bias in LLMs | Nina Gerszberg


Nina Gerszberg graduated from MIT last year with a bachelor's degree in electrical engineering and computer science. She is now an MEng student with a concentration in artificial intelligence. Her research in Professor Andrew Lo's Laboratory for Financial Engineering explores applications of large language models. She also has prior industry experience at Five Rings Capital and Microsoft.

Abstract

In recent years, large language models have undergone significant advancements, emerging as powerful tools widely used across various industries. Unfortunately, large language models (LLMs) have been found to exhibit many of the same biases as their creators. This research focuses on the application of LLMs to hiring, quantifying how LLMs perpetuate biases originating from their training data, and investigates prompting as a bias mitigation technique. Our findings suggest that, for a given resume, a large language model is more likely to hire a candidate and perceive them as more qualified if the candidate is female, but still recommends lower pay relative to male candidates.

See poster here

ScribblePrompt: Fast and Flexible Interactive Segmentation for Any Biomedical Image | Hallee Wong

Hallee Wong is a fourth-year EECS PhD student in the Clinical and Applied Machine Learning Group, advised by Professor John Guttag and CSAIL Research Scientist Adrian Dalca. Her research focuses on developing human-centered AI systems for interactive analysis of medical images. Her work spans computer vision, deep learning, and human-computer interaction. Prior to MIT, she worked in health economics and outcomes research at Analysis Group Inc. and graduated with a BA in mathematics from Williams College.

Abstract

Medical image segmentation is a crucial part of both scientific research and clinical care. With enough labelled data, deep learning models can be trained to accurately automate specific medical image segmentation tasks. However, manually segmenting images to create training data is highly labor intensive and requires domain expertise. We present ScribblePrompt, a flexible neural network based interactive segmentation tool for biomedical imaging that enables human annotators to segment previously unseen structures using scribbles, clicks, and bounding boxes. Through rigorous quantitative experiments, we demonstrate that given comparable amounts of interaction, ScribblePrompt produces more accurate segmentations than previous methods on datasets unseen during training. In a user study with domain experts, ScribblePrompt reduced annotation time by 28% while improving Dice by 15% compared to the next best method. ScribblePrompt's success rests on a set of careful design decisions. These include a training strategy that incorporates both a highly diverse set of images and tasks, new algorithms for simulated user interactions and labels, and a network that enables fast inference. 

View poster here

CASSINI: Network-Aware Job Scheduling in Machine Learning Clusters | Sudarsanan Rajasekaran

Sudarsanan Rajasekaran is a fourth-year PhD student in MIT's Electrical Engineering and Computer Science Department. Advised by Associate Professor Manya Ghobadi, he works in the field of data center networks. Sudarsanan specializes in advancing network systems tailored for data-center applications, with a particular focus on distributed machine-learning training. His primary objective is to enhance network resource allocation in scenarios where multiple machine learning training applications coexist on the network. Originally from India, he completed his undergraduate studies in Computer Science and Engineering at the Indian Institute of Technology Bombay.

Abstract

We present CASSINI, a network-aware job scheduler for machine learning (ML) clusters. CASSINI introduces a novel geometric abstraction that considers the communication pattern of different jobs while placing them on network links. To do so, CASSINI uses an affinity graph to find a series of time-shift values that adjust the communication phases of a subset of jobs, so that jobs sharing the same network link interleave their communication patterns with each other. Experiments with 13 common ML models on a 24-server testbed demonstrate that, compared to state-of-the-art ML schedulers, CASSINI improves the average and tail completion time of jobs by up to 1.6× and 2.5×, respectively. Moreover, we show that CASSINI reduces the number of ECN-marked packets in the cluster by up to 33×.
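
As a toy version of the geometric abstraction (ours, much simpler than CASSINI's affinity-graph formulation), the snippet below treats each job's link usage as a periodic 0/1 pattern and searches for the circular time shift that minimizes overlap on a shared link:

```python
import numpy as np

def best_time_shift(pattern_a, pattern_b):
    """Each job's link usage is a periodic 0/1 pattern; pick the circular
    shift of job B that minimizes simultaneous demand on the shared link."""
    overlaps = [
        np.sum(pattern_a * np.roll(pattern_b, s)) for s in range(len(pattern_b))
    ]
    return int(np.argmin(overlaps)), int(min(overlaps))

# Two jobs that each communicate for half of their iteration period.
job_a = np.array([1, 1, 1, 0, 0, 0])
job_b = np.array([1, 1, 1, 0, 0, 0])
shift, overlap = best_time_shift(job_a, job_b)
print(shift, overlap)   # shifting job B by 3 slots yields zero overlap
```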

See poster here

SecureLLM: Using Compositionality to Build Provably Secure Language Models for Private, Sensitive, and Secret Data | Christian Arnold

Christian Arnold is an active-duty US Space Force officer and a PhD student in the InfoLab, advised by Principal Research Scientist Boris Katz, on a Department of the Air Force fellowship.

Abstract

Traditional security mechanisms isolate resources from users who should not access them. We reflect the compositional nature of such security mechanisms back into the structure of LLMs to build a provably secure LLM, which we term SecureLLM. Other approaches to LLM safety attempt to protect against bad actors or bad outcomes, but can only do so to an extent, making them inappropriate for sensitive data. SecureLLM blends access security with fine-tuning methods. Each data silo has a separate fine-tuning associated with it, and a user has access only to the collection of fine-tunings that they have permission for. The model must then perform compositional tasks at the intersection of those data silos using the combination of those individual fine-tunings. While applicable to any task, like document QA or making API calls, in this work we concern ourselves with models that learn the layouts of new SQL databases to provide natural-language-to-SQL translation capabilities. Existing fine-tuning composition methods fail in this challenging environment, as they are not well equipped to handle compositional tasks; compositionality remains a challenge for LLMs. We contribute both a difficult new compositional natural-language-to-SQL translation task and a new perspective on LLM security that allows models to be deployed to secure environments today.
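
As a schematic illustration of the access-security idea (our sketch; real silo fine-tunings would be LoRA-style adapters, not dense matrices), a user's model can be composed from exactly the silo fine-tunings they are cleared for:

```python
import numpy as np

# Hypothetical per-silo fine-tunings, e.g. LoRA-style weight deltas.
base_weights = np.zeros((4, 4))
silo_deltas = {
    "hr":      np.full((4, 4), 0.1),
    "finance": np.full((4, 4), 0.2),
    "legal":   np.full((4, 4), 0.3),
}

def compose_for_user(permissions):
    """A user's model is the base model plus only those silo fine-tunings
    they are cleared for; inaccessible silos contribute nothing, by
    construction."""
    return base_weights + sum(silo_deltas[s] for s in permissions)

analyst = compose_for_user({"hr", "finance"})   # never sees 'legal' weights
assert np.allclose(analyst, 0.3)
```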

See poster here


Tyche: Stochastic In-Context Learning for Medical Image Segmentation | Marianne Rakic

Marianne Rakic is a PhD student at CSAIL, working at the intersection of computer vision and healthcare. She is interested in uncertainty and in-context learning.

Abstract

Existing learning-based solutions to medical image segmentation have two important shortcomings. First, for most new segmentation tasks, a new model has to be trained or fine-tuned. This requires extensive resources and machine learning expertise, and is therefore often infeasible for medical researchers and clinicians. Second, most existing segmentation methods produce a single deterministic segmentation mask for a given image. In practice, however, there is often considerable uncertainty about what constitutes the correct segmentation, and different expert annotators will often segment the same image differently. We tackle both of these problems with Tyche, a model that uses a context set to generate stochastic predictions for previously unseen tasks without the need to retrain. Tyche differs from other in-context segmentation methods in two important ways. (1) We introduce a novel convolution block architecture that enables interactions among predictions. (2) We introduce in-context test-time augmentation, a new mechanism to provide prediction stochasticity. When combined with appropriate model design and loss functions, Tyche can predict a set of plausible, diverse segmentation candidates for new or unseen medical images and segmentation tasks without the need to retrain.

See poster here

Federated Clinical Question-Answering | Alice Chen, Emily Jiang

Alice Chen and Emily Jiang are both MEng students concentrating in computer systems. They are currently advised by Principal Research Scientist Lalana Kagal with the Decentralized Information Group (DIG).

Abstract

Electronic health records (EHRs) have become standard in US clinical practice, but the distributed, dynamic, private, and idiosyncratic nature of medical data such as patient notes presents barriers to harnessing the capabilities of Large Language Models (LLMs) for the clinical domain. Retrieval-augmented generation (RAG), in which an LLM is provided with both the input and a list of relevant documents returned by a retrieval phase, has emerged as a technique for domain adaptation on private data. However, existing clinical RAG systems rely on centralized knowledge bases and do not provide an interface for hospitals to regulate and restrict access to sensitive clinical documents, which hinders their feasibility for practical applications. This work contributes (i) a design for leveraging retrieval-augmented LLMs to synthesize natural-language trends over clinical data in federated storage, (ii) a framework for integrating policy-based access control mechanisms at flexible granularity into such a federated RAG system, restricting medical data accesses based on user attributes, and (iii) ClinicalTrendQA, a novel dataset for evaluating model response performance on patient EHR-grounded clinical trend questions.
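
As a minimal sketch of the policy-based access control layer (ours, with hypothetical attributes and policy), documents can be filtered at retrieval time, before anything reaches the LLM context:

```python
# Hypothetical policy check applied at retrieval time; names and
# attributes are illustrative.
documents = [
    {"text": "Note A ...", "hospital": "MGH", "sensitivity": "low"},
    {"text": "Note B ...", "hospital": "MGH", "sensitivity": "high"},
    {"text": "Note C ...", "hospital": "BWH", "sensitivity": "low"},
]

def allowed(doc, user):
    """Attribute-based policy: users see only documents from hospitals they
    are affiliated with, at or below their clearance level."""
    levels = {"low": 0, "high": 1}
    return (doc["hospital"] in user["affiliations"]
            and levels[doc["sensitivity"]] <= levels[user["clearance"]])

user = {"affiliations": {"MGH"}, "clearance": "low"}
retrieved = [d for d in documents if allowed(d, user)]
context = "\n".join(d["text"] for d in retrieved)   # passed to the LLM
print([d["text"] for d in retrieved])               # only Note A survives
```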

See poster here

RAELLA: Reforming the Arithmetic for Efficient, Low-Resolution, and Low-Loss Analog PIM: No Retraining Required | Tanner Andrulis

Tanner Andrulis received BS degrees in Computer Engineering and Math from Purdue University in 2021. He is currently pursuing his PhD degree under the supervision of Associate Professor Vivienne Sze and Professor of the Practice Joel Emer. His research focuses on the design and modeling of tensor accelerators, especially emerging analog and processing-in-memory architectures. Tanner was a recipient of the MIT Presidential Fellowship in 2021.

Abstract

Processing-In-Memory (PIM) accelerators have the potential to efficiently run Deep Neural Network (DNN) inference by reducing costly data movement and by using resistive RAM (ReRAM) for efficient analog compute. Unfortunately, overall PIM accelerator efficiency is limited by energy-intensive analog-to-digital converters (ADCs). Furthermore, existing accelerators that reduce ADC cost do so by changing DNN weights or by using low-resolution ADCs that reduce output fidelity. These strategies harm DNN accuracy and/or require costly DNN retraining to compensate.
To address these issues, we propose the RAELLA architecture. RAELLA adapts the architecture to each DNN; it lowers the resolution of computed analog values by encoding weights to produce near-zero analog values, adaptively slicing weights for each DNN layer, and dynamically slicing inputs through speculation and recovery. Low-resolution analog values allow RAELLA to both use efficient low-resolution ADCs and maintain accuracy without retraining, all while computing with fewer ADC converts.
Compared to other low-accuracy-loss PIM accelerators, RAELLA increases energy efficiency by up to 4.9× and throughput by up to 3.3×. Compared to PIM accelerators that cause accuracy loss and retrain DNNs to recover, RAELLA achieves similar efficiency and throughput without expensive DNN retraining.

See poster here

TacLink: A Compact Multi-phalanx Finger with Vision-based Tactile Sensing and Proprioception | Yuxiang Ma

Abstract

Compared to fully actuated robotic end-effectors, underactuated ones are generally more adaptive, robust, and cost-effective. However, state estimation for underactuated hands is usually more challenging. Vision-based tactile sensors, like GelSight, can mitigate this issue by providing high-resolution tactile sensing and accurate proprioceptive sensing. As such, we present TacLink, a compact, underactuated, linkage-driven robotic finger with low-cost, high-resolution vision-based tactile sensing and proprioceptive sensing capabilities. To reduce the amount of embedded hardware, i.e., the cameras and motors, we optimize the linkage transmission with a planar linkage mechanism simulator and develop a planar reflection simulator to simplify the tactile sensing hardware. As a result, TacLink requires only one motor to actuate the three phalanges, and one camera to capture tactile signals along the entire finger. Overall, TacLink is a compact robotic finger that shows adaptability and robustness when performing grasping tasks. The integration of vision-based tactile sensors can significantly enhance the capabilities of underactuated fingers and potentially broaden their future usage.

See poster here

Social Annotation Systems for Large-Scale Classroom Collaboration and Learning | Prerna Ravi

Prerna is a PhD student at MIT’s Computer Science and Artificial Intelligence Laboratory (CSAIL). Her research focuses on designing AI-powered educational tools, curricula, and learning experiences that foster equity and creativity. She has conducted numerous workshops introducing high school students to programming and mobile app development using AI-powered tools like MIT App Inventor. During her undergrad at Georgia Tech, she conducted ethnographic studies investigating the creative workarounds that educators in marginalized communities devised during COVID-19 to navigate digital resource constraints. She also built educational games that empowered deaf kids (born to hearing parents) to learn sign language during their critical language-learning age of 3-5 years. She has worked with non-profits globally to introduce AI curricula to underserved students by organizing participatory design workshops with their teachers and parents. She is currently investigating how generative AI can be integrated into reflective scaffolding tools to promote students' self-regulated learning and creativity, as well as how educators can devise novel pedagogical strategies to deliver generative AI and data science literacy curricula in diverse learning environments.

Abstract

Social annotation (SA) systems anchor discussions on selected (typically highlighted) parts of a document. SA has gained popularity in education due to its usefulness for learning and mastery of course materials. Social annotation exercises can encourage peer-to-peer learning, critical thinking, metacognitive abilities, and reading comprehension. In this work, we introduce Nota Bene (NB), an open-source social annotation tool developed at MIT that enables collaborative annotation of online documents for educational purposes. With over 40,000 users across numerous universities, NB allows students to annotate course materials directly, fostering synchronous and asynchronous discussions within the content's context. This interactive environment enhances learning by integrating comments and discussions directly alongside the course readings.

View poster here

Startup Posters
Covalent Cloud | Agnostiq

Agnostiq is the company behind Covalent, a cloud-agnostic, entirely Pythonic, and highly scalable computing platform providing on-demand, serverless access to AI infrastructure such as NVIDIA H100s and A100s. With Covalent, users can scale to an arbitrary number of compute resources with a single line of code. Covalent is available open source or as a cloud-hosted / enterprise offering. Learn more at www.covalent.xyz.

Abstract

In this session, we address the intricacies of managing, scaling, and deploying high-compute AI workloads with minimal infrastructure overhead, focusing on the seamless integration of multi-cloud and on-prem resources. Utilizing Python's flexibility, we present a framework that supports the training of multiple agents in parallel for bespoke tasks, facilitates their interaction, and manages their serving efficiently. We delve into strategies for scaling these operations in parallel, all while maintaining a cost-effective posture. This Pythonic approach enables developers to programmatically orchestrate complex AI workflows, from inception to deployment, without the burden of traditional infrastructure management, paving the way for innovative developments in AI with significant reductions in time and cost.
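
As a minimal sketch of this workflow style using Covalent's open-source Python API (the @ct.electron/@ct.lattice decorators and dispatch calls are from the public library; the example itself is ours and assumes a running Covalent server):

```python
import covalent as ct

@ct.electron
def preprocess(n):
    # Stand-in for data preparation; runs as its own task.
    return list(range(n))

@ct.electron
def train(shard):
    # Stand-in for a training step; could target a GPU executor.
    return sum(shard)

@ct.lattice
def workflow(n):
    # The lattice wires electrons into a dispatchable workflow graph.
    return train(preprocess(n))

# Dispatch runs the workflow through the configured executor backend;
# swapping executors is how the same code scales from a laptop to the cloud.
dispatch_id = ct.dispatch(workflow)(8)
result = ct.get_result(dispatch_id, wait=True)
print(result.result)
```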

See poster here.

Making the Entertainment Industry Net-Zero | The Green Shot

Abstract

The Green Shot is a climate accounting and reduction platform for the entertainment industry. Our real-time dashboards connect every climate decision to cost savings, and our data-powered recommendation system empowers every team member to drive the impact journey.

See the poster here

No Code - No Worries: Efficient, Scalable & Secure Custom Applications | Nectry

Nectry is a no-code application development platform for creating enterprise-class applications. Based on formal methods and advanced functional programming, with an LLM user interface, the platform empowers organizations to build scalable and secure enterprise-class applications at unprecedented scale and speed. Nectry can be used for a wide range of applications, from simple to highly complex, including business process automation, reporting and analysis, customer and employee portals, automated data flow, and systems and network monitoring.

See the poster here.

Arête Product Demo | Arête

Arête is revolutionizing the user experience in the fashion e-commerce sector by using computer vision to move away from search-based shopping.

Abstract

By harnessing the power of AI, Arête is designing a hyper-personalized styling and shopping iOS app. Arête caters to the unique needs of each individual user, curating outfits based on a user's location, occasion, vibe, and budget. The AI model learns a user's body shape, complexion, and stylistic preferences, enabling it to curate the perfect outfits using (i) the clothing in a user's closet and (ii) net-new items available for purchase on our platform. For these recommendations, Arête works with small and medium-sized brands, creating an exciting discovery process for shoppers and connecting brands with the shoppers most excited about their products.

See poster here

Harnessing AI & Neuroscience to Create a Digital School Counselor | Edifii

Edifii is a personalized digital school counselor, delivered as a mobile/web app, that empowers both students and counselors. Classical and generative AI systems customized for high school counseling work in tandem with a streamlined ‘discovery process’ to learn about a student’s particular background and offer personalized planning guidance, or ‘path charting’, via a chatbot interface. Edifii is an always-available and well-informed counselor, ready to help with step-by-step guidance towards career and college goals. It also provides valuable information to counselors to make even short interactions with students as effective as possible. Edifii is available to schools via an annual subscription.

See poster here

First-principles Computation for Trustworthy Generative AI | Salieri AI

Salieri AI builds efficient and transparent AI systems that solve complicated reasoning tasks with structured data, knowledge, and physics.

Abstract

The core of trustworthy AI is grounded knowledge and verifiable reasoning. We build trustworthy AI systems that (1) follow an explicit grounding-planning-reasoning pipeline for transparency and reliability, and (2) combine autoregressive generation with first-principles reasoning engines. With our program-based scaffolding, we have made breakthroughs on a wide range of information management and knowledge reasoning tasks, with significant improvements in accuracy, transparency, and efficiency.

See the poster here.

dsideAi

dsideAi is an R&D start-up with 20 innovative patents granted and pending that aims to revolutionize internet shopping by quickly finding the exact product that fits an individual’s precisely chosen criteria. Using AI and proprietary algorithms, it delivers the long-sought benefit of reducing a million search results to one result the first time, with no security or ad issues, and has doubled click-through rates and revenues in tests.

See poster here