computer vision
July 6-10, 2020

Registration Deadline: June 22, 2020. This course covers the latest developments in vision AI, with a sharp focus on advanced deep learning methods, specifically convolutional neural networks, that enable smart vision systems to recognize, reason, interpret and react to images with improved precision.

Member DiscountAlliances members are eligible for a discount for this program. Please log in to view discount instructions.
Category
Graphic & Vision
Language
Python

GAN Dissection is a way to inspect the internal representations of a generative adversarial network (GAN) to understand how internal units align with human-interpretable concepts. It is part of NetDissect.

MIT License
Last Updated
Category
Graphic & Vision
Language
Python

Given a grayscale photograph as input, this paper attacks the problem of hallucinating a plausible color version of the photograph. This problem is clearly underconstrained, so previous approaches have either relied on significant user interaction or resulted in desaturated colorizations. We propose a fully automatic approach that produces vibrant and realistic colorizations. We embrace the underlying uncertainty of the problem by posing it as a classification task and use class-rebalancing at training time to increase the diversity of colors in the result.

BSD License
Last Updated
Category
Graphic & Vision
Language
JSON

Lyra is an interactive environment that enables custom visualization design without writing any code. Graphical “marks” can be bound to data fields using property drop zones; dynamically positioned using connectors; and directly moved, rotated, and resized using handles. Lyra also provides a data pipeline interface for iterative visual specification of data transformations and layout algorithms. Lyra is more expressive than interactive systems like Tableau, allowing designers to create custom visualizations comparable to hand-coded visualizations built with D3 or Processing.

University of Washington Interactive Data Lab
Last Updated
Category
Graphic & Vision
Language
JSON

Vega-Lite is a high-level grammar of interactive graphics. It provides a concise JSON syntax for rapidly generating visualizations to support analysis. Vega-Lite specifications can be compiled to Vega specifications.

University of Washington Interactive Data Lab
Last Updated
Image
new vision model
MIT news article

“What we were trying to do in this work is to explain how perception can be so much richer than just attaching semantic labels on parts of an image, and to explore the question of how do we see all of the physical world,” says Josh Tenenbaum, a professor of computational cognitive science and a member of MIT’s Computer Science and Artificial Intelligence Laboratory (CSAIL) and the Center for Brains, Minds, and Machines (CBMM).