Michael Chang - Ph.D. Research

Overview

Our neural networks today are strikingly similar to what computing was like a hundred years ago. Back then we designed specialized electronic circuits for each different task; in the past decade we have been training specialized neural circuits for each different task. In the 1940s we developed the von Neumann architecture for computing; now our retrieval augmented transformers are doing essentially the same thing. Just as software abstractions were key to scaling our electronic circuits to the modern software stack, I believe that to shift artificial intelligence research from building learning circuits to building learning software, we also need to invent the analog of software abstractions for neural networks.

My research is on what I call neural software abstractions: understanding the principles that make abstractions in traditional software powerful, and translating these principles into deep learning algorithms to enable neural learners to construct their own abstractions for modeling and manipulating systems. Examples include:

How digital abstraction, or the discretization of continuous representations, can be implemented via the principle of error correction.
How variable abstraction, or the construction of placeholders for datapoints, can be implemented via the principle of symmetry.
How function abstraction, or the representation of reusable computations, can be implemented via the principle of independence.
How problem abstraction, or the coordination of local modules for exhibiting global behaviors, can be implemented via the principle of competition.

To build learning algorithms that automatically model and manipulate systems, much of my research has focused on the unsupervised learning of representations of objects and their relations, because studying how to model and manipulate physical systems confers various advantages:

Objects and relations are intuitive abstractions that are present in everyday experience.
A general-purpose solution for solving physical tasks would provide much economic and societal value.
Literature in cognitive science studying how infants acquire physical common sense can offer inspiration for developing agents with similar capabilities.

News

June 2023: I began working at Google DeepMind as a Research Scientist.
April 2022: Oral Presentation - Our paper Object Representations as Fixed Points: Training Iterative Inference Algorithms with Implicit Differentiation was selected for a oral presentation at the ICLR 2022 Workshop on Elements of Reasoning: Objects, Structure, and Causality.
July 2021: Oral Presentation - Our paper Modularity in Reinforcement Learning via Algorithmic Independence in Credit Assignment was selected for a long oral presentation at ICML 2021.
December 2020: Workshop - I co-organized a NeurIPS 2020 workshop on Object Representations for Learning and Reasoning.
June 2019: Workshop - I co-organized an ICML 2019 workshop on Generative Modeling and Model-Based Reasoning for Robotics and AI.
May 2018: Press Article - "A Compositional Object-Based Approach to Learning Physical Dynamics" featured in Science Magazine (accompany video | featured segment).
April 2018: Press Article - "Relational Neural Expectation Maximization: Unsupervised Discovery of Objects and their Interactions" featured in an NVIDIA blog post.
December 2017: Award - Our NIPS workshop paper on Relational Neural Expectation Maximization received the Outstanding Paper Award sponsored by Oculus.
March 2015: Press Article - Finger-Mounted Reading Device for the Blind, with Roy Shilkrot and Marcelo Polanco.

Talks

Dissertation Talk: "Neural Software Abstractions: Learning to Automatically Model and Manipulate Systems." [video]
April 2022: Generally Intelligent. "Object Representations as Fixed Points: Training Iterative Inference Algorithms with Implicit Differentiation." [video]
July 2021: International Conference on Machine Learning (2021). "Modularity in Reinforcement Learning via Algorithmic Independence in Credit Assignment." [slides] [video]
July 2020: Berkeley Multi-Agent Reinforcement Learning Seminar. "Decentralized Reinforcement Learning: Global Decision-Making via Local Economic Transactions." [slides]
February 2020: MIT CoCoSci, Improbable AI Lab, Learning and Intelligent Systems Group. "Entity Abstraction in Visual Model-Based Reinforcement Learning." [slides]
October 2019: Stanford Computation & Cognition Lab. "Automatically Composing Representation Transformations as a Means for Generalization." [slides]
August 2019: Google Brain, Mountain View. "Automatically Composing Represenation Transformations as a Means for Generalization." [slides]
November 2018: MIT CoCoSci. "Leveraging Compositional Inductive Biases to Help Deep Learning Methods Extrapolate"
April 2018: Microsoft Research, Redmond. "Unsupervised Discovery of Objects and the Interactions"
May 2017: Montreal Institute for Learning Algorithms. "Learning Visual and Physical Models of the Environment"
March 2017: OpenAI. "A Compositional Object-Based Approach to Learning Physical Dynamics"
February 2017: Harvard NLP. "Learning Visual and Physical Models of the Environment"
January 2017: Google, Cambridge. "Learning Visual and Physical Models of the Environment"
April 2016: MIT EECScon. "Understanding Visual Concepts with Continuation Learning"

Teaching

CS188: Introduction to Artificial Intelligence - Spring 2019: Graduate Student Instructor
CS294-112: Deep Reinforcement Learning - Fall 2018: Graduate Student Instructor

Publications

The list below highlights my publications during and before my Ph.D. For an updated list of my most recent publications, please see my Google Scholar.

	Object Representations as Fixed Points: Training Iterative Inference Algorithms with Implicit Differentiation Michael Chang, Thomas Griffiths, Sergey Levine Proceedings of the Thirty-sixth Conference on Neural Information Processing Systems (NeurIPS), 2022 project webpage / talk Also in: ICLR Workshop on the Elements of Reasoning: Objects, Structure, and Causality, 2019 (Oral Presentation) Deep generative models, particularly those that aim to factorize the observations into discrete entities (such as objects), must often use iterative inference procedures that break symmetries among equally plausible explanations for the data. Such inference procedures include variants of the expectation-maximization algorithm and structurally resemble clustering algorithms in a latent space. However, combining such methods with deep neural networks necessitates differentiating through the inference process, which can make optimization exceptionally challenging. We observe that such iterative amortized inference methods can be made differentiable by means of the implicit function theorem, and develop an implicit differentiation approach that improves the stability and tractability of training such models by decoupling the forward and backward passes. This connection enables us to apply recent advances in optimizing implicit layers to not only improve the stability and optimization of the slot attention module in SLATE, a state-of-the-art method for learning entity representations, but do so with constant space and time complexity in backpropagation and only one additional line of code.
	Modularity in Reinforcement Learning via Algorithmic Independence in Credit Assignment Michael Chang, Sid Kaushik, Sergey Levine, Thomas Griffiths Proceedings of the Thirty-eighth International Conference on Machine Learning (ICML), 2021 Long Presentation (166 out of 5513) project webpage / talk / slides Many transfer problems require re-using previously optimal decisions for solving new tasks, which suggests the need for learning algorithms that can modify the mechanisms for choosing certain actions independently of those for choosing others. However, there is currently no formalism nor theory for how to achieve this kind of modular credit assignment. To answer this question, we define modular credit assignment as a constraint on minimizing the algorithmic mutual information among feedback signals for different decisions. We introduce what we call the modularity criterion for testing whether a learning algorithm satisfies this constraint by performing causal analysis on the algorithm itself. We generalize the recently proposed societal decision-making framework as a more granular formalism than the Markov decision process to prove that for decision sequences that do not contain cycles, certain single-step temporal difference action-value methods meet this criterion while all policy-gradient methods do not. Empirical evidence suggests that such action-value methods are more sample efficient than policy-gradient methods on transfer problems that require only sparse changes to a sequence of previously optimal decisions.
	Decentralized Reinforcement Learning: Global Decision-Making via Local Economic Transactions Michael Chang, Sid Kaushik, S. Matthew Weinberg, Thomas Griffiths, Sergey Levine Proceedings of the Thirty-seventh International Conference on Machine Learning (ICML), 2020 project webpage / ICML talk / slides / blog post / code We develop the societal decision-making framework in which a society of primitive agents buy and sell to each other the right to operate on the environment state in a series of auctions. We prove that the Vickrey auction mechanism can be adapted to incentive the society to collectively solve MDPs as an emergent consequence of the primitive agents optimizing their own auction utilities. We propose a class of decentralized reinforcement learning algorithms for training the society that uses credit assignment that is local in space and time. The societal decision-making framework and decentralized reinforcement learning algorithms can be applied not only to standard reinforcement learning, but also for selecting options in semi-MDPs and dynamically composing computation graphs. We find evidence that suggests the potential advantages of a society’s inherent modular structure for more efficient transfer learning.
	Entity Abstraction in Visual Model-Based Reinforcement Learning Rishi Veerapaneni, John D. Co-Reyes, Michael Chang, Michael Janner, Chelsea Finn, Jiajun Wu, Joshua B. Tenenbaum, Sergey Levine Proceedings of the Conference on Robot Learning (CORL), 2019 project webpage / code / environment / slides Also in: NeurIPS workshop on Perception as Generative Reasoning, 2019 (Spotlight Talk)* NeurIPS workshop on Deep Reinforcement Learning, 2019 NeurIPS workshop on Shared Visual Representations in Human & Machine Intelligence, 2019 ICML workshop on Generative Modeling and Model-Based Reasoning for Robotics and AI, 2019 We present object-centric perception, prediction, and planning (OP3), which to the best of our knowledge is the first entity-centric dynamic latent variable framework for model-based reinforcement learning that acquires entity representations from raw visual observations without supervision and uses them to predict and plan. OP3 enforces entity-abstraction -- symmetric processing of each entity representation with the same locally-scoped function -- which enables it to scale to model different numbers and configurations of objects from those in training. Our approach to solving the key technical challenge of grounding these entity representations to actual objects in the environment is to frame this variable binding problem as an inference problem, and we developing an interactive inference algorithm that uses temporal continuity and interactive feedback to bind information about object properties to the entity variables.
	MCP: Learning Composable Hierarchical Control with Multiplicative Compositional Policies Jason Peng, Michael Chang, Grace Zhang, Pieter Abbeel, Sergey Levine Proceedings of the Thirty-third Conference on Neural Information Processing Systems, 2019 project webpage We propose multiplicative compositional policies (MCP), a method for learning reusable motor skills that can be composed to produce a range of complex behaviors. Our method factorizes an agent's skills into a collection of primitives, where multiple primitives can be activated simultaneously via multiplicative composition. This flexibility allows the primitives to be transferred and recombined to elicit new behaviors as necessary for novel tasks. We demonstrate that MCP is able to extract composable skills for highly complex simulated characters from pre-training tasks, such as motion imitation, and then reuse these skills to solve challenging continuous control tasks, such as dribbling a soccer ball to a goal, and picking up an object and transporting it to a target location.
	Doing more with less: meta-reasoning and meta-learning in humans and machines Thomas Griffiths, Frederick Callaway, Michael Chang, Erin Grant, Paul M. Krueger, Falk Lieder Current Opinion in Behavioral Sciences, Volume 29, 2019 Artificial intelligence systems use an increasing amount of computation and data to solve very specific problems. By contrast, human minds solve a wide range of problems using a fixed amount of computation and limited experience. We identify two abilities that we see as crucial to this kind of general intelligence: meta-reasoning (deciding how to allocate computational resources) and meta-learning (modeling the learning environment to make better use of limited data). We summarize the relevant AI literature and relate the resulting ideas to recent work in psychology.
	Automatically Composing Representation Transformations as a Means for Generalization Michael Chang, Abhishek Gupta, Sergey Levine, Thomas Griffiths Proceedings of the International Conference on Learning Representations (ICLR) , 2019 project webpage / code / poster / slides This paper connects and synthesizes ideas from reformulation, metareasoning, program induction, hierarchical reinforcement learning, and self-organizing neural networks. The key perspective of this paper is to recast the problem of generalization to a problem of learning algorithmic procedures over representation transformations: discovering the structure of a family of problems amounts to learning a set of reusable primitive transformations and their means of composition. Our formulation enables the learner to learn the structure and parameters of its own computation graph with sparse supervision, make analogies between problems by transforming one problem representation to another, and exploit modularity and reuse to scale to problems of varying complexity.
	Representational Efficiency Outweighs Action Efficiency in Human Program Induction Sophia Sanborn, David Bourgin, Michael Chang, Thomas Griffiths Proceedings of the 40th Annual Conference of the Cognitive Science Society, 2018 This paper introduces Lightbot, a problem-solving domain that explores the link between problem solving and program induction. This paper departs from work in hierarchical learning that hypothesize that hierarchies accelerates the discovery of shortest-path solutions to a problem by segmenting the solution into subgoals. Instead, we investigate a setting in which the hierarchical solutions that humans discover minimize the complexity of the underlying program that generated the solution rather than minimize the length of the solution itself.
	Relational Neural Expectation Maximization: Unsupervised Discovery of Objects and their Interactions Sjoerd van Steenkiste, Michael Chang, Klaus Greff, Jürgen Schmidhuber Proceedings of the International Conference on Learning Representations (ICLR), 2018 Press: NVIDIA article project webpage / code We present a novel method that learns to discover objects and model their physical interactions from raw visual images in a purely unsupervised fashion. It incorporates prior knowledge about the compositional nature of human perception to factor interactions between object-pairs and learn efficiently. On videos of bouncing balls we show the superior modeling capabilities of our method compared to other unsupervised neural approaches that do not incorporate such prior knowledge.
	Relational Neural Expectation Maximization Sjoerd van Steenkiste, Michael Chang, Klaus Greff, Jürgen Schmidhuber NIPS workshop on Cognitively Informed Artificial Intelligence, 2017 Oral Presentation, Oculus Outstanding Paper Award We propose a novel approach to common-sense physical reasoning that learns physical interactions between objects from raw visual images in a purely unsupervised fashion. Our method incorporates prior knowledge about the compositional nature of human perception, enabling it to discover objects, factor interactions between object-pairs to learn efficiently, and generalize to new environments without re-training.
	A Compositional Object-Based Approach to Learning Physical Dynamics Michael B. Chang, Tomer D. Ullman , Antonio Torralba, Joshua B. Tenenbaum Proceedings of the International Conference on Learning Representations (ICLR), 2017 Press: Science Magazine article (accompany video \| featured segment) project webpage / code / poster / spotlight talk (NIPS Intuitive Physics Workshop) The Neural Physics Engine (NPE) frames learning a simulator of intuitive physics as learning a compositional program over objects and interactions. This allows the NPE to naturally generalize across variable object count and different scene configurations.
	Understanding Visual Concepts with Continuation Learning William F. Whitney, Michael B. Chang, Tejas D. Kulkarni, Joshua B. Tenenbaum International Conference on Learning Representations (ICLR) workshop, 2016 project webpage / code This paper presents an unsupervised approach to learning factorized symbolic representations of high-level visual concepts by exploiting temporal continuity in the scene.