Publications by Year

Alternately see my publications by topic.

2023

Can Large Language Models Reason about Program Invariants?. Kexin Pei, David Bieber, Kensen Shi, Charles Sutton and Pengcheng Yin. In International Conference on Machine Learning. 2023.

[ .pdf | bib | abstract ]

Identifying invariants is an important program analysis task with applications towards program understanding, bug finding, vulnerability analysis, and formal verification. Existing tools for identifying program invariants rely on dynamic analysis, requiring traces collected from multiple executions in order to produce reliable invariants. We study the application of large language models to invariant prediction, finding that models trained on source code and fine-tuned for invariant generation can perform invariant prediction as static rather than dynamic analysis. Using a scratchpad approach where invariants are predicted sequentially through a program gives the best performance, finding invariants statically of quality comparable to those obtained by a dynamic analysis tool with access to five program traces.
```
@inproceedings{pei23invariants,
  author = {Pei, Kexin and Bieber, David and Shi, Kensen and Sutton, Charles and Yin, Pengcheng},
  booktitle = {International Conference on Machine Learning},
  month = {jun},
  title = {Can Large Language Models Reason about Program Invariants?},
  year = {2023}
}
```
Any-scale Balanced Samplers for Discrete Space. Haoran Sun, Bo Dai, Charles Sutton, Dale Schuurmans and Hanjun Dai. In International Conference on Learning Representations. 2023.

[ .pdf | bib | abstract ]

The locally balanced informed proposal has proved to be highly effective for sampling from discrete spaces. However, its success relies on the “local” factor, which ensures that whenever the proposal distribution is restricted to be near the current state, the locally balanced weight functions are asymptotically optimal and the gradient approximations are accurate. In seeking a more efficient sampling algorithm, many recent works have considered increasing the scale of the proposal distributions, but this causes the ”local” factor to no longer hold. Instead, we propose any-scale balanced samplers to repair the gap in non-local proposals. In particular, we substitute the locally balanced function with an any-scale balanced function that can self-adjust to achieve better efficiency for proposal distributions at any scale. We also use quadratic approximations to capture curvature of the target distribution and reduce the error in the gradient approximation, while employing a Gaussian integral trick with a special estimated diagonal to efficiently sample from the quadratic proposal distribution. On various synthetic and real distributions, the proposed sampler substantially outperforms existing approaches.
```
@inproceedings{sun23anyscale,
  author = {Sun, Haoran and Dai, Bo and Sutton, Charles and Schuurmans, Dale and Dai, Hanjun},
  booktitle = {International Conference on Learning Representations},
  month = {sep},
  title = {Any-scale Balanced Samplers for Discrete Space},
  year = {2023}
}
```
Natural Language to Code Generation in Interactive Data Science Notebooks. Pengcheng Yin, Wen-Ding Li, Kefan Xiao, Abhishek Rao, Yeming Wen, Kensen Shi, Joshua Howland, Paige Bailey, Michele Catasta, Henryk Michalewski, Alex Polozov and Charles Sutton. In Proceedings of the Association of Computational Linguistics (ACL). 2023.

[ arXiv | bib | abstract | source code ]

Computational notebooks, such as Jupyter notebooks, are interactive computing environments that are ubiquitous among data scientists to perform data wrangling and analytic tasks. To measure the performance of AI pair programmers that automatically synthesize programs for those tasks given natural language (NL) intents from users, we build ARCADE, a benchmark of 1082 code generation problems using the pandas data analysis framework in data science notebooks. ARCADE features multiple rounds of NL-to-code problems from the same notebook. It requires a model to understand rich multi-modal contexts, such as existing notebook cells and their execution states as well as previous turns of interaction. To establish a strong baseline on this challenging task, we develop PaChiNCo, a 62B code language model (LM) for Python computational notebooks, which significantly outperforms public code LMs. Finally, we explore few-shot prompting strategies to elicit better code with step-by-step decomposition and NL explanation, showing the potential to improve the diversity and explainability of model predictions.
```
@inproceedings{yin23arcade,
  author = {Yin, Pengcheng and Li, Wen-Ding and Xiao, Kefan and Rao, Abhishek and Wen, Yeming and Shi, Kensen and Howland, Joshua and Bailey, Paige and Catasta, Michele and Michalewski, Henryk and Polozov, Alex and Sutton, Charles},
  booktitle = {Proceedings of the Association of Computational Linguistics (ACL)},
  title = {Natural Language to Code Generation in Interactive Data Science Notebooks},
  year = {2023}
}
```

2022

PaLM: Scaling Language Modeling with Pathways. Aakanksha Chowdhery, Sharan Narang, Jacob Devlin, Maarten Bosma, Gaurav Mishra, Adam Roberts, Paul Barham, Hyung Won Chung, Charles Sutton, Sebastian Gehrmann, Parker Schuh, Kensen Shi, Sasha Tsvyashchenko, Joshua Maynez, Abhishek Rao, Parker Barnes, Yi Tay, Noam Shazeer, Vinodkumar Prabhakaran, Emily Reif, Nan Du, Ben Hutchinson, Reiner Pope, James Bradbury, Jacob Austin, Michael Isard, Guy Gur-Ari, Pengcheng Yin, Toju Duke, Anselm Levskaya, Sanjay Ghemawat, Sunipa Dev, Henryk Michalewski, Xavier Garcia, Vedant Misra, Kevin Robinson, Liam Fedus, Denny Zhou, Daphne Ippolito, David Luan, Hyeontaek Lim, Barret Zoph, Alexander Spiridonov, Ryan Sepassi, David Dohan, Shivani Agrawal, Mark Omernick, Andrew M. Dai, Thanumalayan Sankaranarayana Pillai, Marie Pellat, Aitor Lewkowycz, Erica Moreira, Rewon Child, Oleksandr Polozov, Katherine Lee, Zongwei Zhou, Xuezhi Wang, Brennan Saeta, Mark Diaz, Orhan Firat, Michele Catasta, Jason Wei, Kathy Meier-Hellstern, Douglas Eck, Jeff Dean, Slav Petrov and Noah Fiedel. arXiv:2204.02311. 2022.

[ arXiv | bib ]

@misc{palm,
  author = {Chowdhery, Aakanksha and Narang, Sharan and Devlin, Jacob and Bosma, Maarten and Mishra, Gaurav and Roberts, Adam and Barham, Paul and Chung, Hyung Won and Sutton, Charles and Gehrmann, Sebastian and Schuh, Parker and Shi, Kensen and Tsvyashchenko, Sasha and Maynez, Joshua and Rao, Abhishek and Barnes, Parker and Tay, Yi and Shazeer, Noam and Prabhakaran, Vinodkumar and Reif, Emily and Du, Nan and Hutchinson, Ben and Pope, Reiner and Bradbury, James and Austin, Jacob and Isard, Michael and Gur-Ari, Guy and Yin, Pengcheng and Duke, Toju and Levskaya, Anselm and Ghemawat, Sanjay and Dev, Sunipa and Michalewski, Henryk and Garcia, Xavier and Misra, Vedant and Robinson, Kevin and Fedus, Liam and Zhou, Denny and Ippolito, Daphne and Luan, David and Lim, Hyeontaek and Zoph, Barret and Spiridonov, Alexander and Sepassi, Ryan and Dohan, David and Agrawal, Shivani and Omernick, Mark and Dai, Andrew M. and Pillai, Thanumalayan Sankaranarayana and Pellat, Marie and Lewkowycz, Aitor and Moreira, Erica and Child, Rewon and Polozov, Oleksandr and Lee, Katherine and Zhou, Zongwei and Wang, Xuezhi and Saeta, Brennan and Diaz, Mark and Firat, Orhan and Catasta, Michele and Wei, Jason and Meier-Hellstern, Kathy and Eck, Douglas and Dean, Jeff and Petrov, Slav and Fiedel, Noah},
  publisher = {arXiv},
  title = {PaLM: Scaling Language Modeling with Pathways},
  year = {2022}
}

CrossBeam: Learning to Search in Bottom-Up Program Synthesis. Kensen Shi, Hanjun Dai, Kevin Ellis and Charles Sutton. In International Conference on Learning Representations (ICLR). 2022.

[ arXiv | bib ]

@inproceedings{shi2022-wd,
  author = {Shi, Kensen and Dai, Hanjun and Ellis, Kevin and Sutton, Charles},
  booktitle = {International Conference on Learning Representations (ICLR)},
  title = {CrossBeam: Learning to Search in Bottom-Up Program Synthesis},
  year = {2022}
}

Compositional generalization and decomposition in neural program synthesis. Kensen Shi, Joey Hong, Manzil Zaheer, Pengcheng Yin and Charles Sutton. In ICLR Workshop on Deep Learning for Code (DL4C). 2022.

[ arXiv | bib | abstract ]

When writing programs, people have the ability to tackle a new complex task by decomposing it into smaller and more familiar subtasks. While it is difficult to measure whether neural program synthesis methods have similar capabilities, what we can measure is whether they compositionally generalize, that is, whether a model that has been trained on the simpler subtasks is subsequently able to solve more complex tasks. In this paper, we focus on measuring the ability of learned program synthesizers to compositionally generalize. We first characterize several different axes along which program synthesis methods would be desired to generalize, e.g., length generalization, or the ability to combine known subroutines in new ways that do not occur in the training data. Based on this characterization, we introduce a benchmark suite of tasks to assess these abilities based on two popular existing datasets, SCAN and RobustFill. Finally, we make first attempts to improve the compositional generalization ability of Transformer models along these axes through novel attention mechanisms that draw inspiration from a human-like decomposition strategy. Empirically, we find our modified Transformer models generally perform better than natural baselines, but the tasks remain challenging.
```
@inproceedings{shi2022composition,
  author = {Shi, Kensen and Hong, Joey and Zaheer, Manzil and Yin, Pengcheng and Sutton, Charles},
  booktitle = {ICLR Workshop on Deep Learning for Code (DL4C)},
  title = {Compositional generalization and decomposition in neural program synthesis},
  year = {2022}
}
```

2021

SpreadsheetCoder: Formula Prediction from Semi-structured Context. Xinyun Chen, Petros Maniatis, Rishabh Singh, Charles Sutton, Hanjun Dai, Max Lin and Denny Zhou. In International Conference in Machine Learning (ICML). 2021.

[ to appear | bib ]

@inproceedings{chen21spreadsheetcoder,
  author = {Chen, Xinyun and Maniatis, Petros and Singh, Rishabh and Sutton, Charles and Dai, Hanjun and Lin, Max and Zhou, Denny},
  booktitle = {International Conference in Machine Learning (ICML)},
  title = {SpreadsheetCoder: Formula Prediction from Semi-structured Context},
  year = {2021}
}

Latent Programmer: Discrete Latent Codes for Program Synthesis. Joey Hong, David Dohan, Rishabh Singh, Charles Sutton and Manzil Zaheer. In International Conference in Machine Learning (ICML). 2021.

[ to appear | bib ]

@inproceedings{hong21latent,
  author = {Hong, Joey and Dohan, David and Singh, Rishabh and Sutton, Charles and Zaheer, Manzil},
  booktitle = {International Conference in Machine Learning (ICML)},
  title = {Latent Programmer: Discrete Latent Codes for Program Synthesis},
  year = {2021}
}

BUSTLE: Bottom-Up Program Synthesis Through Learning-Guided Exploration. Augustus Odena, Kensen Shi, David Bieber, Rishabh Singh, Charles Sutton and Hanjun Dai. In International Conference on Learning Representations. 2021.

Publications by Year

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

2007

2006

2005

2004

2003

2002