Publications

Last Updated: August 2024.
An up to date list of all publications can be found on my Google Scholar profile.

Filter: Selected Papers Controllability, Domain Shift, Grounding, Synthetic Data

2024

A Little Human Data Goes a Long Way

Dhananjay Ashok, Jonathan May

ACL 2025

TLDR: Performance declines associated with replacing human generated data with synthetic data is most chronic only after crossing 90% replacement.

Selected Papers Grounding, Synthetic Data

Controllable Text Generation in the Instruction Tuning Era

Dhananjay Ashok, Barnabas Poczos

TLDR: We show that prior methods for controlling text generation of base Language Models perform worse than Instruction-Tuning. We also release ConGenBench, a testbed of more difficult controllable text generation problems.

Controllability

2023

SciFix: Outperforming GPT3 on Scientific Factual Error Correction

Dhananjay Ashok, Atharva Kulkarni, Hai Pham, Barnabas Poczos

EMNLP 2023

TLDR: We combine synthetic data generation and score guided decoding to outperform GPT3 on Scientific Factual Error Correction.

Selected Papers Grounding, Synthetic Data

FeedbackLogs: Recording and Incorporating Stakeholder Feedback into Machine Learning Pipelines

Matthew Barker, Emma Kallina, Dhananjay Ashok, Katherine Collins, Ashley Casovan, Adrian Weller, Ameet Talwalkar, Valerie Chen, Umang Bhatt

ACM EAAMO 2023

TLDR: Introduces FeedbackLogs, an addenda to existing documentation of ML pipelines that tracks the input of multiple stakeholders.

PromptNER: Prompting For FewShot Named Entity Recognition

Dhananjay Ashok, Zachary Chase Lipton

TLDR: We set the state-of-the-art in several FewShot and CrossDomain NER benchmarks with a Prompting approach.

Selected Papers Controllability, Domain Shift,

2022

A Solver+ Gradient Descent Training Algorithm for Deep Neural Networks

Dhananjay Ashok, Vineel Nagisetty, Christopher Srinivasa, Vijay Ganesh

IJCAI 2022

TLDR: Using Mixed Integer Linear Programming Solvers to fine-tune Neural Networks and escape local minima during optimization.

2021

Logic guided genetic algorithms

Dhananjay Ashok, Joseph Scott, Sebastian J Wetzel, Maysum Panju, Vijay Ganesh

AAAI 2021

TLDR: A data augmentation approach that uses prior knowledge to accelerate equation discovery.

Controllability, Grounding