Publications
Last Updated: August 2024.
An up to date list of all publications can be found on my Google Scholar profile.
2024
Dhananjay Ashok, Jonathan May
TLDR: Performance declines associated with replacing human generated data with synthetic data is most chronic only after crossing 90% replacement.
Selected Papers Grounding, Synthetic DataDhananjay Ashok, Barnabas Poczos
TLDR: We show that prior methods for controlling text generation of base Language Models perform worse than Instruction-Tuning. We also release ConGenBench, a testbed of more difficult controllable text generation problems.
Controllability2023
Dhananjay Ashok, Atharva Kulkarni, Hai Pham, Barnabas Poczos
EMNLP 2023TLDR: We combine synthetic data generation and score guided decoding to outperform GPT3 on Scientific Factual Error Correction.
Selected Papers Grounding, Synthetic DataMatthew Barker, Emma Kallina, Dhananjay Ashok, Katherine Collins, Ashley Casovan, Adrian Weller, Ameet Talwalkar, Valerie Chen, Umang Bhatt
ACM EAAMO 2023TLDR: Introduces FeedbackLogs, an addenda to existing documentation of ML pipelines that tracks the input of multiple stakeholders.
Dhananjay Ashok, Zachary Chase Lipton
TLDR: We set the state-of-the-art in several FewShot and CrossDomain NER benchmarks with a Prompting approach.
Selected Papers Controllability, Domain Shift,2022
2021
Dhananjay Ashok, Joseph Scott, Sebastian J Wetzel, Maysum Panju, Vijay Ganesh
AAAI 2021TLDR: A data augmentation approach that uses prior knowledge to accelerate equation discovery.
Controllability, Grounding