Papers
The following are the latest papers (sorted by release date) on prompt engineering for large language models (LLMs). We update the list of papers on a daily/weekly basis.
Overviews
- Few-shot Fine-tuning vs. In-context Learning: A Fair Comparison and Evaluation (May 2023)
- Jailbreaking ChatGPT via Prompt Engineering: An Empirical Study (May 2023)
- Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond (April 2023)
- Tool Learning with Foundation Models (April 2023)
- One Small Step for Generative AI, One Giant Leap for AGI: A Complete Survey on ChatGPT in AIGC Era (April 2023)
- A Bibliometric Review of Large Language Models Research from 2017 to 2023 (April 2023)
- A Survey of Large Language Models (April 2023)
- Natural Language Reasoning, A Survey (Mar 2023)
- Augmented Language Models: a Survey (Feb 2023)
- A Survey for In-context Learning (Dec 2022)
- Towards Reasoning in Large Language Models: A Survey (Dec 2022)
- Reasoning with Language Model Prompting: A Survey (Dec 2022)
- Emergent Abilities of Large Language Models (Jun 2022)
- A Taxonomy of Prompt Modifiers for Text-To-Image Generation (Apr 2022)
- Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language Processing (Jul 2021)
 
Approaches
- Focused Prefix Tuning for Controllable Text Generation (June 2023)
- Exploring Lottery Prompts for Pre-trained Language Models (May 2023)
- Less Likely Brainstorming: Using Language Models to Generate Alternative Hypotheses (May 2023)
- Let's Verify Step by Step (May 2023)
- Universality and Limitations of Prompt Tuning (May 2023)
- MultiTool-CoT: GPT-3 Can Use Multiple External Tools with Chain of Thought Prompting (May 2023)
- PEARL: Prompting Large Language Models to Plan and Execute Actions Over Long Documents (May 2023)
- Reasoning with Language Model is Planning with World Model (May 2023)
- Self-Critique Prompting with Large Language Models for Inductive Instructions (May 2023)
- Better Zero-Shot Reasoning with Self-Adaptive Prompting (May 2023)
- Hierarchical Prompting Assists Large Language Model on Web Navigation (May 2023)
- Interactive Natural Language Processing (May 2023)
- Can We Edit Factual Knowledge by In-Context Learning? (May 2023)
- In-Context Learning of Large Language Models Explained as Kernel Regression (May 2023)
- Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning by Large Language Models (May 2023)
- Meta-in-context learning in large language models (May 2023)
- Let's Sample Step by Step: Adaptive-Consistency for Efficient Reasoning with LLMs (May 2023)
- Post Hoc Explanations of Language Models Can Improve Language Models (May 2023)
- Compress, Then Prompt: Improving Accuracy-Efficiency Trade-off of LLM Inference with Transferable Prompt (May 2023)
- TreePrompt: Learning to Compose Tree Prompts for Explainable Visual Grounding (May 2023)
- TELeR: A General Taxonomy of LLM Prompts for Benchmarking Complex Tasks (May 2023)
- Efficient Prompting via Dynamic In-Context Learning (May 2023)
- The Web Can Be Your Oyster for Improving Large Language Models (May 2023)
- Flatness-Aware Prompt Selection Improves Accuracy and Sample Efficiency (May 2023)
- Tree of Thoughts: Deliberate Problem Solving with Large Language Models (May 2023)
- ZeroPrompt: Streaming Acoustic Encoders are Zero-Shot Masked LMs (May 2023)
- Chain-of-Symbol Prompting Elicits Planning in Large Language Models (May 2023)
- CooK: Empowering General-Purpose Language Models with Modular and Collaborative Knowledge (May 2023)
- What In-Context Learning "Learns" In-Context: Disentangling Task Recognition and Task Learning (May 2023)
- Reprompting: Automated Chain-of-Thought Prompt Inference Through Gibbs Sampling (May 2023)
- Satisfiability-Aided Language Models Using Declarative Prompting (May 2023)
- Pre-Training to Learn in Context (May 2023)
- Boosted Prompt Ensembles for Large Language Models (April 2023)
- Global Prompt Cell: A Portable Control Module for Effective Prompt (April 2023)
- Why think step-by-step? Reasoning emerges from the locality of experience (April 2023)
- Revisiting Automated Prompting: Are We Actually Doing Better? (April 2023)
- REFINER: Reasoning Feedback on Intermediate Representations (April 2023)
- Reflexion: an autonomous agent with dynamic memory and self-reflection (March 2023)
- CAMEL: Communicative Agents for "Mind" Exploration of Large Scale Language Model Society (Mar 2023)
- Self-Refine: Iterative Refinement with Self-Feedback (Mar 2023)
- kNN Prompting: Beyond-Context Learning with Calibration-Free Nearest Neighbor Inference (Mar 2023)
- Visual-Language Prompt Tuning with Knowledge-guided Context Optimization (Mar 2023)
- Fairness-guided Few-shot Prompting for Large Language Models (Mar 2023)
- Context-faithful Prompting for Large Language Models (Mar 2023)
- Is Prompt All You Need? No. A Comprehensive and Broader View of Instruction Learning (Mar 2023)
- UPRISE: Universal Prompt Retrieval for Improving Zero-Shot Evaluation (Mar 2023)
- Model-tuning Via Prompts Makes NLP Models Adversarially Robust (Mar 2023)
- Structure Pretraining and Prompt Tuning for Knowledge Graph Transfer (March 2023)
- CoTEVer: Chain of Thought Prompting Annotation Toolkit for Explanation Verification (March 2023)
- Larger language models do in-context learning differently (March 2023)
- OpenICL: An Open-Source Framework for In-context Learning (March 2023)
- Dynamic Prompting: A Unified Framework for Prompt Tuning (March 2023)
- ART: Automatic multi-step reasoning and tool-use for large language models (March 2023)
- Multitask Prompt Tuning Enables Parameter-Efficient Transfer Learning (March 2023)
- Effectiveness of Data Augmentation for Prefix Tuning with Limited Data (March 2023)
- Mixture of Soft Prompts for Controllable Data Generation (March 2023)
- Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners (March 2023)
- How Robust is GPT-3.5 to Predecessors? A Comprehensive Study on Language Understanding Tasks (March 2023)
- Can ChatGPT Understand Too? A Comparative Study on ChatGPT and Fine-tuned BERT (Feb 2023)
- EvoPrompting: Language Models for Code-Level Neural Architecture Search (Feb 2023)
- In-Context Instruction Learning (Feb 2023)
- Chain of Hindsight Aligns Language Models with Feedback (Feb 2023)
- Language Is Not All You Need: Aligning Perception with Language Models (Feb 2023)
- Automatic Prompt Augmentation and Selection with Chain-of-Thought from Labeled Data (Feb 2023)
- Active Prompting with Chain-of-Thought for Large Language Models (Feb 2023)
- More than you've asked for: A Comprehensive Analysis of Novel Prompt Injection Threats to Application-Integrated Large Language Models (Feb 2023)
- A Prompt Pattern Catalog to Enhance Prompt Engineering with ChatGPT (Feb 2023)
- Guiding Large Language Models via Directional Stimulus Prompting (Feb 2023)
- How Does In-Context Learning Help Prompt Tuning? (Feb 2023)
- Scalable Prompt Generation for Semi-supervised Learning with Language Models (Feb 2023)
- Bounding the Capabilities of Large Language Models in Open Text Generation with Prompt Constraints (Feb 2023)
- À-la-carte Prompt Tuning (APT): Combining Distinct Data Via Composable Prompting (Feb 2023)
- GraphPrompt: Unifying Pre-Training and Downstream Tasks for Graph Neural Networks (Feb 2023)
- The Capacity for Moral Self-Correction in Large Language Models (Feb 2023)
- SwitchPrompt: Learning Domain-Specific Gated Soft Prompts for Classification in Low-Resource Domains (Feb 2023)
- Evaluating the Robustness of Discrete Prompts (Feb 2023)
- Compositional Exemplars for In-context Learning (Feb 2023)
- Hard Prompts Made Easy: Gradient-Based Discrete Optimization for Prompt Tuning and Discovery (Feb 2023)
- Multimodal Chain-of-Thought Reasoning in Language Models (Feb 2023)
- Large Language Models Can Be Easily Distracted by Irrelevant Context (Feb 2023)
- Synthetic Prompting: Generating Chain-of-Thought Demonstrations for Large Language Models (Feb 2023)
- Progressive Prompts: Continual Learning for Language Models (Jan 2023)
- Batch Prompting: Efficient Inference with LLM APIs (Jan 2023)
- Demonstrate-Search-Predict: Composing retrieval and language models for knowledge-intensive NLP (Dec 2022)
- On Second Thought, Let's Not Think Step by Step! Bias and Toxicity in Zero-Shot Reasoning (Dec 2022)
- Constitutional AI: Harmlessness from AI Feedback (Dec 2022)
- Successive Prompting for Decomposing Complex Questions (Dec 2022)
- Large Language Models are reasoners with Self-Verification (Dec 2022)
- Discovering Language Model Behaviors with Model-Written Evaluations (Dec 2022)
- Structured Prompting: Scaling In-Context Learning to 1,000 Examples (Dec 2022)
- PAL: Program-aided Language Models (Nov 2022)
- Large Language Models Are Human-Level Prompt Engineers (Nov 2022)
- Ignore Previous Prompt: Attack Techniques For Language Models (Nov 2022)
- Machine Generated Text: A Comprehensive Survey of Threat Models and Detection Methods (Nov 2022)
- Teaching Algorithmic Reasoning via In-context Learning (Nov 2022)
- Enhancing Self-Consistency and Performance of Pre-Trained Language Models through Natural Language Inference (Nov 2022)
- Ask Me Anything: A simple strategy for prompting language models (Oct 2022)
- Recitation-Augmented Language Models (Oct 2022)
- ReAct: Synergizing Reasoning and Acting in Language Models (Oct 2022)
- Prompting GPT-3 To Be Reliable (Oct 2022)
- Decomposed Prompting: A Modular Approach for Solving Complex Tasks (Oct 2022)
- Automatic Chain of Thought Prompting in Large Language Models (Oct 2022)
- Language Models Are Greedy Reasoners: A Systematic Formal Analysis of Chain-of-Thought (Oct 2022)
- Evaluating the Susceptibility of Pre-Trained Language Models via Handcrafted Adversarial Examples (Sep 2022)
- Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning (Sep 2022)
- Promptagator: Few-shot Dense Retrieval From 8 Examples (Sep 2022)
- Atlas: Few-shot Learning with Retrieval Augmented Language Models (Nov 2022)
- DocPrompting: Generating Code by Retrieving the Docs (July 2022)
- On the Advance of Making Language Models Better Reasoners (June 2022)
- Large Language Models are Zero-Shot Reasoners (May 2022)
- Maieutic Prompting: Logically Consistent Reasoning with Recursive Explanations (May 2022)
- MRKL Systems: A modular, neuro-symbolic architecture that combines large language models, external knowledge sources and discrete reasoning (May 2022)
- PPT: Pre-trained Prompt Tuning for Few-shot Learning (May 2022)
- Toxicity Detection with Generative Prompt-based Inference (May 2022)
- Learning to Transfer Prompts for Text Generation (May 2022)
- The Unreliability of Explanations in Few-shot Prompting for Textual Reasoning (May 2022)
- A Taxonomy of Prompt Modifiers for Text-To-Image Generation (Apr 2022)
- PromptChainer: Chaining Large Language Model Prompts through Visual Programming (Mar 2022)
- Self-Consistency Improves Chain of Thought Reasoning in Language Models (March 2022)
- Training language models to follow instructions with human feedback
- Rethinking the Role of Demonstrations: What Makes In-Context Learning Work? (Feb 2022)
- Chain of Thought Prompting Elicits Reasoning in Large Language Models (Jan 2022)
- Show Your Work: Scratchpads for Intermediate Computation with Language Models (Nov 2021)
- AI Chains: Transparent and Controllable Human-AI Interaction by Chaining Large Language Model Prompts (Oct 2021)
- Generated Knowledge Prompting for Commonsense Reasoning (Oct 2021)
- Multitask Prompted Training Enables Zero-Shot Task Generalization (Oct 2021)
- Reframing Instructional Prompts to GPTk's Language (Sep 2021)
- Design Guidelines for Prompt Engineering Text-to-Image Generative Models (Sep 2021)
- Making Pre-trained Language Models Better Few-shot Learners (Aug 2021)
- Fantastically Ordered Prompts and Where to Find Them: Overcoming Few-Shot Prompt Order Sensitivity (April 2021)
- BERTese: Learning to Speak to BERT (April 2021)
- The Power of Scale for Parameter-Efficient Prompt Tuning (April 2021)
- Prompt Programming for Large Language Models: Beyond the Few-Shot Paradigm (Feb 2021)
- Calibrate Before Use: Improving Few-Shot Performance of Language Models (Feb 2021)
- Prefix-Tuning: Optimizing Continuous Prompts for Generation (Jan 2021)
- Learning to Generate Task-Specific Adapters from Task Description (Jan 2021)
- Making Pre-trained Language Models Better Few-shot Learners (Dec 2020)
- Learning from Task Descriptions (Nov 2020)
- AutoPrompt: Eliciting Knowledge from Language Models with Automatically Generated Prompts (Oct 2020)
- Language Models are Few-Shot Learners (May 2020)
- How Can We Know What Language Models Know? (July 2020)
- Scaling Laws for Neural Language Models (Jan 2020)
 
Applications
- Interpretable Math Word Problem Solution Generation Via Step-by-step Planning (June 2023)
- In-Context Learning User Simulators for Task-Oriented Dialog Systems (June 2023)
- SQL-PaLM: Improved Large Language Model Adaptation for Text-to-SQL (June 2023)
- Effective Structured Prompting by Meta-Learning and Representative Verbalizer (June 2023)
- Layout and Task Aware Instruction Prompt for Zero-shot Document Image Question Answering (June 2023)
- Chain-Of-Thought Prompting Under Streaming Batch: A Case Study (June 2023)
- Red Teaming Language Model Detectors with Language Models (May 2023)
- Gorilla: Large Language Model Connected with Massive APIs (May 2023)
- Deliberate then Generate: Enhanced Prompting Framework for Text Generation (May 2023)
- What does the Failure to Reason with "Respectively" in Zero/Few-Shot Settings Tell Us about Language Models? (May 2023)
- ScoNe: Benchmarking Negation Reasoning in Language Models With Fine-Tuning and In-Context Learning (May 2023)
- SheetCopilot: Bringing Software Productivity to the Next Level through Large Language Models (May 2023)
- Grammar Prompting for Domain-Specific Language Generation with Large Language Models (May 2023)
- Mitigating Label Biases for In-context Learning (May 2023)
- Short Answer Grading Using One-shot Prompting and Text Similarity Scoring Model (May 2023)
- Strategic Reasoning with Language Models (May 2023)
- Dissecting Chain-of-Thought: A Study on Compositional In-Context Learning of MLPs (May 2023)
- Marked Personas: Using Natural Language Prompts to Measure Stereotypes in Language Models (May 2023)
- Leveraging Training Data in Few-Shot Prompting for Numerical Reasoning (May 2023)
- Exploring Effectiveness of GPT-3 in Grammatical Error Correction: A Study on Performance and Controllability in Prompt-Based Methods (May 2023)
- NOTABLE: Transferable Backdoor Attacks Against Prompt-based NLP Models (May 2023)
- Tab-CoT: Zero-shot Tabular Chain of Thought (May 2023)
- Evaluating GPT-3 Generated Explanations for Hateful Content Moderation (May 2023)
- Prompt-Guided Retrieval Augmentation for Non-Knowledge-Intensive Tasks (May 2023)
- [Zero- and Few-Shot Event Detection via Prompt-Based Meta Learning](https://arxiv.org/abs/2305.17373) (May 2023)
- Chain-of-Thought Hub: A Continuous Effort to Measure Large Language Models' Reasoning Performance (May 2023)
- Large Language Models Can be Lazy Learners: Analyze Shortcuts in In-Context Learning (May 2023)
- Heterogeneous Value Evaluation for Large Language Models (May 2023)
- PromptNER: Prompt Locating and Typing for Named Entity Recognition (May 2023)
- Small Language Models Improve Giants by Rewriting Their Outputs (May 2023)
- On the Planning Abilities of Large Language Models -- A Critical Investigation (May 2023)
- Beyond Chain-of-Thought, Effective Graph-of-Thought Reasoning in Large Language Models (May 2023)
- PRODIGY: Enabling In-context Learning Over Graphs (May 2023)
- Large Language Models are Few-Shot Health Learners (May 2023)
- Role-Play with Large Language Models (May 2023)
- Measuring Inductive Biases of In-Context Learning with Underspecified Demonstrations (May 2023)
- Fact-Checking Complex Claims with Program-Guided Reasoning (May 2023)
- Large Language Models as Tool Makers (May 2023)
- Iterative Forward Tuning Boosts In-context Learning in Language Models (May 2023)
- SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks (May 2023)
- Interactive Natural Language Processing (May 2023)
- An automatically discovered chain-of-thought prompt generalizes to novel models and datasets (May 2023)
- Large Language Model Guided Tree-of-Thought (May 2023)
- Active Retrieval Augmented Generation (May 2023)
- A PhD Student's Perspective on Research in NLP in the Era of Very Large Language Models (May 2023)
- Visual Chain of Thought: Bridging Logical Gaps with Multimodal Infillings (May 2023)
- Mirages: On Anthropomorphism in Dialogue Systems (May 2023)
- Model evaluation for extreme risks (May 2023)
- Language Models Don't Always Say What They Think: Unfaithful Explanations in Chain-of-Thought Prompting (May 2023)
- Cognitive Reframing of Negative Thoughts through Human-Language Model Interaction (May 2023)
- PromptClass: Weakly-Supervised Text Classification with Prompting Enhanced Noise-Robust Self-Training (May 2023)
- Augmented Large Language Models with Parametric Knowledge Guiding (May 2023)
- Aligning Large Language Models through Synthetic Feedback (May 2023)
- Concept-aware Training Improves In-context Learning Ability of Language Models (May 2023)
- FrugalGPT: How to Use Large Language Models While Reducing Cost and Improving Performance (May 2023)
- Enhancing Black-Box Few-Shot Text Classification with Prompt-Based Data Augmentation (May 2023)
- Detecting automatically the layout of clinical documents to enhance the performances of downstream natural language processing (May 2023)
- "Is the Pope Catholic?" Applying Chain-of-Thought Reasoning to Understanding Conversational Implicatures (May 2023)
- Let's Think Frame by Frame: Evaluating Video Chain of Thought with Video Infilling and Prediction (May 2023)
- Generating Data for Symbolic Language with Large Language Models (May 2023)
- Make a Choice! Knowledge Base Question Answering with In-Context Learning (May 2023)
- Improving Language Models via Plug-and-Play Retrieval Feedback (May 2023)
- Multi-Granularity Prompts for Topic Shift Detection in Dialogue (May 2023)
- The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning (May 2023)
- Can Language Models Understand Physical Concepts? (May 2023)
- Evaluating Factual Consistency of Summaries with Large Language Models (May 2023)
- Dr.ICL: Demonstration-Retrieved In-context Learning (May 2023)
- Probing in Context: Toward Building Robust Classifiers via Probing Large Language Models (May 2023)
- Skill-Based Few-Shot Selection for In-Context Learning (May 2023)
- Exploring Chain-of-Thought Style Prompting for Text-to-SQL (May 2023)
- Enhancing Chat Language Models by Scaling High-quality Instructional Conversations (May 2023)
- On Learning to Summarize with Large Language Models as References (May 2023)
- Learning to Generate Novel Scientific Directions with Contextualized Literature-based Discovery (May 2023)
- Active Learning Principles for In-Context Learning with Large Language Models (May 2023)
- Two Failures of Self-Consistency in the Multi-Step Reasoning of LLMs (May 2023)
- Improving Factuality and Reasoning in Language Models through Multiagent Debate (May 2023)
- ChatCoT: Tool-Augmented Chain-of-Thought Reasoning on Chat-based Large Language Models (May 2023)
- WikiChat: A Few-Shot LLM-Based Chatbot Grounded with Wikipedia (May 2023)
- Query Rewriting for Retrieval-Augmented Large Language Models (May 2023)
- Discrete Prompt Optimization via Constrained Generation for Zero-shot Re-ranker (May 2023)
- Element-aware Summarization with Large Language Models: Expert-aligned Evaluation and Chain-of-Thought Method (May 2023)
- Prompting and Evaluating Large Language Models for Proactive Dialogues: Clarification, Target-guided, and Non-collaboration (May 2023)
- Prompt-Based Monte-Carlo Tree Search for Goal-Oriented Dialogue Policy Planning (May 2023)
- Mitigating Language Model Hallucination with Interactive Question-Knowledge Alignment (May 2023)
- Making Language Models Better Tool Learners with Execution Feedback (May 2023)
- Text-to-SQL Error Correction with Language Models of Code (May 2023)
- Decomposed Prompting for Machine Translation Between Related Languages using Large Language Models (May 2023)
- SPARSEFIT: Few-shot Prompting with Sparse Fine-tuning for Jointly Generating Predictions and Natural Language Explanations (May 2023)
- "According to ..." Prompting Language Models Improves Quoting from Pre-Training Data (May 2023)
- Prompt-based methods may underestimate large language models' linguistic generalizations (May 2023)
- Chain of Knowledge: A Framework for Grounding Large Language Models with Structured Knowledge Bases (May 2023)
- Automated Few-shot Classification with Instruction-Finetuned Language Models (May 2023)
- Enhancing Few-shot Text-to-SQL Capabilities of Large Language Models: A Study on Prompt Design Strategies (May 2023)
- MvP: Multi-view Prompting Improves Aspect Sentiment Tuple Prediction (May 2023)
- Learning Interpretable Style Embeddings via Prompting LLMs (May 2023)
- Enhancing Small Medical Learners with Privacy-preserving Contextual Prompting (May 2023)
- A Benchmark on Extremely Weakly Supervised Text Classification: Reconcile Seed Matching and Prompting Approaches (May 2023)
- This Prompt is Measuring <MASK>: Evaluating Bias Evaluation in Language Models (May 2023)
- Enhancing Cross-lingual Natural Language Inference by Soft Prompting with Multilingual Verbalizer (May 2023)
- Evaluating Prompt-based Question Answering for Object Prediction in the Open Research Knowledge Graph (May 2023)
- Explaining How Transformers Use Context to Build Predictions (May 2023)
- PiVe: Prompting with Iterative Verification Improving Graph-based Generative Capability of LLMs (May 2023)
- PromptNER: A Prompting Method for Few-shot Named Entity Recognition via k Nearest Neighbor Search (May 2023)
- Logic-LM: Empowering Large Language Models with Symbolic Solvers for Faithful Logical Reasoning (May 2023)
- Enhancing Few-shot NER with Prompt Ordering based Data Augmentation (May 2023)
- Chain-of-thought prompting for responding to in-depth dialogue questions with LLM (May 2023)
- How to Prompt LLMs for Text-to-SQL: A Study in Zero-shot, Single-domain, and Cross-domain Settings (May 2023)
- Evaluation of medium-large Language Models at zero-shot closed book generative question answering (May 2023)
- Few-Shot Dialogue Summarization via Skeleton-Assisted Prompt Transfer (May 2023)
- Can NLP Models Correctly Reason Over Contexts that Break the Common Assumptions? (May 2023)
- Reasoning Implicit Sentiment with Chain-of-Thought Prompting (May 2023)
- Writing your own book: A method for going from closed to open book QA to improve robustness and performance of smaller LLMs (May 2023)
- AutoTrial: Prompting Language Models for Clinical Trial Design (May 2023)
- CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing (May 2023)
- Controlling the Extraction of Memorized Data from Large Language Models via Prompt-Tuning (May 2023)
- Prompting with Pseudo-Code Instructions (May 2023)
- TrueTeacher: Learning Factual Consistency Evaluation with Large Language Models (May 2023)
- Aligning Instruction Tasks Unlocks Large Language Models as Zero-Shot Relation Extractors (May 2023)
- Exploiting Biased Models to De-bias Text: A Gender-Fair Rewriting Model (May 2023)
- Learning In-context Learning for Named Entity Recognition (May 2023)
- Take a Break in the Middle: Investigating Subgoals towards Hierarchical Script Generation (May 2023)
- TEPrompt: Task Enlightenment Prompt Learning for Implicit Discourse Relation Recognition (May 2023)
- Large Language Models can be Guided to Evade AI-Generated Text Detection (May 2023)
 - Temporal Knowledge Graph Forecasting Without Knowledge Using In-Context Learning (opens in a new tab) (May 2023)
 - Prompting the Hidden Talent of Web-Scale Speech Models for Zero-Shot Task Generalization (opens in a new tab) (May 2023)
 - Think Outside the Code: Brainstorming Boosts Large Language Models in Code Generation (opens in a new tab) (May 2023)
 - Improving Language Model Negotiation with Self-Play and In-Context Learning from AI Feedback (opens in a new tab) (May 2023)
 - ConvXAI: Delivering Heterogeneous AI Explanations via Conversations to Support Human-AI Scientific Writing (opens in a new tab) (May 2023)
 - StructGPT: A General Framework for Large Language Model to Reason over Structured Data (opens in a new tab) (May 2023)
 - Towards Expert-Level Medical Question Answering with Large Language Models (opens in a new tab) (May 2023)
 - Large Language Models are Built-in Autoregressive Search Engines (opens in a new tab) (May 2023)
 - MsPrompt: Multi-step Prompt Learning for Debiasing Few-shot Event Detection (opens in a new tab) (May 2023)
 - Exploring the Impact of Layer Normalization for Zero-shot Neural Machine Translation (opens in a new tab) (May 2023)
 - SGP-TOD: Building Task Bots Effortlessly via Schema-Guided LLM Prompting (opens in a new tab) (May 2023)
 - Multi-modal Visual Understanding with Prompts for Semantic Information Disentanglement of Image (opens in a new tab) (May 2023)
 - Soft Prompt Decoding for Multilingual Dense Retrieval (opens in a new tab) (May 2023)
 - PaLM 2 Technical Report (opens in a new tab) (May 2023)
 - Are LLMs All You Need for Task-Oriented Dialogue? (April 2023)
 - HiPrompt: Few-Shot Biomedical Knowledge Fusion via Hierarchy-Oriented Prompting (April 2023)
 - Approximating Human Evaluation of Social Chatbots with Prompting (April 2023)
 - Automated Reading Passage Generation with OpenAI's Large Language Model (April 2023)
 - WebBrain: Learning to Generate Factually Correct Articles for Queries by Grounding on Large Web Corpus (April 2023)
 - Prompt Pre-Training with Twenty-Thousand Classes for Open-Vocabulary Visual Recognition (April 2023)
 - GPT detectors are biased against non-native English writers (April 2023)
 - Zero-Shot Next-Item Recommendation using Large Pretrained Language Models (April 2023)
 - Large Language Models as Master Key: Unlocking the Secrets of Materials Science with GPT (April 2023)
 - Efficiently Aligned Cross-Lingual Transfer Learning for Conversational Tasks using Prompt-Tuning (April 2023)
 - Better Language Models of Code through Self-Improvement (April 2023)
 - PromptORE -- A Novel Approach Towards Fully Unsupervised Relation Extraction (April 2023)
 - Assessing Language Model Deployment with Risk Cards (April 2023)
 - Enhancing Large Language Models with Climate Resources (March 2023)
 - BloombergGPT: A Large Language Model for Finance (March 2023)
 - Medical Intervention Duration Estimation Using Language-enhanced Transformer Encoder with Medical Prompts (March 2023)
 - Soft-prompt tuning to predict lung cancer using primary care free-text Dutch medical notes (March 2023)
 - TaskMatrix.AI: Completing Tasks by Connecting Foundation Models with Millions of APIs (March 2023)
 - Larger Probes Tell a Different Story: Extending Psycholinguistic Datasets Via In-Context Learning (March 2023)
 - Linguistically Informed ChatGPT Prompts to Enhance Japanese-Chinese Machine Translation: A Case Study on Attributive Clauses (March 2023)
 - Knowledge-augmented Frame Semantic Parsing with Hybrid Prompt-tuning (March 2023)
 - Debiasing Scores and Prompts of 2D Diffusion for Robust Text-to-3D Generation (March 2023)
 - Zero-shot Model Diagnosis (March 2023)
 - Prompting Large Language Models to Generate Code-Mixed Texts: The Case of South East Asian Languages (March 2023)
 - SPeC: A Soft Prompt-Based Calibration on Mitigating Performance Variability in Clinical Notes Summarization (March 2023)
 - Large Language Models and Simple, Stupid Bugs (March 2023)
 - Can Generative Pre-trained Transformers (GPT) Pass Assessments in Higher Education Programming Courses? (Mar 2023)
 - SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models (Mar 2023)
 - Large Language Models in the Workplace: A Case Study on Prompt Engineering for Job Type Classification (March 2023)
 - ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction (March 2023)
 - MathPrompter: Mathematical Reasoning using Large Language Models (March 2023)
 - Prompt-Based Learning for Thread Structure Prediction in Cybersecurity Forums (March 2023)
 - Choice Over Control: How Users Write with Large Language Models using Diegetic and Non-Diegetic Prompting (March 2023)
 - Prompting Large Language Models with Answer Heuristics for Knowledge-based Visual Question Answering (March 2023)
 - Soft Prompt Guided Joint Learning for Cross-Domain Sentiment Analysis (March 2023)
 - SpeechPrompt v2: Prompt Tuning for Speech Classification Tasks (March 2023)
 - Goal Driven Discovery of Distributional Differences via Language Descriptions (Feb 2023)
 - Navigating the Grey Area: Expressions of Overconfidence and Uncertainty in Language Models (Feb 2023)
 - TabGenie: A Toolkit for Table-to-Text Generation (Feb 2023)
 - SGL-PT: A Strong Graph Learner with Graph Prompt Tuning (Feb 2023)
 - Few-Shot Table-to-Text Generation with Prompt-based Adapter (Feb 2023)
 - Language Models Are Few-shot Learners for Prognostic Prediction (Feb 2023)
 - STA: Self-controlled Text Augmentation for Improving Text Classifications (Feb 2023)
 - Check Your Facts and Try Again: Improving Large Language Models with External Knowledge and Automated Feedback (Feb 2023)
 - How Generative AI models such as ChatGPT can be (Mis)Used in SPC Practice, Education, and Research? An Exploratory Study (Feb 2023)
 - Grimm in Wonderland: Prompt Engineering with Midjourney to Illustrate Fairytales (Feb 2023)
 - LabelPrompt: Effective Prompt-based Learning for Relation Classification (Feb 2023)
 - Language Model Crossover: Variation through Few-Shot Prompting (Feb 2023)
 - Prompt Tuning of Deep Neural Networks for Speaker-adaptive Visual Speech Recognition (Feb 2023)
 - The Capacity for Moral Self-Correction in Large Language Models (Feb 2023)
 - Prompting for Multimodal Hateful Meme Classification (Feb 2023)
 - PLACES: Prompting Language Models for Social Conversation Synthesis (Feb 2023)
 - Toolformer: Language Models Can Teach Themselves to Use Tools (Feb 2023)
 - Commonsense-Aware Prompting for Controllable Empathetic Dialogue Generation (Feb 2023)
 - Crawling the Internal Knowledge-Base of Language Models (Jan 2023)
 - Legal Prompt Engineering for Multilingual Legal Judgement Prediction (Dec 2022)
 - Investigating Prompt Engineering in Diffusion Models (Nov 2022)
 - Conversing with Copilot: Exploring Prompt Engineering for Solving CS1 Problems Using Natural Language (Oct 2022)
 - Piloting Copilot and Codex: Hot Temperature, Cold Prompts, or Black Magic? (Oct 2022)
 - Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering (Sep 2022)
 - Plot Writing From Pre-Trained Language Models (July 2022)
 - Survey of Hallucination in Natural Language Generation (Feb 2022)