Politeness in AI Prompts Affects Accuracy, Study Finds

Research reveals that prompt tone significantly influences LLM accuracy. Polite prompts may boost performance while impolite ones degrade it.

Polite language in prompts could change how large language models perform. A new analysis suggests that the tone of a user's request measurably affects the accuracy of the model's output.

How Tone Changes Model Output

Researchers tested several LLMs including GPT-4 and Claude 3. They used questions phrased with polite language, neutral wording and impolite demands. The results showed a clear pattern. Polite prompts often produced more correct answers. Rude or demanding phrasing led to higher error rates.

The effect was not uniform across all models. Some models showed a stronger sensitivity to tone. Others shifted their response style without large accuracy changes. The study controlled for question difficulty and domain.

One possible explanation involves the training data. Models learn from human interactions that often reward politeness. The data reflects social norms where polite requests receive more careful responses.

Why This Matters

Millions of people now use LLMs for work, research and daily tasks. Many users may not realize that their phrasing style can skew results. This is especially important in professional settings where accuracy matters. A bluntly worded query could lead to worse outcomes.

Developers who build applications on top of LLMs also need to account for this effect. A frontend that normalizes user prompts could improve consistency. The finding also raises equity concerns. Users from different cultural backgrounds may express politeness differently, potentially leading to biased AI performance.

Implications for Prompt Design

Prompt engineering guides often focus on structure and context. This research adds a new layer: tone. The polite penalty for mistreatment is small but measurable. Users who want the best results should consider adding polite phrasing such as please or thank you.

Future models could be trained to become more robust to tonal variations. For now, the advice is simple. A little courtesy may not just be good manners. It could also produce better answers.

Study Finds Politeness in AI Prompts Can Impact Model Accuracy

How Tone Changes Model Output

Why This Matters

Implications for Prompt Design

Related Articles

A simple prompt tweak can dramatically improve AI image quality

No-Code AI: Training LLaMA 2 Chatbots Becomes Accessible to Everyone

Intel Unveils Massive Memory AI Chip for Data Centers