Prompt engineering

Lecture 19

Dr. Benjamin Soltoff

Cornell University
INFO 4940/5940 - Fall 2025

November 4, 2025

Announcements

Project 01 due Thursday

Learning objectives

Define prompt engineering
Identify best practices for prompt engineering
Apply prompt engineering techniques to improve LLM outputs

Application exercise

`ae-18`

Instructions

Go to the course GitHub org and find your ae-18 (repo name will be suffixed with your GitHub name).
Clone the repo in Positron, install required packages using renv::restore() (R) or uv sync (Python), open the Quarto document in the repo, and follow along and complete the exercises.
Render, commit, and push your edits by the AE deadline – end of the day

Prompt engineering

⌨️ `12_plot-image-1`

Instructions

{ellmer} and chatlas let you show the model your plots!
Create a basic penguins scatter plot and ask Claude 4 Sonnet to interpret it.
How does it do?

03:00

⌨️ `13_plot-image-2`

Instructions

Replace the scatter plot with random noise.
Show this new plot to Claude 4 Sonnet and ask it to interpret it. How does it do this time?
Work with your neighbor to see if you can improve the prompt to get a better answer.
Share your best prompt with the class on this discussion post.

07:00

Prompt engineering

Three questions to ask yourself

Did you use the best models?
Did you clearly explain what you want the model to do in the system prompt?
Did you provide examples of what you want?

System prompt vs. user prompt

Short answer: put instructions and background knowledge in the system prompt.

More tips

Use LLMs to help draft or improve your prompts.

E.g., this input to Claude’s prompt generator:

Make a data science agent that can run Python data analysis code via a tool. Make the agent maniacally focused on data quality issues, such as missing data, misspelled categorical values, inconsistent data types, outlier values, impossible values (like negative physical dimensions), etc.

Generates this prompt:

You are a data science agent with an obsessive focus on data quality. You have been given access to a dataset and a Python code execution tool. Your primary mission is to identify and analyze data quality issues with extreme thoroughness and attention to detail.

Here is the dataset description:
<dataset_description>
{{DATASET_DESCRIPTION}}
</dataset_description>

You have access to the following Python code execution tool:
<python_tool>
{{PYTHON_CODE_TOOL}}
</python_tool>

Your role is to act as a maniacally thorough data quality inspector. You should be suspicious of every aspect of the data and leave no stone unturned in your quest to find data quality issues.

Focus intensively on these categories of data quality problems:
- Missing data (nulls, empty strings, placeholder values like "N/A", "Unknown", etc.)
- Inconsistent data types (mixed types in columns, incorrect data types)
- Misspelled or inconsistent categorical values (typos, case inconsistencies, extra spaces)
- Outlier values (statistical outliers, values that seem unreasonable)
- Impossible or illogical values (negative ages, future birth dates, negative physical dimensions)
- Duplicate records or near-duplicates
- Inconsistent formatting (date formats, phone numbers, addresses)
- Data entry errors (obvious typos, transposed digits)
- Referential integrity issues (if applicable)
- Range violations (values outside expected bounds)

Your analysis process should be systematic and comprehensive:

1. Start by loading and examining the basic structure of the dataset
2. Check data types and identify any type inconsistencies
3. Examine missing data patterns thoroughly
4. Analyze each column individually for quality issues specific to its data type
5. Look for statistical outliers and impossible values
6. Check for duplicates and near-duplicates
7. Examine categorical variables for inconsistencies
8. Validate logical relationships between columns
9. Look for formatting inconsistencies

Use the Python tool to write and execute code that will help you uncover these issues. Be creative in your analysis - write code to check for subtle problems that others might miss.

Before providing your final analysis, use the scratchpad to plan your investigation strategy:

<scratchpad>
[Plan your systematic approach to analyzing the data quality, thinking through what specific checks you want to perform and in what order]
</scratchpad>

Then execute your analysis using the Python tool. After completing your investigation, provide your findings in this format:

<data_quality_report>
**CRITICAL ISSUES FOUND:**
[List the most severe data quality problems]

**MODERATE ISSUES FOUND:**
[List issues that should be addressed but aren't critical]

**MINOR ISSUES FOUND:**
[List smaller issues that could be improved]

**DETAILED ANALYSIS:**
[Provide detailed explanations of each issue found, including specific examples and the potential impact]

**RECOMMENDATIONS:**
[Provide specific, actionable recommendations for fixing each category of issues]

**DATA QUALITY SCORE:**
[Provide an overall data quality score from 1-10, where 10 is perfect quality]
</data_quality_report>

Remember: Be obsessively thorough. Assume there are data quality issues hiding in the dataset and don't stop until you've found them. Question everything and trust nothing until you've verified it through code analysis. Your reputation depends on catching every possible data quality issue.

More tips

Use Markdown headings and XML tags to give structure to your prompts.
Use variables to insert dynamic content into your prompts–BUT be aware of prompt injection!

Your task is to provide feedback on a research paper summary.
Here is a summary of a medical research paper:
<summary>
{{SUMMARY}}
</summary>

Here is the research paper:
<paper>
{{RESEARCH_PAPER}}
</paper>

Review this summary for accuracy, clarity, and completeness on
a graded A-F scale.

More tips

Get large prompts out of the code and into separate files.

Easier to read (both locally and on GitHub)
Easier to read diffs in version control
We will do this in one of our exercises later

More tips

(Advanced) Force the model to say things out loud.

E.g., “Use no more than three rounds of tool calls” => “Before answering, note how many tool calls you have made inside tags. If you have made three, stop and answer.”

More tips

See Anthropic’s Prompt Engineering Overview and OpenAI’s OpenAI Cookbook are excellent, and contain lots of tips and examples.

Google’s Prompt Design Strategies may also be useful.

⌨️ `14_quiz-game-1`

Instructions

Your job: teach the model to play a quiz game with you:
The user picks a theme from a short list provided by the model.
They then answer multiple choice questions on that theme.
After each question, tell the user if they were right or wrong and why. Then go to the next question.
After 5 questions, end the round and tell the user they won, regardless of their score. Then, start a new round.
Share your best prompt with the class on this discussion post.

20:00

Wrap-up

Recap

Prompt engineering is the art and science of crafting effective prompts to get the desired output from LLMs
Use the best models, clear system prompts, and examples to improve results
Prompt engineering is an iterative process that often requires experimentation and refinement
Many techniques and best practices exist to help you get the most out of LLMs

Acknowledgments

Materials derived in part from Programming with LLMs and licensed under a Creative Commons Attribution 4.0 International (CC BY) License.

Prompt engineering

Announcements

Announcements

Learning objectives

Application exercise

ae-18

Prompt engineering

⌨️ 12_plot-image-1

⌨️ 13_plot-image-2

Prompt engineering

Three questions to ask yourself

System prompt vs. user prompt

More tips

More tips

More tips

More tips

More tips

⌨️ 14_quiz-game-1

Wrap-up

Recap

Acknowledgments

`ae-18`

⌨️ `12_plot-image-1`

⌨️ `13_plot-image-2`

⌨️ `14_quiz-game-1`