Anthropic Interview Questions

Reliable AI systems, safety evaluation, agent behavior, and enterprise LLM workflows.

4 questions

How would you make an AI assistant safer for enterprise use?hard

Answer

Use permissioning, tool constraints, evals, audit logs, and clear refusal behavior.

Explanation

A strong answer discusses data boundaries, least-privilege tools, prompt-injection defenses, human escalation, and monitoring unsafe outputs.

Follow-upHow do you test prompt injection?

Answer

Clear task decomposition, constrained tools, state tracking, retries, and evaluation.

Explanation

Reliability improves when the agent has explicit plans, limited actions, validation checks, recovery paths, and observability for each tool call.

Follow-upWhen should an agent ask for help?

Answer

Create tasks that require locating, combining, and reasoning over distant information.

Explanation

Needle tests alone are not enough. Include summarization, contradiction detection, multi-hop retrieval, and realistic document workflows.

Follow-upWhat failure modes are common?

Answer

Minimize data, redact where possible, enforce access control, and log safely.

Explanation

Discuss PII handling, retention policy, tenant isolation, encryption, and review workflows for sensitive outputs.

Follow-upWhat should never be logged?