code-review-excellence
Install this skill
npx skills add wshobson/agentsWorks across Claude Code, Cursor, Codex, Copilot & Antigravity
The code-review-excellence skill shifts the focus of pull request evaluations from surface-level gatekeeping to a collaborative, educational process. Instead of treating reviews as mere error-spotting, this skill guides you through a structured, multi-phase assessment. You begin by gathering context, move into high-level architectural evaluation, and conclude with specific, actionable feedback that avoids subjective nitpicking. By employing questioning techniques rather than dictatorial commands, this skill helps improve overall code maintainability, security, and performance. It emphasizes identifying logic gaps and architectural flaws while delegating trivial formatting tasks to automated tools. This methodical approach fosters a culture of technical growth, ensuring that every review cycle serves as an opportunity for team knowledge sharing and long-term codebase health rather than simply a barrier to merging new features.
When to Use This Skill
- β’Onboarding junior developers through detailed, educational feedback cycles
- β’Standardizing PR expectations across distributed engineering squads
- β’Performing deep-dive architectural assessments on large feature branches
- β’Reducing the feedback loop duration by prioritizing high-impact logic flaws
How to Invoke This Skill
Example prompts that trigger this skill in Claude Code, Cursor, or Antigravity:
- βReview this pull request and provide constructive feedback
- βHelp me audit this code for security vulnerabilities
- βCritique my implementation approach for architectural consistency
- βDraft a set of review comments that prioritize learning
- βHow can I suggest improvements without sounding judgmental?
- βCheck this branch against our performance and security standards
Pro Tips
- π‘**Prioritize Feedback**: Focus on critical issues (bugs, security, architecture) first, addressing minor suggestions or style preferences afterwards or via automated tools.
- π‘**Frame Feedback as Questions**: Instead of declarative statements like 'This is wrong,' ask 'Have you considered X?' or 'What happens if Y occurs?' to encourage critical thinking and learning.
- π‘**Balance Praise and Critique**: Always acknowledge good solutions, well-written sections, or effective problem-solving to maintain morale and reinforce positive coding habits.
What this skill does
- β’Conducts structured multi-phase PR analysis covering architecture, security, and performance
- β’Translates subjective feedback into objective, actionable questions
- β’Categorizes issues into critical blockers versus optional, non-blocking suggestions
- β’Enforces separation between automated linting concerns and manual logic verification
- β’Provides templates for evaluating security vulnerabilities and test coverage
When not to use it
- βWhen immediate, high-priority hotfixes require bypassing standard peer review
- βFor projects lacking automated linters or CI/CD pipelines where manual formatting checks are still necessary
Example workflow
- Review the linked issue and PR description for context
- Assess if the PR size is manageable; request splitting if it exceeds 400 lines
- Perform a high-level scan of the architectural design and file organization
- Analyze code logic line-by-line for edge cases and security risks
- Generate a summary response using collaborative, inquiry-based language
- Apply appropriate status labels like Approved or Request Changes
Prerequisites
- βActive pull request or branch to review
- βExisting automated linter (Prettier/Black) in the repository
- βDefined team coding standards
Pitfalls & limitations
- !Over-commenting on trivial formatting that should be handled by automation
- !Focusing on developer style preferences rather than functional correctness
- !Providing overly vague feedback like 'this looks wrong' without justification
FAQ
How it compares
Generic prompts often focus on basic error hunting; this skill structures the review process into a professional, growth-oriented workflow that treats the reviewer as a mentor.
π Full skill instructions β original source: wshobson/agents
Transform code reviews from gatekeeping to knowledge sharing through constructive feedback, systematic analysis, and collaborative improvement.
## When to Use This Skill
- Reviewing pull requests and code changes
- Establishing code review standards for teams
- Mentoring junior developers through reviews
- Conducting architecture reviews
- Creating review checklists and guidelines
- Improving team collaboration
- Reducing code review cycle time
- Maintaining code quality standards
## Core Principles
### 1. The Review Mindset
**Goals of Code Review:**
- Catch bugs and edge cases
- Ensure code maintainability
- Share knowledge across team
- Enforce coding standards
- Improve design and architecture
- Build team culture
**Not the Goals:**
- Show off knowledge
- Nitpick formatting (use linters)
- Block progress unnecessarily
- Rewrite to your preference
### 2. Effective Feedback
**Good Feedback is:**
- Specific and actionable
- Educational, not judgmental
- Focused on the code, not the person
- Balanced (praise good work too)
- Prioritized (critical vs nice-to-have)
β Bad: "This is wrong."
β
Good: "This could cause a race condition when multiple users
access simultaneously. Consider using a mutex here."
β Bad: "Why didn't you use X pattern?"
β
Good: "Have you considered the Repository pattern? It would
make this easier to test. Here's an example: [link]"
β Bad: "Rename this variable."
β
Good: "[nit] Consider userCount instead of uc for
clarity. Not blocking if you prefer to keep it."### 3. Review Scope
**What to Review:**
- Logic correctness and edge cases
- Security vulnerabilities
- Performance implications
- Test coverage and quality
- Error handling
- Documentation and comments
- API design and naming
- Architectural fit
**What Not to Review Manually:**
- Code formatting (use Prettier, Black, etc.)
- Import organization
- Linting violations
- Simple typos
## Review Process
### Phase 1: Context Gathering (2-3 minutes)
Before diving into code, understand:
1. Read PR description and linked issue
2. Check PR size (>400 lines? Ask to split)
3. Review CI/CD status (tests passing?)
4. Understand the business requirement
5. Note any relevant architectural decisions### Phase 2: High-Level Review (5-10 minutes)
1. **Architecture & Design**
- Does the solution fit the problem?
- Are there simpler approaches?
- Is it consistent with existing patterns?
- Will it scale?
2. **File Organization**
- Are new files in the right places?
- Is code grouped logically?
- Are there duplicate files?
3. **Testing Strategy**
- Are there tests?
- Do tests cover edge cases?
- Are tests readable?### Phase 3: Line-by-Line Review (10-20 minutes)
For each file:
1. **Logic & Correctness**
- Edge cases handled?
- Off-by-one errors?
- Null/undefined checks?
- Race conditions?
2. **Security**
- Input validation?
- SQL injection risks?
- XSS vulnerabilities?
- Sensitive data exposure?
3. **Performance**
- N+1 queries?
- Unnecessary loops?
- Memory leaks?
- Blocking operations?
4. **Maintainability**
- Clear variable names?
- Functions doing one thing?
- Complex code commented?
- Magic numbers extracted?### Phase 4: Summary & Decision (2-3 minutes)
1. Summarize key concerns
2. Highlight what you liked
3. Make clear decision:
- β
Approve
- π¬ Comment (minor suggestions)
- π Request Changes (must address)
4. Offer to pair if complex## Review Techniques
### Technique 1: The Checklist Method
## Security Checklist
- [ ] User input validated and sanitized
- [ ] SQL queries use parameterization
- [ ] Authentication/authorization checked
- [ ] Secrets not hardcoded
- [ ] Error messages don't leak info
## Performance Checklist
- [ ] No N+1 queries
- [ ] Database queries indexed
- [ ] Large lists paginated
- [ ] Expensive operations cached
- [ ] No blocking I/O in hot paths
## Testing Checklist
- [ ] Happy path tested
- [ ] Edge cases covered
- [ ] Error cases tested
- [ ] Test names are descriptive
- [ ] Tests are deterministic### Technique 2: The Question Approach
Instead of stating problems, ask questions to encourage thinking:
β "This will fail if the list is empty."
β
"What happens if items is an empty array?"
β "You need error handling here."
β
"How should this behave if the API call fails?"
β "This is inefficient."
β
"I see this loops through all users. Have we considered
the performance impact with 100k users?"### Technique 3: Suggest, Don't Command
## Use Collaborative Language
β "You must change this to use async/await"
β
"Suggestion: async/await might make this more readable:
typescript
async function fetchUser(id: string) {
const user = await db.query('SELECT * FROM users WHERE id = ?', id);
return user;
}
What do you think?"
β "Extract this into a function"
β
"This logic appears in 3 places. Would it make sense to
extract it into a shared utility function?"
### Technique 4: Differentiate Severity
Use labels to indicate priority:
π΄ [blocking] - Must fix before merge
π‘ [important] - Should fix, discuss if disagree
π’ [nit] - Nice to have, not blocking
π‘ [suggestion] - Alternative approach to consider
π [learning] - Educational comment, no action needed
π [praise] - Good work, keep it up!
Example:
"π΄ [blocking] This SQL query is vulnerable to injection.
Please use parameterized queries."
"π’ [nit] Consider renaming data to userData for clarity."
"π [praise] Excellent test coverage! This will catch edge cases."## Language-Specific Patterns
### Python Code Review
# Check for Python-specific issues
# β Mutable default arguments
def add_item(item, items=[]): # Bug! Shared across calls
items.append(item)
return items
# β
Use None as default
def add_item(item, items=None):
if items is None:
items = []
items.append(item)
return items
# β Catching too broad
try:
result = risky_operation()
except: # Catches everything, even KeyboardInterrupt!
pass
# β
Catch specific exceptions
try:
result = risky_operation()
except ValueError as e:
logger.error(f"Invalid value: {e}")
raise
# β Using mutable class attributes
class User:
permissions = [] # Shared across all instances!
# β
Initialize in __init__
class User:
def __init__(self):
self.permissions = []### TypeScript/JavaScript Code Review
// Check for TypeScript-specific issues
// β Using any defeats type safety
function processData(data: any) { // Avoid any
return data.value;
}
// β
Use proper types
interface DataPayload {
value: string;
}
function processData(data: DataPayload) {
return data.value;
}
// β Not handling async errors
async function fetchUser(id: string) {
const response = await fetch(/api/users/${id});
return response.json(); // What if network fails?
}
// β
Handle errors properly
async function fetchUser(id: string): Promise<User> {
try {
const response = await fetch(/api/users/${id});
if (!response.ok) {
throw new Error(HTTP ${response.status});
}
return await response.json();
} catch (error) {
console.error('Failed to fetch user:', error);
throw error;
}
}
// β Mutation of props
function UserProfile({ user }: Props) {
user.lastViewed = new Date(); // Mutating prop!
return <div>{user.name}</div>;
}
// β
Don't mutate props
function UserProfile({ user, onView }: Props) {
useEffect(() => {
onView(user.id); // Notify parent to update
}, [user.id]);
return <div>{user.name}</div>;
}## Advanced Review Patterns
### Pattern 1: Architectural Review
When reviewing significant changes:
1. **Design Document First**
- For large features, request design doc before code
- Review design with team before implementation
- Agree on approach to avoid rework
2. **Review in Stages**
- First PR: Core abstractions and interfaces
- Second PR: Implementation
- Third PR: Integration and tests
- Easier to review, faster to iterate
3. **Consider Alternatives**
- "Have we considered using [pattern/library]?"
- "What's the tradeoff vs. the simpler approach?"
- "How will this evolve as requirements change?"### Pattern 2: Test Quality Review
// β Poor test: Implementation detail testing
test('increments counter variable', () => {
const component = render(<Counter />);
const button = component.getByRole('button');
fireEvent.click(button);
expect(component.state.counter).toBe(1); // Testing internal state
});
// β
Good test: Behavior testing
test('displays incremented count when clicked', () => {
render(<Counter />);
const button = screen.getByRole('button', { name: /increment/i });
fireEvent.click(button);
expect(screen.getByText('Count: 1')).toBeInTheDocument();
});
// Review questions for tests:
// - Do tests describe behavior, not implementation?
// - Are test names clear and descriptive?
// - Do tests cover edge cases?
// - Are tests independent (no shared state)?
// - Can tests run in any order?### Pattern 3: Security Review
## Security Review Checklist
### Authentication & Authorization
- [ ] Is authentication required where needed?
- [ ] Are authorization checks before every action?
- [ ] Is JWT validation proper (signature, expiry)?
- [ ] Are API keys/secrets properly secured?
### Input Validation
- [ ] All user inputs validated?
- [ ] File uploads restricted (size, type)?
- [ ] SQL queries parameterized?
- [ ] XSS protection (escape output)?
### Data Protection
- [ ] Passwords hashed (bcrypt/argon2)?
- [ ] Sensitive data encrypted at rest?
- [ ] HTTPS enforced for sensitive data?
- [ ] PII handled according to regulations?
### Common Vulnerabilities
- [ ] No eval() or similar dynamic execution?
- [ ] No hardcoded secrets?
- [ ] CSRF protection for state-changing operations?
- [ ] Rate limiting on public endpoints?## Giving Difficult Feedback
### Pattern: The Sandwich Method (Modified)
Traditional: Praise + Criticism + Praise (feels fake)
Better: Context + Specific Issue + Helpful Solution
Example:
"I noticed the payment processing logic is inline in the
controller. This makes it harder to test and reuse.
[Specific Issue]
The calculateTotal() function mixes tax calculation,
discount logic, and database queries, making it difficult
to unit test and reason about.
[Helpful Solution]
Could we extract this into a PaymentService class? That
would make it testable and reusable. I can pair with you
on this if helpful."### Handling Disagreements
When author disagrees with your feedback:
1. **Seek to Understand**
"Help me understand your approach. What led you to
choose this pattern?"
2. **Acknowledge Valid Points**
"That's a good point about X. I hadn't considered that."
3. **Provide Data**
"I'm concerned about performance. Can we add a benchmark
to validate the approach?"
4. **Escalate if Needed**
"Let's get [architect/senior dev] to weigh in on this."
5. **Know When to Let Go**
If it's working and not a critical issue, approve it.
Perfection is the enemy of progress.## Best Practices
1. **Review Promptly**: Within 24 hours, ideally same day
2. **Limit PR Size**: 200-400 lines max for effective review
3. **Review in Time Blocks**: 60 minutes max, take breaks
4. **Use Review Tools**: GitHub, GitLab, or dedicated tools
5. **Automate What You Can**: Linters, formatters, security scans
6. **Build Rapport**: Emoji, praise, and empathy matter
7. **Be Available**: Offer to pair on complex issues
8. **Learn from Others**: Review others' review comments
## Common Pitfalls
- **Perfectionism**: Blocking PRs for minor style preferences
- **Scope Creep**: "While you're at it, can you also..."
- **Inconsistency**: Different standards for different people
- **Delayed Reviews**: Letting PRs sit for days
- **Ghosting**: Requesting changes then disappearing
- **Rubber Stamping**: Approving without actually reviewing
- **Bike Shedding**: Debating trivial details extensively
## Templates
### PR Review Comment Template
## Summary
[Brief overview of what was reviewed]
## Strengths
- [What was done well]
- [Good patterns or approaches]
## Required Changes
π΄ [Blocking issue 1]
π΄ [Blocking issue 2]
## Suggestions
π‘ [Improvement 1]
π‘ [Improvement 2]
## Questions
β [Clarification needed on X]
β [Alternative approach consideration]
## Verdict
β
Approve after addressing required changes## Resources
- **references/code-review-best-practices.md**: Comprehensive review guidelines
- **references/common-bugs-checklist.md**: Language-specific bugs to watch for
- **references/security-review-guide.md**: Security-focused review checklist
- **assets/pr-review-template.md**: Standard review comment template
- **assets/review-checklist.md**: Quick reference checklist
- **scripts/pr-analyzer.py**: Analyze PR complexity and suggest reviewers
How to Use This Skill Unit
Option A: Project-Specific (Recommended)
- Click "Download" above
- In your project, create the directory:
.agent/skills/code-review-excellence/ - Save the file as
SKILL.md - The agent will automatically discover the skill based on its description.
Option B: Global Installation (All Agents)
Save the file to these locations to make it available across all projects:
- Claude Code:
~/.claude/skills/wshobson/agents/code-review-excellence/SKILL.md - Cursor:
~/.cursor/skills/wshobson/agents/code-review-excellence/SKILL.md - Antigravity:
~/.gemini/antigravity/skills/wshobson/agents/code-review-excellence/SKILL.md
π Install with CLI:npx skills add wshobson/agents