cohort-analysis
Scannednpx machina-cli add skill phuryn/pm-skills/cohort-analysis --openclawCohort Analysis & Retention Explorer
Purpose
Analyze user engagement and retention patterns by cohort to identify trends in user behavior, feature adoption, and long-term engagement. Combine quantitative insights with qualitative research recommendations.
How It Works
Step 1: Read and Validate Your Data
- Accept CSV, Excel, or JSON data files with user cohort information
- Verify data structure: cohort identifier, time periods, engagement metrics
- Check for missing values and data quality issues
- Summarize key statistics (cohort sizes, date ranges, metrics available)
Step 2: Generate Quantitative Analysis
- Calculate cohort retention rates and engagement trends
- Identify retention curves, drop-off patterns, and anomalies
- Compute feature adoption rates across cohorts
- Calculate month-over-month or period-over-period changes
- Generate Python analysis scripts using pandas and numpy if requested
Step 3: Create Visualizations
- Generate retention heatmaps (cohorts vs. time periods)
- Create line charts showing cohort progression
- Build comparison charts for feature adoption
- Visualize drop-off points and engagement trends
- Output as interactive charts or static images
Step 4: Identify Insights & Patterns
- Spot one or more significant patterns:
- Early churn in specific cohorts
- Late-stage engagement changes
- Feature adoption clusters
- Seasonal or temporal trends
- Highlight surprising findings and deviations
- Compare cohort performance to establish baselines
Step 5: Suggest Follow-Up Research
- Recommend qualitative research methods:
- Targeted user interviews with churning users
- Feature usage surveys with engaged cohorts
- Session replays of key interaction patterns
- Win/loss analysis for high vs. low retention cohorts
- Design follow-up quantitative studies
- Suggest A/B tests or feature experiments
Usage Examples
Example 1: Upload CSV Data
Upload cohort_engagement.csv with columns: cohort_month, weeks_active,
user_id, feature_x_usage, engagement_score
Request: "Analyze retention patterns and identify why Q4 2025 cohorts
underperform compared to Q3"
Example 2: Describe Data Format
"I have monthly user cohorts from Jan-Dec 2025. Each row shows:
cohort date, user ID, purchase frequency, and support tickets.
Analyze which cohorts show best long-term retention."
Example 3: Feature Adoption Analysis
Upload feature_usage.xlsx with cohort adoption data.
Request: "Compare adoption curves for our new feature across cohorts.
Which cohorts adopted fastest? Any patterns?"
Key Capabilities
- Data Reading: Import CSV, Excel, JSON, SQL query results
- Retention Analysis: Calculate and visualize retention rates over time
- Cohort Comparison: Compare metrics across cohort groups
- Anomaly Detection: Flag unusual patterns or drop-offs
- Python Scripts: Generate reusable analysis code for ongoing analysis
- Visualizations: Create heatmaps, charts, and interactive dashboards
- Research Design: Suggest targeted follow-up studies and interview approaches
- Statistical Summary: Provide quantitative metrics and correlation analysis
Tips for Best Results
- Include time dimension: Provide data across multiple time periods
- Define cohort clearly: Make cohort grouping explicit (signup month, feature launch date, etc.)
- Provide context: Explain product changes, launches, or events during the period
- Multiple metrics: Include retention, engagement, feature usage, revenue, etc.
- Sufficient data: At least 3-4 cohorts for meaningful pattern identification
- Request specific output: Ask for visualizations, Python scripts, or research recommendations
Output Format
You'll receive:
- Data Summary: Cohort overview and data quality assessment
- Quantitative Findings: Key metrics, retention rates, and trend analysis
- Visualizations: Charts showing retention curves, adoption patterns
- Pattern Identification: 2-3 significant insights from the data
- Research Recommendations: Specific qualitative and quantitative follow-ups
- Analysis Scripts (if requested): Python code for reproducible analysis
- Next Steps: Prioritized actions based on findings
Further Reading
Source
git clone https://github.com/phuryn/pm-skills/blob/main/pm-data-analytics/skills/cohort-analysis/SKILL.mdView on GitHub Overview
This skill analyzes user engagement by cohort to uncover retention trends, feature adoption, and segment-level insights. It combines quantitative metrics with qualitative recommendations to guide product optimization and strategic decisions.
How This Skill Works
It ingests cohort data from CSV, Excel, or JSON, validates structure and quality, then computes retention rates and feature adoption metrics. It generates visualizations such as retention heatmaps and line charts, highlights patterns and anomalies, and provides actionable insights along with optional reusable Python scripts.
When to Use It
- Analyze retention patterns by cohort and identify long-term engagement shifts
- Study how different cohorts adopt new features over time
- Investigate churn patterns across cohorts and time periods
- Spot engagement trends and sudden shifts tied to product changes
- Compare cohort performance against baselines to measure impact of changes
Quick Start
- Step 1: Upload your cohort data (CSV, Excel, or JSON) and ensure cohort_id, time_period, and engagement metrics are present
- Step 2: Run the quantitative analysis to compute retention and adoption metrics and generate visuals
- Step 3: Review insights, identify patterns, and design follow-up research or experiments
Best Practices
- Include a clearly defined time dimension and multiple time periods
- Explicitly define cohort criteria (signup month, launch date, etc.)
- Provide context on product changes, launches, or events during the period
- Track multiple metrics (retention, engagement, feature usage, revenue)
- Use at least 3-4 cohorts for reliable pattern detection
Example Use Cases
- Example 1: Upload cohort_engagement.csv and analyze retention vs. time periods to identify underperforming cohorts
- Example 2: Describe data format and determine which cohorts show the best long-term retention
- Example 3: Compare feature adoption curves across cohorts to identify fastest adopters and patterns
- Example 4: Visualize cohort heatmaps to spot early churn clusters and late-stage engagement changes
- Example 5: Propose follow-up qualitative interviews with churning cohorts based on insights