classification-helper

npx machina-cli add skill dkyazzentwatwa/chatgpt-skills/classification-helper --openclaw
Files (1)
SKILL.md
869 B

Classification Helper

Train and evaluate classification models with automatic model selection.

Features

  • Auto Model Selection: Compare multiple classifiers
  • Hyperparameter Tuning: Grid/random search
  • Evaluation Metrics: Accuracy, precision, recall, F1, ROC-AUC
  • Cross-Validation: K-fold validation
  • Confusion Matrix: Detailed error analysis
  • Feature Importance: Top predictive features
  • Model Export: Save trained models

CLI Usage

python classification_helper.py --data train.csv --target class --test test.csv --output model.pkl

Dependencies

  • scikit-learn>=1.3.0
  • pandas>=2.0.0
  • numpy>=1.24.0
  • matplotlib>=3.7.0
  • seaborn>=0.12.0

Source

git clone https://github.com/dkyazzentwatwa/chatgpt-skills.git

Overview

Classification Helper automates training and evaluation of tabular classifiers with automatic model selection. It compares multiple models, tunes hyperparameters, and reports comprehensive metrics such as accuracy, precision, recall, F1, and ROC-AUC, plus cross-validation results, confusion matrices, feature importance, and model export.

How This Skill Works

The tool iterates over candidate classifiers, performs grid or random search for hyperparameters, and evaluates each model using cross-validated metrics. It surfaces feature importance and confusion matrices for error analysis and can export the best-performing model for deployment.
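The selection loop described above can be sketched in plain scikit-learn. This is a hypothetical illustration of the approach, not the tool's actual source; the candidate set, grids, and scoring metric are assumptions chosen for brevity.

```python
# Minimal sketch of auto model selection: grid-search each candidate,
# keep the one with the best cross-validated score.
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV

# Synthetic stand-in for train.csv.
X, y = make_classification(n_samples=300, n_features=10, random_state=0)

# Candidate models, each with a small hyperparameter grid.
candidates = {
    "logreg": (LogisticRegression(max_iter=1000), {"C": [0.1, 1.0, 10.0]}),
    "rf": (RandomForestClassifier(random_state=0), {"n_estimators": [50, 100]}),
}

best_name, best_model, best_score = None, None, -1.0
for name, (model, grid) in candidates.items():
    search = GridSearchCV(model, grid, cv=5, scoring="f1")
    search.fit(X, y)
    if search.best_score_ > best_score:
        best_name = name
        best_model = search.best_estimator_
        best_score = search.best_score_

print(best_name, round(best_score, 3))
```

Swapping `GridSearchCV` for `RandomizedSearchCV` gives the random-search variant mentioned in the feature list.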

When to Use It

  • You want to compare multiple classifiers quickly with automated model selection.
  • You need hyperparameter tuning via grid or random search.
  • You require robust evaluation metrics (accuracy, precision, recall, F1, ROC-AUC) and cross-validation.
  • You want detailed error analysis with a confusion matrix and feature importance.
  • You plan to export a trained model for deployment.

Quick Start

  1. Prepare train.csv with features and a target column, then run: python classification_helper.py --data train.csv --target class --test test.csv --output model.pkl
  2. Review the generated metrics, cross-validation results, and the confusion matrix.
  3. Deploy the best model from model.pkl and use it for predictions.

Best Practices

  • Start with a diverse set of classifiers (logistic regression, random forest, gradient boosting) to establish baselines.
  • Limit hyperparameter search space and use cross-validation to prevent overfitting.
  • Preprocess data (handle missing values, scaling) before modeling.
  • Validate top models on a held-out test set and compare metrics.
  • Use feature importance to interpret model decisions and guide feature engineering.
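The preprocessing advice above is easiest to follow with a scikit-learn Pipeline, so imputation and scaling are fit only on the training folds during cross-validation and cannot leak test-fold statistics. A minimal sketch, using an assumed median-impute-then-scale setup:

```python
# Impute missing values, scale, then classify -- all inside one Pipeline.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.impute import SimpleImputer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

X, y = make_classification(n_samples=300, n_features=10, random_state=0)
rng = np.random.default_rng(0)
X[rng.random(X.shape) < 0.05] = np.nan  # inject some missing values

pipe = Pipeline([
    ("impute", SimpleImputer(strategy="median")),
    ("scale", StandardScaler()),
    ("clf", LogisticRegression(max_iter=1000)),
])

scores = cross_val_score(pipe, X, y, cv=5)
print(round(scores.mean(), 3))
```

Because the imputer and scaler are pipeline steps, each CV fold re-fits them on its own training split, which is exactly the overfitting safeguard the best practices call for.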

Example Use Cases

  • Email spam detection on tabular features.
  • Customer churn prediction from user activity data.
  • Credit risk scoring using borrower attributes.
  • Medical diagnostic support with structured health data.
  • Fraud detection on transaction records.
