🎯

datanalysis-credit-risk

🎯Skill

from github/awesome-copilot

VibeIndex|
What it does
|

Credit risk data analysis skill from Awesome GitHub Copilot, a community-created collection of custom agents and instructions for enhancing GitHub Copilot across different domains and use cases.

Overview

A GitHub Copilot skill providing a complete data cleaning and variable screening pipeline for credit risk pre-loan modeling. It executes an 11-step process from raw data loading through organization-level analysis, abnormal period filtering, missing rate calculation, IV/PSI-based variable selection, null importance denoising, high-correlation removal, and cleaning report generation — all without modifying original data.

Key Features

  • 11-step automated pipeline: Executes data loading, organization sample analysis, OOS data separation, abnormal month filtering, missing rate calculation, high-missing feature removal, low-IV filtering, high-PSI removal, null importance denoising, high-correlation removal, and Excel report export in sequence.
  • Organization-level analysis: Computes per-organization sample counts and bad sample rates, supports out-of-sample (OOS) organization separation, and applies per-organization thresholds for IV distribution, PSI stability, and value ratio distribution.
  • Statistical variable screening: Removes features with missing rates above threshold (default 0.6), overall IV below 0.1, unstable PSI exceeding threshold across months/organizations, and high correlations based on original gain — each step configurable via parameters.
  • Null importance denoising: Uses label permutation with configurable tree count (n_estimators=100), depth (max_depth=5), and gain threshold (gain_threshold=50) to identify and remove noise features that don't contribute meaningful signal.
  • Non-destructive with full reporting: Executes each step independently without deleting original data, and exports a comprehensive Excel cleaning report containing details and statistics from all 11 processing steps.

Who is this for?

This skill is designed for data scientists and risk analysts working in credit risk modeling who need to preprocess raw lending data before building predictive models. It is especially valuable for teams handling multi-organization credit portfolios, where automated variable screening across organizations, time periods, and statistical thresholds is essential for building robust, stable credit scoring models.

📦

Same repository

github/awesome-copilot(314 items)

datanalysis-credit-risk

Installation

Vibe Index InstallInstalls to .claude/skills/ - auto-recognized by Claude Code
npx vibeindex add github/awesome-copilot --skill datanalysis-credit-risk
skills.sh Install⚠ Installs to .agents/skills/ - may not be auto-recognized by Claude Code
npx skills add github/awesome-copilot --skill datanalysis-credit-risk
Manual InstallCopy SKILL.md content and save to the path below
~/.claude/skills/datanalysis-credit-risk/SKILL.md

SKILL.md

6,757Installs
-
AddedMar 2, 2026

More from this repository10

🎯
git-commit🎯Skill

A community-curated collection of custom agents, prompts, instructions, and skills to enhance GitHub Copilot across different domains, languages, and use cases.

🎯
gh-cli🎯Skill

A community-curated collection of custom agents, prompts, instructions, and skills to enhance GitHub Copilot across different domains, languages, and use cases.

🎯
prd🎯Skill

Generates comprehensive Product Requirements Documents (PRDs) by systematically analyzing requirements and crafting detailed technical specifications

🎯
documentation-writer🎯Skill

Expert technical writer guided by the Diataxis framework, creating tutorials, how-to guides, reference docs, and explanation pages for software documentation.

🎯
excalidraw-diagram-generator🎯Skill

Generates Excalidraw diagrams from natural language descriptions, creating visual representations of processes, systems, relationships, and ideas.

🎯
refactor🎯Skill

A community-curated collection of custom agents, prompts, instructions, and skills to enhance GitHub Copilot across different domains, languages, and use cases.

🎯
java-springboot🎯Skill

Provides Spring Boot best practices covering project setup, dependency injection, REST API design, data access with JPA, security configuration, and testing strategies.

🎯
multi-stage-dockerfile🎯Skill

Creates optimized multi-stage Dockerfiles for any language or framework, separating builder and runtime stages to produce smaller, more secure container images.

🎯
create-readme🎯Skill

Creates README.md files for projects.

🎯
memory-merger🎯Skill

Merges mature lessons from domain memory files into instruction files, consolidating knowledge with minimal redundancy across global, user, and workspace scopes.