Fraud ML Engine
End-to-end fraud detection ML system combining feature engineering pipelines, multi-family model benchmarking (XGBoost, LightGBM, deep learning, anomaly detection), SHAP/LIME explainability, and production real-time scoring API.
Evidence & Verification
Metrics and claims on this page are tied to the linked artifacts below (repository docs, experiment outputs, and deployment pages when available).
Tags
Technologies
Overview
Fraud ML Engine is a standalone system focused on transaction-side fraud detection and scoring. It distills production lessons from 3+ years of fraud operations at ITSEC Asia and BRI into an implementation that balances model performance, operational reliability, and analyst explainability.
Feature Engineering
The feature layer captures transaction risk signals, velocity features across time windows, merchant and device behavioral indicators, and account-level anomaly context. The pipeline is designed for low-latency transformation so inference remains suitable for real-time authorization workflows.
Model Zoo
The engine benchmarks four model families in one framework:
- Gradient boosting (XGBoost, LightGBM) for strong tabular baselines
- Deep sequence models (LSTM autoencoder variants) for temporal anomaly cues
- Classical anomaly models (Isolation Forest) for unsupervised risk discovery
- Hybrid scoring ensembles for robustness across shifting fraud patterns
Explainability
SHAP and LIME are integrated for analyst-facing decision support. Each score can be decomposed into feature-level contributions, enabling compliance reporting, investigation triage, and faster feedback loops between fraud analysts and model owners.
Serving Architecture
The system exposes FastAPI endpoints for real-time scoring and health/status monitoring. The serving layer is tuned for sub-50ms scoring paths under typical production payload sizes.
API Surface
The engine exposes structured endpoints for integration:
POST /api/v1/predict— single transaction scoringPOST /api/v1/batch_predict— batch scoring for historical analysisPOST /api/v1/explain/{id}— SHAP/LIME explanation for a specific predictionPOST /api/v1/benchmark/run— model comparison benchmark executionGET /api/v1/benchmark/results— benchmark result retrievalGET /api/v1/model_info— active model metadata and version info
EDA Dashboard
An interactive exploratory data analysis dashboard built with Plotly Dash provides visual diagnostics for fraud pattern investigation, feature distribution analysis, and model performance monitoring. A separate Streamlit app serves the SHAP/LIME explainability UI. Together these support the feedback loop between fraud analysts and model development.
Relationship
This project complements Fraudware Analyzer: Fraud ML Engine covers transaction-side fraud scoring, while Fraudware Analyzer focuses on malware-side threat analysis and static reverse-engineering workflows.