Bank Account Fraud Dataset Suite (NeurIPS 2022)

^^。

Bank Account Fraud Dataset Suite (NeurIPS 2022)

financebankingclassificationtabularml ethics

￥10

已售 0

532.2MB

数据标识：D17171475549508153

发布时间：2024/05/31

The Bank Account Fraud (BAF) suite of datasets has been published at NeurIPS 2022 and it comprises a total of 6 different synthetic bank account fraud tabular datasets. BAF is a realistic, complete, and robust test bed to evaluate novel and existing methods in ML and fair ML, and the first of its kind!

This suite of datasets is:

Realistic, based on a present-day real-world dataset for fraud detection;
Biased, each dataset has distinct controlled types of bias;
Imbalanced, this setting presents a extremely low prevalence of positive class;
Dynamic, with temporal data and observed distribution shifts;
Privacy preserving, to protect the identity of potential applicants we have applied differential privacy techniques (noise addition), feature encoding and trained a generative model (CTGAN).

Each dataset is composed of:

1 million instances;
30 realistic features used in the fraud detection use-case;
A column of “month”, providing temporal information about the dataset;
Protected attributes, (age group, employment status and % income).

Detailed information (datasheet) on the suite: https://github.com/feedzai/bank-account-fraud/blob/main/documents/datasheet.pdf

Check out the github repository for more resources and some example notebooks: https://github.com/feedzai/bank-account-fraud

Read the NeurIPS 2022 paper here: https://arxiv.org/abs/2211.13358

Learn more about Feedzai Research here: https://research.feedzai.com/

Please, use the following citation of BAF dataset suite

@article{jesusTurningTablesBiased2022,   title={Turning the {{Tables}}: {{Biased}}, {{Imbalanced}}, {{Dynamic Tabular Datasets}} for {{ML Evaluation}}},   author={Jesus, S{\'e}rgio and Pombal, Jos{\'e} and Alves, Duarte and Cruz, Andr{\'e} and Saleiro, Pedro and Ribeiro, Rita P. and Gama, Jo{\~a}o and Bizarro, Pedro},   journal={Advances in Neural Information Processing Systems},   year={2022} }

看了又看

验证报告

以下为卖家选择提供的数据验证报告：

Bank Account Fraud Dataset Suite (NeurIPS 2022)

￥10

已售 0

532.2MB

申请报告

Bank Account Fraud Dataset Suite (NeurIPS 2022)

关于典枢

下载与支持

服务协议

关于我们

官方公众号

技术交流群