以下为卖家选择提供的数据验证报告:
数据描述
This Dataset is a compilation of others public datasets on Kaggle related to LLM - DAIGT competition.
My objective here was to facilitate the use of this data in the process of augmenting the training data provided to us, in the original dataset for the competition.
As this is just a Data Preprocessing job, I'll link each repository here, if you want to check the original public Datasets. I'll provide also the tag kaggle_repo
for each dataset, so you can see from each repository came each data on the final csv data:
LLM Generated Essays for the Detect AI Comp
kaggle_repo
: 1- Dataset
ArguGPT
DAIGT | External Dataset
kaggle_repo
: 3- Dataset
DAIGT Data | Llama 70b and Falcon 180b
kaggle_repo
: 4- Dataset
LLM-generated essay using PaLM from Google Gen-AI
kaggle_repo
: 7- Dataset
Hello, Claude! 1000 Essays from 7 Persuade prompts
kaggle_repo
: 6- Dataset
Persuade Corpus 2.0
kaggle_repo
: 8- Dataset
DAIGT Proper Train Dataset
kaggle_repo
: 5- Dataset
Feedback Prize 3
kaggle_repo
: 9- Dataset

DAIGT - One Place, All Data
119.96MB
申请报告