醒醒

verify-tagDAIGT - One Place, All Data

educationcomputer scienceintermediatenlptexttext classification

3

已售 0
119.96MB

数据标识:D17220386347908267

发布时间:2024/07/27

以下为卖家选择提供的数据验证报告:

数据描述

This Dataset is a compilation of others public datasets on Kaggle related to LLM - DAIGT competition.

My objective here was to facilitate the use of this data in the process of augmenting the training data provided to us, in the original dataset for the competition.

As this is just a Data Preprocessing job, I'll link each repository here, if you want to check the original public Datasets. I'll provide also the tag kaggle_repo for each dataset, so you can see from each repository came each data on the final csv data:

LLM Generated Essays for the Detect AI Comp

ArguGPT

DAIGT | External Dataset

DAIGT Data | Llama 70b and Falcon 180b

LLM-generated essay using PaLM from Google Gen-AI

Hello, Claude! 1000 Essays from 7 Persuade prompts

Persuade Corpus 2.0

DAIGT Proper Train Dataset

Feedback Prize 3

data icon
DAIGT - One Place, All Data
3
已售 0
119.96MB
申请报告